Conversation

@akoumpa (Contributor) commented Oct 31, 2025

copy-pr-bot (bot) commented Oct 31, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa akoumpa requested a review from jgerh October 31, 2025 19:54
@akoumpa akoumpa changed the title feat: Update docs feat: Review docs Oct 31, 2025
@jgerh (Contributor) left a comment

Completed tech pubs review of files, providing copyedits, formatting recommendations, and style guide feedback.


@jgerh (Contributor) left a comment

Resubmitting review comments on selected files

@@ -1,16 +1,16 @@
# LLM Pre-Training with NeMo AutoModel

Review comment:

LLM Pre-Training with NeMo Automodel

# LLM Pre-Training with NeMo AutoModel

This guide covers **FineWeb** data preparation, **defining** a [NanoGPT‑style](https://github.com/KellerJordan/modded-nanogpt) model, and **launching and monitoring** a NeMo AutoModel pre‑training run.
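The data-preparation step mentioned above typically tokenizes documents and packs them into fixed-length training sequences. As a self-contained toy sketch of the packing idea only (not the actual AutoModel pipeline; function name, token IDs, and the EOS convention are all illustrative assumptions):

```python
def pack_sequences(token_streams, seq_len, eos_id):
    """Concatenate tokenized documents (separated by an EOS token) into one
    stream, then split it into fixed-length training samples, dropping the
    incomplete tail. This mirrors common LLM pre-training data prep."""
    stream = []
    for doc in token_streams:
        stream.extend(doc)
        stream.append(eos_id)  # mark the document boundary
    n_full = len(stream) // seq_len
    return [stream[i * seq_len:(i + 1) * seq_len] for i in range(n_full)]
```

For example, two toy documents `[1, 2, 3]` and `[4, 5]` with `seq_len=4` and `eos_id=0` yield the single sample `[1, 2, 3, 0]`; the leftover tokens are discarded.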

Review comment: delete this section

Comment on lines 5 to 12
In particular, it will show you how to:
1. [Install NeMo AutoModel from git](#1-environment-setup).
2. [Pre-process and tokenize the FineWeb dataset](#2-pre-process-the-fineweb-dataset).
3. [Introduction to the NeMo AutoModel training workflow](#3-introduction-to-the-nemo-automodel-training-workflow).
4. [Define your own model architecture](#4-define-your-own-model-architecture).
5. [Inspect and adjust the YAML configuration](#5-inspect-and-adjust-the-yaml-configuration).
6. [Launch training](#6-launch-training).
7. [Monitoring and evaluation](#7-monitoring-and-evaluation).
Review comment: delete this section


---

## 1. Environment setup

Suggested change
## Set Up Your Environment

- Add custom implementations to extend support to models not yet covered upstream.
- Provide optimized, extended or faster implementations for specific models while retaining the same AutoModel interface.

## Supported Hugging Face Auto classes

Suggested change
## Supported Hugging Face Auto Classes

| `AutoModelForSequenceClassification`| Sequence Classification | WIP | Early support; interfaces may change. |


## When a model listed on Hugging Face Hub may not be supported

Suggested change
## Troubleshooting Unsupported Models


## When a model listed on Hugging Face Hub may not be supported

Sometimes a model listed on the Hugging Face Hub may not support finetuning in NeMo AutoModel.

Suggested change
Sometimes a model listed on the Hugging Face hub may not support fine-tuning in NeMo Automodel.

| Model upper-bounds transformer version, requiring older version || Open a new GitHub issue. |
| Unsupported checkpoint format | OSError: `meta-llama/Llama-2-70b` does not appear to have a file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt or flax_model.msgpack. | Open a new GitHub issue or request from the model publisher to share a safetensors checkpoint. |

These cases typically stem from upstream packaging or dependency constraints and would surface similarly when using `transformers` directly. AutoModel mirrors the familiar load/finetune semantics.

Suggested change
These cases typically stem from upstream packaging or dependency constraints. You would encounter the same issues when using `transformers` directly, as Automodel mirrors the familiar load and fine-tune semantics.
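The OSError row in the troubleshooting table above lists the checkpoint file names that `from_pretrained` looks for. A toy helper (hypothetical, not AutoModel or `transformers` code) makes that check explicit against a repo's file listing:

```python
# Checkpoint file names quoted in the OSError message above.
LOADABLE = ("pytorch_model.bin", "model.safetensors", "tf_model.h5",
            "model.ckpt", "flax_model.msgpack")

def find_loadable_checkpoint(repo_files):
    """Return the first recognized checkpoint file in a repo listing, or None.

    Sharded checkpoints ship an index file instead, e.g.
    model.safetensors.index.json, so prefix matches on '<name>.index' count too.
    """
    for name in LOADABLE:
        for f in repo_files:
            if f == name or f.startswith(name + ".index"):
                return f
    return None
```

A repo containing only `config.json` and tokenizer files would return `None` here, which corresponds to the "unsupported checkpoint format" failure mode in the table.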


If you encounter any issue, you can try:

- Upgrade to a NeMo AutoModel release that supports the required `transformers` version.

Suggested change
- Upgrade to a NeMo Automodel release that supports the required `transformers` version.
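The troubleshooting table above also mentions models that upper-bound the `transformers` version. As a hedged illustration of that conflict (a toy helper with naive version parsing, not AutoModel code), one can compare an installed version against a model's pin:

```python
def version_tuple(v):
    """Naively parse 'X.Y.Z' into a comparable tuple of ints.

    Pre-release suffixes (e.g. 'rc1') are handled crudely by keeping only
    digits; real tools use proper version parsing instead.
    """
    parts = []
    for piece in v.split(".")[:3]:
        digits = "".join(ch for ch in piece if ch.isdigit())
        parts.append(int(digits or 0))
    return tuple(parts)

def satisfies_upper_bound(installed, upper_bound):
    """True if the installed version is below the model's exclusive upper bound."""
    return version_tuple(installed) < version_tuple(upper_bound)
```

For instance, with a hypothetical pin of `transformers < 4.44.0`, an installed 4.40.0 satisfies the bound while 4.46.1 does not; in the latter case the options are the ones listed above (upgrade to a compatible release, or open an issue).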

@jgerh (Contributor) left a comment

Resubmitting review comments on selected files. Continue to get this error: Unexpected token '<', "<!DOCTYPE "... is not valid JSON

@akoumpa akoumpa added the docs-only With great power comes great responsibility. label Nov 6, 2025
@akoumpa akoumpa requested a review from jgerh November 6, 2025 18:26
akoumpa and others added 4 commits November 6, 2025 10:26
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa akoumpa force-pushed the akoumparouli/update_docs2 branch from 2d86254 to 9d21f68 Compare November 6, 2025 18:26
@akoumpa (Contributor, Author) commented Nov 6, 2025

/ok to test 9d21f68

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa (Contributor, Author) commented Nov 6, 2025

/ok to test d79426d

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa (Contributor, Author) commented Nov 6, 2025

/ok to test f874220

@akoumpa akoumpa marked this pull request as ready for review November 6, 2025 20:53

Labels

docs-only With great power comes great responsibility.

3 participants