You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Integrate distributed inference into torchchat cli (pytorch#1327)
* add pp_dim, distributed, num_gpus, num_nodes as cmd line args
* add tp_dim
* add elastic_launch
* working, can now launch from cli
* Remove numpy < 2.0 pin to align with pytorch (pytorch#1301)
Fixpytorch#1296
Align with https://github.com/pytorch/pytorch/blame/main/requirements.txt#L5
* Update torchtune pin to 0.4.0-dev20241010 (pytorch#1300)
Co-authored-by: vmpuri <puri@meta.com>
* Unbreak gguf util CI job by fixing numpy version (pytorch#1307)
Setting numpy version to be the range required by gguf: https://github.com/ggerganov/llama.cpp/blob/master/gguf-py/pyproject.toml
* Remove apparently-unused import torchvision in model.py (pytorch#1305)
Co-authored-by: vmpuri <45368418+vmpuri@users.noreply.github.com>
* remove global var for tokenizer type + patch tokenizer to allow list of sequences
* make pp tp visible in interface
* Add llama 3.1 to dist_run.py
* [WIP] Move dist inf into its own generator
* Add initial generator interface to dist inference
* Added generate method and placeholder scheduler
* use prompt parameter for dist generation
* Enforce tp>=2
* Build tokenizer from TokenizerArgs
* Disable torchchat format + constrain possible models for distributed
* disable calling dist_run.py directly for now
* Restore original dist_run.py for now
* disable _maybe_parallelize_model again
* Reenable arg.model_name in dist_run.py
* Use singleton logger instead of print in generate
* Address PR comments; try/expect in launch_dist_inference; added comments
---------
Co-authored-by: lessw2020 <lessw@etrillium.com>
Co-authored-by: Mengwei Liu <larryliu0820@users.noreply.github.com>
Co-authored-by: vmpuri <45368418+vmpuri@users.noreply.github.com>
Co-authored-by: vmpuri <puri@meta.com>
Co-authored-by: Scott Wolchok <swolchok@meta.com>
0 commit comments