@leclem commented May 6, 2023

This pull request updates convert_to_hf_gptneox.py so that it supports conversion to the HF format in the special case of non-distributed training, i.e. n_stages = 1.

With this change, the conversion also works for checkpoints from training runs that used only 1 GPU.
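
To illustrate why n_stages = 1 is a special case, here is a minimal sketch. It is not the code in convert_to_hf_gptneox.py: the function `stage_contents`, its parameter `n_layer_per_stage`, and the assumed layout (embeddings on the first stage, final norm and output head on the last stage, an equal slice of layers on every stage) are hypothetical and used only for illustration.

```python
def stage_contents(stage: int, n_stages: int, n_layer_per_stage: int) -> dict:
    """Describe what one pipeline stage would hold under the ASSUMED layout.

    Hypothetical helper for illustration only; not taken from the script.
    """
    if n_stages < 1 or not 0 <= stage < n_stages:
        raise ValueError("stage must satisfy 0 <= stage < n_stages")
    first_layer = stage * n_layer_per_stage
    return {
        # Assumption: only the first stage stores the input embeddings.
        "embeddings": stage == 0,
        # Assumption: every stage stores a contiguous, equal-sized slice of layers.
        "layers": list(range(first_layer, first_layer + n_layer_per_stage)),
        # Assumption: only the last stage stores the final norm and output head.
        "final_norm_and_head": stage == n_stages - 1,
    }


# Distributed run: the first and last roles fall on different stages.
two_stage = [stage_contents(s, n_stages=2, n_layer_per_stage=2) for s in range(2)]

# Single-GPU run: the one stage plays both roles at once.
single_stage = stage_contents(0, n_stages=1, n_layer_per_stage=4)
```

Under these assumptions, with n_stages = 1 the only stage is both the first and the last one, so a converter has to read the embeddings, all layers, and the final norm and head from the same checkpoint.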