-
On my 2.2b model, my M3 Max laptop takes ~16 hours to produce the imatrix for a 96MB dataset (1k samples per language). For the 7.7b version I'll do next, it will be 5k samples per language (464MB), and I'm guesstimating about 320 hours on my laptop (~4x the model size times 5x the total tokens). I'm considering using a cloud service for this.

The thing is, I've only set threads on my MacBook so far, not batch size or ubatch. Before I migrate this task into a Docker environment for a cloud service (or however that ends up working), I want to know how to choose the batch/ubatch sizes to minimize the run time. Since the MacBook is busy running this imatrix one last time, I haven't experimented with it much there, but I do have a Mac mini. On the M1 mini, which can't fit the model it's building the imatrix for into memory (16GB total, 7.7b model), changing the batch size makes little difference: dropping from the default to 1k saved only about 15 minutes out of 60 hours. How can I optimize the run time?
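To make the question concrete, here's the kind of sweep I have in mind: a sketch, assuming a recent llama.cpp build where `llama-imatrix` accepts the common `-b`/`-ub` flags and `--chunks` to cap a short trial run. The model/dataset paths and the chunk count are placeholders for my setup.

```bash
#!/usr/bin/env bash
# Sketch: time short llama-imatrix trials across a grid of batch/ubatch sizes
# before committing to a multi-day run. Paths and --chunks value are placeholders.
MODEL=./model-7.7b-f16.gguf   # assumed model path
DATA=./calibration.txt        # assumed calibration text

for b in 2048 1024 512; do
  for ub in 512 256 128; do
    [ "$ub" -gt "$b" ] && continue  # ubatch can't exceed batch
    echo "=== -b $b -ub $ub ==="
    # --chunks caps the run to a small slice so each trial finishes quickly;
    # the relative timings should carry over to the full dataset.
    /usr/bin/time -p ./llama-imatrix -m "$MODEL" -f "$DATA" \
      -b "$b" -ub "$ub" --chunks 20 -o /tmp/imatrix-trial.dat
  done
done
```

The idea being that a handful of short trials would show where throughput plateaus, so I'd know whether tuning these is even worth it before renting cloud hardware.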
-
@bartowski1182 @TheBloke please forgive this cold call-out -- would either of you know?
-
FWIW: https://gist.github.com/robbiemu/4f53fd8d02eabbecbeb164ee0957e01b