I'm finding that if I follow recent papers, my imatrices are quite large for a multilingual model, and it can take ~16 hrs just to generate an imatrix for the 2B model I am working on. I have a home lab with two Mac minis (older M1s with 16 GB of RAM) and my main laptop, an M3 Max MacBook Pro, and I would like to use them together. I know I can split the data and run each machine separately, but it is hard to eyeball what the split should look like (how large each dataset should be). Is there any guidance on how best to split up the task? Is this something that could be fairly easily automated in the llama-imatrix command? It feels like it, since I believe I can literally just …
Replies: 1 comment
In fact, currently you can only wait until the first chunk completes so you have a time estimate; you can use that to fiddle with the batch size and micro-batch size while watching resource draw. I understand this section of the code is being revised (see the other post that was just answered on a related topic).
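The "wait for the first chunk" estimate can be turned into a rough proportional split by hand. A minimal sketch, assuming you time one chunk on each machine first — `split_sizes` and all the numbers are hypothetical helpers for illustration, not anything in llama-imatrix itself:

```python
# Sketch: split a calibration set across machines in proportion to their
# measured throughput (chunks/hour, taken from each machine's first chunk).
# The function name and figures are hypothetical, not part of llama-imatrix.

def split_sizes(total_chunks: int, throughputs: list[float]) -> list[int]:
    """Divide total_chunks proportionally to per-machine throughput."""
    total = sum(throughputs)
    sizes = [int(total_chunks * t / total) for t in throughputs]
    sizes[0] += total_chunks - sum(sizes)  # hand rounding remainder to the fastest box
    return sizes

# Example: M3 Max plus two M1 minis, throughputs in chunks/hour.
print(split_sizes(1000, [60.0, 20.0, 20.0]))  # -> [600, 200, 200]
```

Each machine then runs llama-imatrix on its slice, and the partial imatrix files are merged afterward; if I remember right, llama-imatrix can combine several input imatrix files into one output, which is what makes this kind of split workable at all.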