-
Here you suggest a batch size of 64 for 80 GB of GPU memory. Which model size (small, medium, large, xlarge) does this batch size apply to?
Answered by titu1994 on May 25, 2023
-
This is for the Large model (the default in the config) with 120M params.
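
If you are training on a GPU with less than 80 GB of memory, the batch size generally needs to be scaled down. The snippet below is a minimal sketch of one way to do that; the config path and the `model.train_ds.batch_size` key are assumptions based on typical NeMo-style YAML configs, not something taken from this thread.

```python
# Minimal sketch (not from the original answer): scale the suggested batch size
# down for GPUs with less than 80 GB of memory. The config path and the key
# model.train_ds.batch_size are assumptions based on typical NeMo-style YAML
# configs; adjust them to match your actual config.
import torch
from omegaconf import OmegaConf

cfg = OmegaConf.load("conf/conformer_large.yaml")  # hypothetical config path

if torch.cuda.is_available():
    total_mem_gb = torch.cuda.get_device_properties(0).total_memory / 1e9
    # Batch size 64 was suggested for an 80 GB GPU; scale roughly proportionally.
    cfg.model.train_ds.batch_size = max(1, int(64 * total_mem_gb / 80.0))

print("train batch size:", cfg.model.train_ds.batch_size)
```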