Has anyone tried training the chat model with LLAMA-7B? #3230

alibabadoufu · 2023-04-14T08:38:49Z

alibabadoufu
Apr 14, 2023

I am wondering how much GPU memory needed for training the LLAMA-7B

My own experiment:

2 x V100 32GB running the LLAMA-7B model using lora implementation, I experienced the out of CUDA memory issue