OutOfMemoryError: CUDA out of memory in Google Colab

I am trying to fine-tune the “Llama-2-7b-chat-hf” model on the “mlabonne/guanaco-llama2-1k” dataset in Google Colab, using the T4 GPU runtime.
I am using the QLoRA technique to fine-tune the model. Below is the code I am using.