half() is not supported for quantized model when using FineTuned
I have fine tuned a Llama-3 model ( model_name=”meta-llama/Meta-Llama-3-8B”) in standard way per this notebook https://colab.research.google.com/drive/1Zmaceu65d7w4Tcd-cfnZRb6k_Tcv2b8g?usp=sharing