Unable to use Flash Attention to train Llama model
I am performing a training run using Flash Attention 2 with the Llama model.
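A minimal sketch of how Flash Attention 2 is typically enabled in the Hugging Face transformers API; the checkpoint name, dtype, and helper function below are illustrative assumptions, not from the original post. Flash Attention 2 requires a CUDA GPU, fp16/bf16 weights, and the `flash-attn` package.

```python
# Minimal sketch (assumptions: checkpoint name, dtype, helper are illustrative).
# Flash Attention 2 needs a CUDA GPU, fp16/bf16 weights, and flash-attn installed.

def attention_kwargs(use_flash: bool) -> dict:
    """Build the from_pretrained kwargs that select the attention backend."""
    return {"attn_implementation": "flash_attention_2" if use_flash else "sdpa"}

# Typical use (commented out: requires a GPU, flash-attn, and model weights):
# import torch
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     "meta-llama/Llama-2-7b-hf",          # illustrative checkpoint
#     torch_dtype=torch.bfloat16,          # flash-attn requires fp16/bf16
#     **attention_kwargs(True),
# )
```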
ValueError: The model did not return a loss from the inputs, but `model` exists in `train_datasets column_names`
The complete error is:
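A hedged reading of this error: the Trainer only computes a loss when the batch reaching the model contains a `labels` key, and a stray dataset column (here one literally named `model`) can survive preprocessing and trigger this check. The sketch below illustrates the idea with plain dicts; the real fix uses `datasets.Dataset.remove_columns` or the Trainer's `remove_unused_columns` behavior.

```python
# Plain-dict illustration (not the actual datasets API): keep only the keys
# the model's forward() accepts, and make sure "labels" is among them.

def clean_batch(batch: dict, accepted: set) -> dict:
    """Keep only the keys the model's forward() accepts."""
    return {k: v for k, v in batch.items() if k in accepted}

raw = {"input_ids": [1, 2, 3], "model": "llama", "labels": [1, 2, 3]}
batch = clean_batch(raw, accepted={"input_ids", "attention_mask", "labels"})
```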
How to use HuggingFace’s run_translation.py script to train a translation from scratch?
I have tried several Hugging Face scripts for building language models, such as run_mlm.py
(link), run_clm.py
(link), and run_translation.py
(link). The first two can train a language model from scratch (i.e. without a base model).
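A sketch of the pattern run_clm.py and run_mlm.py use for from-scratch training (the helper name and checkpoint are illustrative assumptions): when no checkpoint is given, the model is built from a config with randomly initialized weights instead of loading pretrained ones. run_translation.py would need the same branch.

```python
# Sketch of the from-scratch vs. pretrained decision (names illustrative).
from typing import Optional

def init_mode(model_name_or_path: Optional[str]) -> str:
    """Decide between loading pretrained weights and training from scratch."""
    return "from_pretrained" if model_name_or_path else "from_config"

# In transformers code this corresponds to (commented; requires the library):
# from transformers import AutoConfig, AutoModelForSeq2SeqLM
# config = AutoConfig.from_pretrained("t5-small")    # architecture only
# model = AutoModelForSeq2SeqLM.from_config(config)  # randomly initialized
```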
Cannot import TFTrainer from transformers
I am trying to train a GPT-2 model on my own data.
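A hedged note on the likely cause: `TFTrainer` was deprecated and later removed from transformers, so the import fails on recent versions; TensorFlow users are directed to Keras `model.fit`, while the PyTorch `Trainer` remains available. A defensive sketch:

```python
# Hedged sketch: probe whether TFTrainer is still importable; on recent
# transformers releases it is not, and the fallback path applies.

def pick_training_api() -> str:
    try:
        from transformers import TFTrainer  # noqa: F401  (removed in recent releases)
        return "TFTrainer"
    except ImportError:  # also raised when transformers itself is missing
        return "keras-fit-or-pytorch-Trainer"
```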
Seamless m4t T2TT finetuning
Is there a way to finetune the Seamless-m4t-large model for the text-to-text (T2TT) task? The official git repo “facebookresearch/seamless_communication” doesn’t currently support this.
Python Accelerate package throws an error when using Trainer from Transformers
I’m trying out this Hugging Face tutorial
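A hedged guess at the usual cause: recent versions of the transformers `Trainer` depend on the `accelerate` package, and the error typically means it is missing or too old (the version threshold below is illustrative). A plain version check:

```python
# Illustrative sketch: compare dotted version strings numerically.
# The required version "0.20.1" is an assumption, not from the original post.

def needs_upgrade(installed: str, required: str = "0.20.1") -> bool:
    """Return True if the installed version is older than the required one."""
    as_tuple = lambda v: tuple(int(p) for p in v.split("."))
    return as_tuple(installed) < as_tuple(required)

# If True, the usual fix is:  pip install --upgrade accelerate transformers
```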
Basic save_pretrained / from_pretrained not retrieving the same model that was saved – Transformers
I created my model with:
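A hedged sketch of the save/load round trip `save_pretrained` performs (it writes a `config.json` plus weight files to a directory), reduced here to a JSON config round trip so it runs anywhere; directory and keys are illustrative. A common cause of mismatch is reloading with a different Auto class than the one used to save, or forgetting to also save the tokenizer.

```python
# Illustrative stand-in for save_pretrained / from_pretrained: persist a config
# dict as config.json and read it back, mirroring the directory layout.
import json
import os
import tempfile

def save_config(cfg: dict, directory: str) -> None:
    """Write cfg to <directory>/config.json, creating the directory if needed."""
    os.makedirs(directory, exist_ok=True)
    with open(os.path.join(directory, "config.json"), "w") as f:
        json.dump(cfg, f)

def load_config(directory: str) -> dict:
    """Read the config back from <directory>/config.json."""
    with open(os.path.join(directory, "config.json")) as f:
        return json.load(f)

with tempfile.TemporaryDirectory() as d:
    save_config({"hidden_size": 768, "num_labels": 2}, d)
    restored = load_config(d)
```

With the real library, the analogous round trip is `model.save_pretrained("dir")` followed by `AutoModel.from_pretrained("dir")`, using the same model class on both sides.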