NLP input dataset Quality Evaluation
Is there any way to evaluate quality of text input data(using metrics like F1 score or some other) that is going to be used for LLM tasks?
Is there any way to evaluate quality of text input data(using metrics like F1 score or some other) that is going to be used for LLM tasks?