Using BART, T5, mBART, and mT5 for translation of a new language
I am developing a transliteration model that converts Romanized Tamil (Tamil written in the Latin alphabet) into native Tamil script. I’ve come across several projects that use transformer-based models such as T5, BART, mT5, and mBART for translation tasks. As I understand it, mT5 and mBART were pretrained on data that includes Tamil. Romanized Tamil can be considered a separate language, often called “Tanglish”.