Translation Models

LLMs are well suited to natural language processing tasks, including translation. Thousands of models are now open-sourced on platforms such as GitHub and Hugging Face, and about 5,000 new translation models are added every week.

BigTranslate, an LLM developed by a team of Chinese researchers, supports multilingual translation across more than 100 languages and is available on GitHub.

The model is built on Meta's LLaMA (introduced in February 2023). It is designed to translate low-resource languages with high accuracy. It is centered on Chinese and is trained on a parallel dataset covering 102 languages. The corpus is drawn from various public and proprietary resources.

The model has been tested against Google Translate and ChatGPT. It surpassed ChatGPT on BLEU scores and closely matched Google Translate.
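For readers unfamiliar with BLEU, the metric rewards a translation for sharing word sequences (n-grams) with a reference translation and penalizes output that is too short. The snippet below is only a minimal sketch of that idea, assuming whitespace tokenization, a single reference and no smoothing. The function names are illustrative, and this is not the evaluation script used by the model's authors; published comparisons normally rely on corpus-level BLEU from a standard tool.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count every n-gram (as a tuple) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Simplified single-reference BLEU: uniform weights, no smoothing."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        ref_counts = ngrams(ref, n)
        # Clipped matches: a candidate n-gram is credited at most as many
        # times as it occurs in the reference.
        overlap = sum(min(count, ref_counts[gram])
                      for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            # Without smoothing, one zero precision makes the score zero.
            return 0.0
        log_precisions.append(math.log(overlap / total))

    # Brevity penalty: applies when the candidate is not longer than the reference.
    if len(cand) > len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(cand))

    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return brevity_penalty * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    reference = "the cat sat on a mat"
    print(sentence_bleu("the cat sat on the mat", reference))
    print(sentence_bleu(reference, reference))
```

An exact match scores 1.0 and a translation with no shared words scores 0.0, so a higher BLEU score means closer agreement with the reference translation.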

It can translate Tibetan and Mongolian, which makes it marketable in China. Alibaba Group has released PolyLM to compete with this product.
