Welcome to ISSAI’s Tilmash project, enabling two-way machine translation for four languages — Kazakh, Russian, English, and Turkish.
Our translation model was fine-tuned from Facebook’s NLLB model, which is designed to translate across 202 languages.
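To make the language coverage concrete, here is a minimal sketch that enumerates the possible translation directions between the four languages. It assumes that the fine-tuned model keeps the FLORES-200 language codes used by NLLB tokenizers and that every ordered pair is supported. Neither point is confirmed above, and the names `LANG_CODES` and `translation_directions` are illustrative only.

```python
from itertools import permutations

# FLORES-200 codes that NLLB uses for the four Tilmash languages.
# Assumption: the fine-tuned model keeps these codes unchanged.
LANG_CODES = {
    "Kazakh": "kaz_Cyrl",
    "Russian": "rus_Cyrl",
    "English": "eng_Latn",
    "Turkish": "tur_Latn",
}


def translation_directions(codes=LANG_CODES):
    """Return every ordered (source, target) pair of language codes."""
    return [(codes[src], codes[tgt]) for src, tgt in permutations(codes, 2)]


directions = translation_directions()
```

With four languages this yields 12 ordered pairs, of which the six that involve Kazakh are the ones evaluated in the tables further down.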
Our model was trained on a wide range of data sources, including official government websites (e.g., the official site of the President of the Republic of Kazakhstan and the State of the Nation Address), news articles, phrasebooks, specialized terminology, and even inspiring TED Talks. Over two years, our dedicated team of linguists carefully reviewed and refined these data. We also incorporated English-language resources that were automatically translated into Kazakh, Russian, and Turkish.
The result is a state-of-the-art machine translation model that rivals the translation engines of industry giants like Google and Yandex on two standard metrics, BLEU and ChrF. We have compiled the results in the tables below, which compare our Tilmash model with these systems.
Translation from Kazakh
Target language | System | BLEU | ChrF |
English | Google Translate | 0.32 | 62.81 |
English | Yandex Translate | 0.29 | 60.68 |
English | Tilmash | 0.32 | 62.23 |
Russian | Google Translate | 0.26 | 58.89 |
Russian | Yandex Translate | 0.26 | 59.51 |
Russian | Tilmash | 0.26 | 59.65 |
Turkish | Google Translate | 0.21 | 57.96 |
Turkish | Yandex Translate | 0.13 | 52.11 |
Turkish | Tilmash | 0.15 | 54.57 |
Translation into Kazakh
Source language | System | BLEU | ChrF |
English | Google Translate | 0.27 | 63.21 |
English | Yandex Translate | 0.18 | 58.44 |
English | Tilmash | 0.21 | 59.78 |
Russian | Google Translate | 0.21 | 59.69 |
Russian | Yandex Translate | 0.2 | 59.73 |
Russian | Tilmash | 0.19 | 59.68 |
Turkish | Google Translate | 0.17 | 56.01 |
Turkish | Yandex Translate | 0.13 | 52.95 |
Turkish | Tilmash | 0.15 | 54.43 |
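For readers unfamiliar with ChrF, the sketch below shows the idea behind the metric: it measures character n-gram overlap between a system translation and a reference, combined into an F-score on a 0 to 100 scale. This is a simplified, sentence-level illustration and not the evaluation script used for the scores above. The function names are hypothetical, and the defaults (n-grams up to length 6, beta of 2) are common choices assumed here rather than settings stated in this text.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count character n-grams of length n, ignoring whitespace."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis, reference, max_n=6, beta=2.0):
    """Simplified sentence-level chrF on a 0-100 scale (illustrative only)."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # skip n-gram orders longer than either string
        overlap = sum((hyp & ref).values())  # clipped n-gram matches
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    # F-beta: beta > 1 weights recall more heavily than precision.
    return 100 * (1 + beta ** 2) * p * r / (beta ** 2 * p + r)
```

Established tools differ in details such as how the per-order scores are averaged and how corpus-level statistics are pooled, so the numbers from this sketch are not directly comparable to the figures reported above.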
We have also developed a demo that gives you firsthand experience with our model.
Please keep your text under 300 characters.
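If your text is longer than the limit, one option is to split it into smaller pieces and submit them one at a time. The following is a minimal sketch of such a splitter. It assumes that breaking at sentence-ending punctuation is acceptable for your text, and the helper name `split_for_demo` is hypothetical rather than part of the demo.

```python
import re

MAX_CHARS = 300  # limit stated for the demo


def split_for_demo(text, limit=MAX_CHARS):
    """Split text into chunks that are each shorter than `limit` characters."""
    # Break after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        # Hard-wrap a single sentence that is too long on its own.
        while len(sentence) >= limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit - 1])
            sentence = sentence[limit - 1:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) < limit:
            current = candidate  # sentence still fits in the current chunk
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Splitting at sentence boundaries keeps each piece self-contained, which generally helps translation quality more than cutting at an arbitrary character position.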