Instructions for Kazakh TTS demo:
In order to stimulate research and innovation and also encourage the use of Kazakh in the digital sphere, the Institute of smart systems and artificial intelligence (ISSAI) of Nazarbayev University conducted a Kazakh language speech synthesis project. Text-to-speech (TTS) conversion is the artificial production of human speech. It allows to convert the written text to the speech signal. TTS is an essential component in many applications such as interactive smart assistant systems, navigation systems, announcement systems and assistive technologies for the visually-impaired. It enables human-technology interaction without requiring visual and tactile interfaces.
To build Kazakh TTS systems, we used the KazakhTTS database developed by ISSAI. KazakhTTS is a high-quality open-source speech database which contains over 90 hours of audio recorded by professional speakers (male and female voices). The database is publicly available for both academic and commercial use upon request under Creative Commons Attribution 4.0 International License.
If you use the ISSAI’s KazakhTTS database for commercial purposes, please add this statement to your product or service:
Our product uses ISSAI KazakhTTS (https://doi.org/10.48342/bkzq-tp58), which is available under a Creative Commons Attribution 4.0 International License.
If you use the ISSAI’s KazakhTTS database for research, please cite it as:
Mussakhojayeva, S., Janaliyeva, A., Mirzakhmetov, A., Khassanov, Y. and Varol, H.A., 2021. KazakhTTS: An Mussakhojayeva, S., Janaliyeva, A., Mirzakhmetov, A., Khassanov, Y., Varol, H.A. (2021) KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset. Proc. Interspeech 2021, 2786-2790, doi: 10.21437/Interspeech.2021-2124Open-Source Kazakh Text-to-Speech Synthesis Dataset. arXiv preprint arXiv:2104.08459
Please note: this is a KazakhTTS DATASET, not a demo of the Kazakh Text-To-Speech conversion technology