The Kazakh speech corpus (KSC) contains around 335 hours of transcribed audio comprising over 154,000 utterances spoken by participants from different regions, age groups, and gender. It was carefully inspected by native Kazakh speakers to ensure high quality. The KSC is the largest publicly available database developed to advance various Kazakh speech and language processing applications such as speech recognition, speech synthesis, and speaker recognition. The KSC database is available for public and commercial use upon request under Creative Commons Attribution 4.0 International License.
If you use the ISSAI Kazakh Speech Corpus for commercial purposes, please add this statement to your product or service:
Our product uses ISSAI Kazakh Speech Corpus (https://doi.org/10.48342/gkg9-gn84), which is available under a Creative Commons Attribution 4.0 International License.
If you use the ISSAI Kazakh Speech Corpus for research, please cite it as:
Y. Khassanov, S. Mussakhojayeva, A. Mirzakhmetov, A. Adiyev, M. Nurpeiissov and H. A. Varol. “A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline”. arXiv preprint arXiv:2009.10334 (2020).
Here is the demo of the automatic speech recognition system build using Kazakh Speech Corpus. Please click the “RECORD” button and speak immediately until the countdown reaches zero. The recognized output will be displayed above the “RECORD” button after 10 seconds. Please note that some browsers don’t support the audio recoding features.
In some models of browsers technology of audio records is not supported. If this is your case, please, consider using up-to-date browsers in desktop devices.