Indian Languages Corpus for Speech Recognition

被引:0
|
作者
Basu, Joyanta [1 ]
Khan, Soma [1 ]
Roy, Rajib [1 ]
Saxena, Babita [1 ]
Ganguly, Dipankar [1 ]
Arora, Sunita [1 ]
Arora, Karunesh Kumar [1 ]
Bansal, Shweta [2 ]
Agrawal, Shyam Sunder [2 ]
机构
[1] CDAC, Kolkata, India
[2] KIIT Coll Engn Gurgaon, Gurgaon, India
关键词
Speech Corpus; IVR; Transcription;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robust Speech Recognition System for various languages have transcended beyond research labs to commercial products. It has been possible owing to the major developments in the area of machine learning, especially deep learning. However, development of advanced speech recognition systems could be leveraged only with the availability of specially curetted speech data. Such systems having usable quality are yet to be developed for most of the Indian languages. The present paper describes the design and development of a standard speech corpora which can be used for developing general purpose ASR systems and benchmarking them. This database has been developed for Indian languages namely Hindi, Bengali and Indian English. The corpus design incorporates important parameters such as phonetic coverage and distribution. The data was recorded by 1500 speakers in each language by male and female speakers of different age groups in varying environments. The data was recorded on a server using online recording system and transcribed using semi-automatic tools. The paper describes the corpus designing methodology, challenges faced and approach adopted to overcome them. The whole process of designing speech database has been generic enough to be used for other languages as well.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [31] Corpus Construction for Aviation Speech Recognition
    Cui, Yiyi
    Wang, Zhen
    Lu, Yanyu
    Fu, Shan
    [J]. HUMAN-COMPUTER INTERACTION: TECHNOLOGICAL INNOVATION, PT II, 2022, 13303 : 238 - 250
  • [32] Multilingual speech recognition for GlobalPhone languages
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    Schultz, Tanja
    [J]. SPEECH COMMUNICATION, 2022, 140 : 71 - 86
  • [33] Multilingual speech recognition in seven languages
    Uebler, U
    [J]. SPEECH COMMUNICATION, 2001, 35 (1-2) : 53 - 69
  • [34] Subword Speech Recognition for Agglutinative Languages
    Valizada, Alakbar
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
  • [35] Multilingual Speech Recognition for Turkic Languages
    Mussakhojayeva, Saida
    Dauletbek, Kaisar
    Yeshpanov, Rustem
    Varol, Huseyin Atakan
    [J]. INFORMATION, 2023, 14 (02)
  • [36] Towards Speech to Speech Machine Translation focusing on Indian Languages
    Mujadia, Vandan
    Umesh, S.
    Murthy, Hema A.
    Sangal, Rajeev
    Sharma, Dipti Misra
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 161 - 168
  • [37] Multilingual Speaker Recognition on Indian Languages
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    Nandi, Dipanjan
    Kumar, Sunil S. B.
    [J]. 2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
  • [38] A Weight Based Approach for Emotion Recognition from Speech: An Analysis Using South Indian Languages
    Poorna, S. S.
    Anuraj, K.
    Nair, G. J.
    [J]. SOFT COMPUTING SYSTEMS, ICSCS 2018, 2018, 837 : 14 - 24
  • [39] An automatic speech recognition system in Indian and foreign languages: A state-of-the-art review analysis
    Gupta A.
    Kumar R.
    Kumar Y.
    [J]. Intelligent Decision Technologies, 2023, 17 (02) : 505 - 526
  • [40] Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages
    Shetty, Vishwas M.
    Umesh, S.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7223 - 7227