Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques

被引:0
|
作者
Podila, Rama Sai Abhishek [1 ]
Kommula, Ganga Sai Sudeep [1 ]
Ruthvik, K. [1 ]
Vekkot, Susmitha [2 ]
Gupta, Deepa [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amrita Sch Comp, Bengaluru, India
[2] Amrita Vishwa Vidyapeetham, Dept Elect & Commun Engn, Amrita Sch Comp, Bengaluru, India
关键词
Speech samples; Telugu dialect; RNN; LSTM; GRU; BiLSTM; BiLSTM with attention layer; recognition; CONVERSION;
D O I
10.1109/INDICON56171.2022.10040194
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
According to India's 2011 demography, there seem to be approximately 8 crore Telugu communicators. Apart from that, the Telugu language has many dialects spread across the states Telangana and Andhra Pradesh. Telangana, Rayalaseema, and Coastal accents are the most common. The main concern is to understand the language irrespective of the dialects to have good communication near border areas of these states. Availability of data for analysis of Telugu speech dialects is of high scope for recognition. So, the creation of data is done for Telugu dialects with a total of 9 speakers, 3 speakers for each dialect. Once the data is created, analysis and recognition can help direct our needs. Classifying dialects cannot only solve this problem but also can act as a subset for solving bigger problems like machine translation, sentiment analysis, etc. We have used four RNN models viz. LSTM, GRU, BiLSTM & BiLSTM with attention layer for classification using speech data as input. Maximum test accuracy of 99.11% was obtained using the BiLSTM model with attention layer.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Persian speech recognition using deep learning
    Veisi, Hadi
    Haji Mani, Armita
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (04) : 893 - 905
  • [22] Speech Command Recognition Using Deep Learning
    Ayache, Mohammad
    Kanaan, Hussien
    Kassir, Kawthar
    Kassir, Yasser
    [J]. 2021 SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN BIOMEDICAL ENGINEERING (ICABME), 2021, : 24 - 29
  • [23] Speech Emotion Recognition Using Deep Learning
    Ahmed, Waqar
    Riaz, Sana
    Iftikhar, Khunsa
    Konur, Savas
    [J]. ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
  • [24] Fake Speech Recognition Using Deep Learning
    Camacho, Steven
    Maria Ballesteros, Dora
    Renza, Diego
    [J]. APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021, 2021, 1431 : 38 - 48
  • [25] Persian speech recognition using deep learning
    Hadi Veisi
    Armita Haji Mani
    [J]. International Journal of Speech Technology, 2020, 23 : 893 - 905
  • [26] Data Augmentation Techniques for Speech Emotion Recognition and Deep Learning
    Antonio Nicolas, Jose
    de Lope, Javier
    Grana, Manuel
    [J]. BIO-INSPIRED SYSTEMS AND APPLICATIONS: FROM ROBOTICS TO AMBIENT INTELLIGENCE, PT II, 2022, 13259 : 279 - 288
  • [27] Sichuan dialect speech recognition with deep LSTM network
    Ying, Wangyang
    Zhang, Lei
    Deng, Hongli
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (02) : 378 - 387
  • [28] Sichuan dialect speech recognition with deep LSTM network
    Wangyang Ying
    Lei Zhang
    Hongli Deng
    [J]. Frontiers of Computer Science, 2020, 14 : 378 - 387
  • [29] Emotion Recognition System via Facial Expressions and Speech Using Machine Learning and Deep Learning Techniques
    Chaudhari A.
    Bhatt C.
    Nguyen T.T.
    Patel N.
    Chavda K.
    Sarda K.
    [J]. SN Computer Science, 4 (4)
  • [30] Machine Learning Paradigms for Speech Recognition of an Indian Dialect
    Londhe, N. D.
    Ahirwal, M. K.
    Lodha, P.
    [J]. 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), VOL. 1, 2016, : 780 - 786