Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques

被引:0
|
作者
Podila, Rama Sai Abhishek [1 ]
Kommula, Ganga Sai Sudeep [1 ]
Ruthvik, K. [1 ]
Vekkot, Susmitha [2 ]
Gupta, Deepa [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amrita Sch Comp, Bengaluru, India
[2] Amrita Vishwa Vidyapeetham, Dept Elect & Commun Engn, Amrita Sch Comp, Bengaluru, India
关键词
Speech samples; Telugu dialect; RNN; LSTM; GRU; BiLSTM; BiLSTM with attention layer; recognition; CONVERSION;
D O I
10.1109/INDICON56171.2022.10040194
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
According to India's 2011 demography, there seem to be approximately 8 crore Telugu communicators. Apart from that, the Telugu language has many dialects spread across the states Telangana and Andhra Pradesh. Telangana, Rayalaseema, and Coastal accents are the most common. The main concern is to understand the language irrespective of the dialects to have good communication near border areas of these states. Availability of data for analysis of Telugu speech dialects is of high scope for recognition. So, the creation of data is done for Telugu dialects with a total of 9 speakers, 3 speakers for each dialect. Once the data is created, analysis and recognition can help direct our needs. Classifying dialects cannot only solve this problem but also can act as a subset for solving bigger problems like machine translation, sentiment analysis, etc. We have used four RNN models viz. LSTM, GRU, BiLSTM & BiLSTM with attention layer for classification using speech data as input. Maximum test accuracy of 99.11% was obtained using the BiLSTM model with attention layer.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Language dialect based speech emotion recognition through deep learning techniques
    Rajendran, Sukumar
    Mathivanan, Sandeep Kumar
    Jayagopal, Prabhu
    Venkatasen, Maheshwari
    Pandi, Thanapal
    Sorakaya Somanathan, Manivannan
    Thangaval, Muthamilselvan
    Mani, Prasanna
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (03) : 625 - 635
  • [2] Language dialect based speech emotion recognition through deep learning techniques
    Sukumar Rajendran
    Sandeep Kumar Mathivanan
    Prabhu Jayagopal
    Maheshwari Venkatasen
    Thanapal Pandi
    Manivannan Sorakaya Somanathan
    Muthamilselvan Thangaval
    Prasanna Mani
    [J]. International Journal of Speech Technology, 2021, 24 : 625 - 635
  • [3] A Dataset For Turkish Dialect Recognition and Classification with Deep Learning
    Isik, Gultekin
    Artuner, Harun
    [J]. 2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [4] Dialect recognition from Telugu speech utterances using spectral and prosodic features
    Shivaprasad, S.
    Sadanandam, M.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 27 (2) : 515 - 515
  • [5] Emotional speech Recognition using CNN and Deep learning techniques
    Hema, C.
    Marquez, Fausto Pedro Garcia
    [J]. APPLIED ACOUSTICS, 2023, 211
  • [6] Speech Emotion Recognition Using Deep Learning Techniques: A Review
    Khalil, Ruhul Amin
    Jones, Edward
    Babar, Mohammad Inayatullah
    Jan, Tariqullah
    Zafar, Mohammad Haseeb
    Alhussain, Thamer
    [J]. IEEE ACCESS, 2019, 7 : 117327 - 117345
  • [7] Dialect Identification in Telugu Language Speech Utterance Using Modified Features with Deep Neural Network
    Satla, Shivaprasad
    Manchala, Sadanandam
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1793 - 1799
  • [8] Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition
    Yadavalli, Aditya
    Mirishkar, Ganesh S.
    Vuppala, Anil Kumar
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 292 - 301
  • [9] Urdu Speech Emotion Recognition using Speech Spectral Features and Deep Learning Techniques
    Taj, Soonh
    Shaikh, Ghulam Mujtaba
    Hassan, Saif
    Nimra
    [J]. 2023 4th International Conference on Computing, Mathematics and Engineering Technologies: Sustainable Technologies for Socio-Economic Development, iCoMET 2023, 2023,
  • [10] Accent Recognition System Using Deep Belief Networks for Telugu Speech Signals
    Mannepalli, Kasiprasad
    Sastry, Panyam Narahari
    Suman, Maloji
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 99 - 105