Telugu Dialect Speech Dataset Creation and Recognition using Deep Learning Techniques

被引:0
|
作者
Podila, Rama Sai Abhishek [1 ]
Kommula, Ganga Sai Sudeep [1 ]
Ruthvik, K. [1 ]
Vekkot, Susmitha [2 ]
Gupta, Deepa [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amrita Sch Comp, Bengaluru, India
[2] Amrita Vishwa Vidyapeetham, Dept Elect & Commun Engn, Amrita Sch Comp, Bengaluru, India
关键词
Speech samples; Telugu dialect; RNN; LSTM; GRU; BiLSTM; BiLSTM with attention layer; recognition; CONVERSION;
D O I
10.1109/INDICON56171.2022.10040194
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
According to India's 2011 demography, there seem to be approximately 8 crore Telugu communicators. Apart from that, the Telugu language has many dialects spread across the states Telangana and Andhra Pradesh. Telangana, Rayalaseema, and Coastal accents are the most common. The main concern is to understand the language irrespective of the dialects to have good communication near border areas of these states. Availability of data for analysis of Telugu speech dialects is of high scope for recognition. So, the creation of data is done for Telugu dialects with a total of 9 speakers, 3 speakers for each dialect. Once the data is created, analysis and recognition can help direct our needs. Classifying dialects cannot only solve this problem but also can act as a subset for solving bigger problems like machine translation, sentiment analysis, etc. We have used four RNN models viz. LSTM, GRU, BiLSTM & BiLSTM with attention layer for classification using speech data as input. Maximum test accuracy of 99.11% was obtained using the BiLSTM model with attention layer.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Speech recognition using supervised and unsupervised learning techniques
    Singh, Amber
    Anand, R. S.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, : 691 - 696
  • [32] Word-Level Speech Dataset Creation for Sourashtra and Recognition System Using Kaldi
    Vancha, Punitha
    Nagarajan, Harshitha
    Inakollu, Vishnu Sai
    Gupta, Deepa
    Vekkot, Susmitha
    [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [33] A review on emotion recognition from dialect speech using feature optimization and classification techniques
    Thimmaiah, Sunil
    Vinay, N. A.
    Ravikumar, M. G.
    Prasad, S. R.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (29) : 73793 - 73793
  • [34] Activity Recognition for Locomotion and Transportation Dataset Using Deep Learning
    Naseeb, Chan
    Al Saeedi, Bilal
    [J]. UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 329 - 334
  • [35] Recognition of English speech - using a deep learning algorithm
    Wang, Shuyan
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [36] Speech Emotion Recognition Using Deep Neural Networks, Transfer Learning, and Ensemble Classification Techniques
    Mihalache, Serban
    Burileanu, Dragos
    [J]. ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2023, 26 (3-4): : 375 - 387
  • [37] Deep features-based dialect and mood recognition using assamese telephonic speech
    Sharma, Mridusmita
    Sarma, Kandarpa Kumar
    [J]. International Journal of Information and Communication Technology, 2020, 17 (04): : 343 - 363
  • [38] Speech Recognition using Arithmetic Coding and MFCC for Telugu Language
    Kumar, Archek Praveen
    Kumar, Neeraj
    Kumar, Cheruku Sandesh
    Yadav, Ashwani Kumar
    Sharma, Abhay
    [J]. PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 265 - 268
  • [39] Detecting Hate Speech using Deep Learning Techniques
    Paul, Chayan
    Bora, Pronami
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 619 - 623
  • [40] Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
    Abbaschian, Babak Joze
    Sierra-Sosa, Daniel
    Elmaghraby, Adel
    [J]. SENSORS, 2021, 21 (04) : 1 - 27