Improving Indian Spoken-Language Identification by Feature Selection in Duration Mismatch Framework

被引:0
|
作者
Bakshi A. [1 ]
Kopparapu S.K. [2 ]
机构
[1] UMIT, SNDT University, Mumbai
[2] TCS Research, TATA Consultancy Services, Yantra Park, Thane
关键词
Classifier fusion; Feature selection; Indian language; Spoken language identification;
D O I
10.1007/s42979-021-00750-1
中图分类号
学科分类号
摘要
Paper presents novel duration normalized feature selection technique and two-step modified hierarchical classifier to improve the accuracy of spoken language identification (SLID) using Indian languages for duration mismatched condition. Feature selection averages random forest-based importance vectors of open SMILE features of different duration utterances. Although it improves the SLID system’s accuracy for mismatched training and testing durations, the performance is significantly reduced for short-duration utterances. A cascade of inter-family and intra-family classifiers with an additional class to improve false language family estimation. All India Radio data set with nine Indian languages and different utterance durations was used as speech material. Experimental results showed that 150 optimal features with the proposed modified hierarchical classifier showed the highest accuracy of 96.9 % and 84.4 % for 30 s and 0.2 s utterances for the same train-test duration. However, we achieved an accuracy of 98.3 % and 61.9 % for 15 and 0.2 s test duration when trained with 30 s duration utterance. Comparative analysis showed a significant improvement in accuracy than several SLID systems in the literature. © 2021, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 44 条
  • [1] DURATION-NORMALIZED FEATURE SELECTION FOR INDIAN SPOKEN LANGUAGE IDENTIFICATION IN UTTERANCE LENGTH MISMATCH
    Bakshi, Aarti M.
    Kopparapu, Sunil K.
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (03): : 2120 - 2134
  • [2] SpeechActs: A spoken-language framework
    Martin, P
    Crabbe, F
    Adams, S
    Baatz, E
    Yankelovich, N
    [J]. COMPUTER, 1996, 29 (07) : 33 - &
  • [3] uGloss: A Framework for Improving Spoken Language Generation Understandability
    Langner, Brian
    Black, Alan W.
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2508 - 2511
  • [4] Spoken Indian language identification: a review of features and databases
    Aarti, Bakshi
    Kopparapu, Sunil Kumar
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2018, 43 (04):
  • [5] Spoken Indian language identification: a review of features and databases
    BAKSHI AARTI
    SUNIL KUMAR KOPPARAPU
    [J]. Sādhanā, 2018, 43
  • [6] Effect of Language Independent Transcribers on Spoken Language Identification for Different Indian Languages
    Saikia, Rajlakshmi
    Singh, Sanasam Ranbir
    Sarmah, Priyankoo
    [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 214 - 217
  • [7] A Hybrid Meta-Heuristic Feature Selection Method for Identification of Indian Spoken Languages From Audio Signals
    Das, Aankit
    Guha, Samarpan
    Singh, Pawan Kumar
    Ahmadian, Ali
    Senu, Norazak
    Sarkar, Ram
    [J]. IEEE ACCESS, 2020, 8 : 181432 - 181449
  • [8] Semantic Role Labeling with Discriminative Feature Selection for Spoken Language Understanding
    Liu, Chao-Hong
    Wu, Chung-Hsien
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1039 - 1042
  • [9] Improving writer identification by means of feature selection and extraction
    Schlapbach, A
    Kilchherr, V
    Bunke, H
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 131 - 135
  • [10] Advances in Feature Extraction and Modelling for Short Duration Language Identification
    Fernando, Sarith
    Irtza, Saad
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    [J]. 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS' 2018), 2018,