Sentiment analysis with word-based Urdu speech recognition

被引:0
|
作者
Riyaz Shaik
S. Venkatramaphanikumar
机构
[1] Vignan’s Foundation for Science,Department of Computer Science & Engineering
[2] Technology & Research,undefined
关键词
Opinion mining; Mel-frequency cepstral coefficients; Spectral energy; Chroma vector; Perceptual linear prediction; Relative-spectral PLP; Dynamic time warping; Hidden Markov model;
D O I
暂无
中图分类号
学科分类号
摘要
Urdu is one of the popular languages across the world as approximately 70 million people speak Urdu in their day-to-day conversations. In general, Muslims prefer to share their opinion or feedback in speech format in the Urdu language. From the literature, it is evident that opinion extraction from naturalistic audio has emerged as a new field of research. In this automatic speech, recognition is carried with keyword spotting approaches on audio, and then opinion score is computed. In this paper, the authors propose a novel framework for the extraction of sentiment from Urdu audio data. Firstly, speech utterances are duly pre-processed, and then short-term features such as Mel-frequency cepstral coefficients, spectral energy, Chroma vector features, perceptual linear prediction (PLP) cepstral coefficients and relative-spectral PLP features are extracted. Five mid-term features, including mean, median, etc., are then derived from those short-term features. In the opinion extraction phase, midterm features of Urdu test utterances are compared with the midterm features of the dictionary of words to cite the opinion as positive, negative, and neutral. The originality of the work involves analyzing the perceptual features to find out the features that contain significant information to extract sentiment in Urdu utterances. In this work, weight mean vector fusion technique is used to fuse the outputs of hidden Markov model and dynamic time warping. In the experiments, 97.1% accuracy is achieved in the sentiment analysis task on the Urdu custom corpus of 600 utterances, which outperforms other state-of-the-art classifiers.
引用
收藏
页码:2511 / 2531
页数:20
相关论文
共 50 条
  • [1] Sentiment analysis with word-based Urdu speech recognition
    Shaik, Riyaz
    Venkatramaphanikumar, S.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2511 - 2531
  • [2] Fusion Architectures for Word-based Audiovisual Speech Recognition
    Wand, Michael
    Schmidhuber, Jurgen
    [J]. INTERSPEECH 2020, 2020, : 3491 - 3495
  • [3] Word-based confidence measures as a guide for stack search in speech recognition
    Neti, CV
    Roukos, S
    Eide, E
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 883 - 886
  • [4] A Word-Based Naive Bayes Classifier for Confidence Estimation in Speech Recognition
    Sanchis, Alberto
    Juan, Alfons
    Vidal, Enrique
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 565 - 574
  • [5] END-TO-END SPEECH RECOGNITION WITH WORD-BASED RNN LANGUAGE MODELS
    Hori, Takaaki
    Cho, Jaejin
    Watanabe, Shinji
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 389 - 396
  • [6] Prosodic Word-Based Error Correction in Speech Recognition Using Prosodic Word Expansion and Contextual Information
    Liu, Chao-Hong
    Wu, Chung-Hsien
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1385 - 1388
  • [7] Speech Emotion Recognition using Convolutional Neural Network with Audio Word-based Embedding
    Huang, Kun-Yi
    Wu, Chung-Hsien
    Hong, Qian-Bei
    Su, Ming-Hsiang
    Zeng, Yuan-Rong
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 265 - 269
  • [8] Word-Based Classification of Imagined Speech Using EEG
    Hashim, Noramiza
    Ali, Aziah
    Mohd-Isa, Wan-Noorshahida
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 195 - 204
  • [9] Urdu Sentiment Analysis
    Rehman, Iffraah
    Soomro, Tariq Rahim
    [J]. APPLIED COMPUTER SYSTEMS, 2022, 27 (01) : 30 - 42
  • [10] Urdu Sentiment Analysis
    Khan, Khairullah
    Rahman, Atta Ur
    Khan, Aurangzeb
    Khan, Ashraf Ullah
    Saqia, Bibi
    Khan, Wahab
    Khans, Asfandyar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 646 - 651