Sentiment analysis with word-based Urdu speech recognition

被引:0
|
作者
Riyaz Shaik
S. Venkatramaphanikumar
机构
[1] Vignan’s Foundation for Science,Department of Computer Science & Engineering
[2] Technology & Research,undefined
关键词
Opinion mining; Mel-frequency cepstral coefficients; Spectral energy; Chroma vector; Perceptual linear prediction; Relative-spectral PLP; Dynamic time warping; Hidden Markov model;
D O I
暂无
中图分类号
学科分类号
摘要
Urdu is one of the popular languages across the world as approximately 70 million people speak Urdu in their day-to-day conversations. In general, Muslims prefer to share their opinion or feedback in speech format in the Urdu language. From the literature, it is evident that opinion extraction from naturalistic audio has emerged as a new field of research. In this automatic speech, recognition is carried with keyword spotting approaches on audio, and then opinion score is computed. In this paper, the authors propose a novel framework for the extraction of sentiment from Urdu audio data. Firstly, speech utterances are duly pre-processed, and then short-term features such as Mel-frequency cepstral coefficients, spectral energy, Chroma vector features, perceptual linear prediction (PLP) cepstral coefficients and relative-spectral PLP features are extracted. Five mid-term features, including mean, median, etc., are then derived from those short-term features. In the opinion extraction phase, midterm features of Urdu test utterances are compared with the midterm features of the dictionary of words to cite the opinion as positive, negative, and neutral. The originality of the work involves analyzing the perceptual features to find out the features that contain significant information to extract sentiment in Urdu utterances. In this work, weight mean vector fusion technique is used to fuse the outputs of hidden Markov model and dynamic time warping. In the experiments, 97.1% accuracy is achieved in the sentiment analysis task on the Urdu custom corpus of 600 utterances, which outperforms other state-of-the-art classifiers.
引用
收藏
页码:2511 / 2531
页数:20
相关论文
共 50 条
  • [31] Lexicon Based Sentiment Analysis of Urdu Text Using SentiUnits
    Syed, Afraz Z.
    Aslam, Muhammad
    Maria Martinez-Enriquez, Ana
    ADVANCES IN ARTIFICIAL INTELLIGENCE, MICAI 2010, PT I, 2010, 6437 : 32 - 43
  • [32] Effective lexicon-based approach for Urdu sentiment analysis
    Mukhtar, Neelam
    Khan, Mohammad Abid
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2521 - 2548
  • [33] Resource Creation and Evaluation of Aspect Based Sentiment Analysis in Urdu
    Rani, Sadaf
    Anwar, Muhammad Waqas
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 72 - 77
  • [34] A Roman Urdu Corpus for sentiment analysis
    Khan, Marwa
    Naseer, Asma
    Wali, Aamir
    Tamoor, Maria
    Computer Journal, 2024, 67 (09): : 2864 - 2876
  • [35] A Roman Urdu Corpus for sentiment analysis
    Khan, Marwa
    Naseer, Asma
    Wali, Aamir
    Tamoor, Maria
    COMPUTER JOURNAL, 2024,
  • [36] Sentiment Analysis System for Roman Urdu
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 29 - 42
  • [37] Automatic speech recognition of Urdu words using linear discriminant analysis
    Ali, Hazrat
    Ahmad, Nasir
    Zhou, Xianwei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (05) : 2369 - 2375
  • [38] Word-Based Arabic Handwritten Recognition Using SVM Classifier with a Reject Option
    El Qacimy, Bouchra
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 64 - 68
  • [39] Word-Based Method for Chinese Part-of-Speech via Parallel and Adversarial Network
    HUANG Kaiyu
    CAO Jingxiang
    LIU Zhuang
    HUANG Degen
    Chinese Journal of Electronics, 2022, (02) : 337 - 344
  • [40] Word-Based Method for Chinese Part-of-Speech via Parallel and Adversarial Network
    Huang, Kaiyu
    Cao, Jingxiang
    Liu, Zhuang
    Huang, Degen
    CHINESE JOURNAL OF ELECTRONICS, 2022, 31 (02) : 337 - 344