Sentiment analysis with word-based Urdu speech recognition

被引:0
|
作者
Riyaz Shaik
S. Venkatramaphanikumar
机构
[1] Vignan’s Foundation for Science,Department of Computer Science & Engineering
[2] Technology & Research,undefined
关键词
Opinion mining; Mel-frequency cepstral coefficients; Spectral energy; Chroma vector; Perceptual linear prediction; Relative-spectral PLP; Dynamic time warping; Hidden Markov model;
D O I
暂无
中图分类号
学科分类号
摘要
Urdu is one of the popular languages across the world as approximately 70 million people speak Urdu in their day-to-day conversations. In general, Muslims prefer to share their opinion or feedback in speech format in the Urdu language. From the literature, it is evident that opinion extraction from naturalistic audio has emerged as a new field of research. In this automatic speech, recognition is carried with keyword spotting approaches on audio, and then opinion score is computed. In this paper, the authors propose a novel framework for the extraction of sentiment from Urdu audio data. Firstly, speech utterances are duly pre-processed, and then short-term features such as Mel-frequency cepstral coefficients, spectral energy, Chroma vector features, perceptual linear prediction (PLP) cepstral coefficients and relative-spectral PLP features are extracted. Five mid-term features, including mean, median, etc., are then derived from those short-term features. In the opinion extraction phase, midterm features of Urdu test utterances are compared with the midterm features of the dictionary of words to cite the opinion as positive, negative, and neutral. The originality of the work involves analyzing the perceptual features to find out the features that contain significant information to extract sentiment in Urdu utterances. In this work, weight mean vector fusion technique is used to fuse the outputs of hidden Markov model and dynamic time warping. In the experiments, 97.1% accuracy is achieved in the sentiment analysis task on the Urdu custom corpus of 600 utterances, which outperforms other state-of-the-art classifiers.
引用
下载
收藏
页码:2511 / 2531
页数:20
相关论文
共 50 条
  • [21] A Speech Recognition System for Urdu Language
    Beg, Azam
    Hasnain, S. K.
    WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS, 2008, 20 : 118 - +
  • [22] WORD-BASED TEXT COMPRESSION
    MOFFAT, A
    SOFTWARE-PRACTICE & EXPERIENCE, 1989, 19 (02): : 185 - 198
  • [23] Word-based text compression
    Moffat, Alistair
    Software - Practice and Experience, 1989, 19 (02) : 185 - 198
  • [24] The functional definition of the word and word-based syntax
    Shaumyan, S
    LACUS FORUM XXVI: THE LEXICON, 2000, 26 : 85 - 95
  • [25] Urdu Speech Corpus and Preliminary Results on Speech Recognition
    Ali, Hazrat
    Ahmad, Nasir
    Hafeez, Abdul
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2016, 2016, 629 : 317 - 325
  • [26] Word-Based Forward Coding
    Zavadskyi, Igor
    Klein, Shmuel T.
    Shapira, Dana
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 352 - 361
  • [27] Sentiment Analysis on Roman Urdu Students' Feedback Using Enhanced Word Embedding Technique
    Noureen
    Huspi, Sharin Hazlin
    Ali, Zafar
    BAGHDAD SCIENCE JOURNAL, 2024, 21 (02) : 725 - 739
  • [28] A hybrid dependency-based approach for Urdu sentiment analysis
    Urooba Sehar
    Summrina Kanwal
    Nasser I. Allheeib
    Sultan Almari
    Faiza Khan
    Kia Dashtipur
    Mandar Gogate
    Osama A. Khashan
    Scientific Reports, 13
  • [29] A hybrid dependency-based approach for Urdu sentiment analysis
    Sehar, Urooba
    Kanwal, Summrina
    Allheeib, Nasser I.
    Almari, Sultan
    Khan, Faiza
    Dashtipur, Kia
    Gogate, Mandar
    Khashan, Osama A.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [30] Effective lexicon-based approach for Urdu sentiment analysis
    Neelam Mukhtar
    Mohammad Abid Khan
    Artificial Intelligence Review, 2020, 53 : 2521 - 2548