Floating to Fixed-point Translation with its Application to Speech-based Emotion Recognition

被引:2
|
作者
Kabi, Bibek [1 ]
Sahoo, Subhasmita [2 ]
Samantaray, Amiya Kumar [3 ]
Routray, Aurobinda [2 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect Engn, Kharagpur, W Bengal, India
[3] Natl Inst Technol, Rourkela, India
关键词
Fixed-point arithmetic; hidden Markov model (HMM); mel-frequency cepstral coeffcients (MFCCs); quantization; range estimation; speech-based emotion recognition; wordlength optimization; OPTIMIZATION;
D O I
10.1109/EAIT.2014.57
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Speech-based emotion recognition is one of the latest challenges in speech processing. The algorithms are developed using floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are finally implemented in hand held devices which are required to consume less power, time and have a lower market price. Fixed-point arithmetic with proper determination of integer and fractional bitwidths can help in satisfying these requirements. Therefore we have made an attempt to develop a fixed-point speech-based emotion recognition system using Mel frequency cepstral coefficients (MFCCs) and hidden Markov model (HMM). Accurate range and precision analysis has been carried out to compute optimum integer and fractional wordlengths. The speech emotion engine has been evaluated using Berlin emotional speech database, EMO-DB. A speaker independent emotion recognition accuracy of 71.02% and 67.42% for floating-point and fixed-point formats with optimized wordlenghs respectively was achieved. Finite wordlength effect like quantization with range of relative errors and its effect on emotion recognition task has been analyzed.
引用
收藏
页码:21 / 26
页数:6
相关论文
共 50 条
  • [21] Feature selection enhancement and feature space visualization for speech-based emotion recognition
    Kanwal, Sofia
    Asghar, Sohail
    Ali, Hazrat
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [22] Fixed-point arithmetics trade-offs in adaptive filters for speech recognition
    Garcia-Alcantara, V
    Rodellar, V
    Gomez-Vilda, P
    MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998, : 518 - 521
  • [23] Speaker normalisation for speech-based emotion detection
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathainby
    Epps, Julien
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 611 - +
  • [24] STOCHASTIC MODELING FOR FLOATING-POINT TO FIXED-POINT CONVERSION
    Banciu, Andrei
    Casseau, Emmanuel
    Menard, Daniel
    Michel, Thierry
    2011 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2011, : 180 - 185
  • [25] Floating-point DSP extends fixed-point architecture
    Myrvaagnes, R
    ELECTRONIC PRODUCTS MAGAZINE, 1998, 41 (04): : 26 - 26
  • [26] An automated floating-point to fixed-point conversion methodology
    Shi, CC
    Brodersen, RW
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 529 - 532
  • [27] Computing floating-point logarithms with fixed-point operations
    Le Maire, Julien
    Brunie, Nicolas
    de Dinechin, Florent
    Muller, Jean-Michel
    2016 IEEE 23nd Symposium on Computer Arithmetic (ARITH), 2016, : 156 - 163
  • [28] Robust Speech-Based Happiness Recognition
    Lin, Chang-Hong
    Siahaan, Ernestasia
    Chin, Yu-Hau
    Chen, Bo-Wei
    Wang, Jia-Ching
    Wang, Jhing-Fa
    1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 227 - 230
  • [29] A review of speech-based bimodal recognition
    Chibelushi, CC
    Deravi, F
    Mason, JSD
    IEEE TRANSACTIONS ON MULTIMEDIA, 2002, 4 (01) : 23 - 37
  • [30] Speech-based Emotion Recognition and Speaker Identification: Static vs. Dynamic Mode of Speech Representation
    Sidorov, Maxim
    Minker, Wolfgang
    Semenkin, Eugene S.
    JOURNAL OF SIBERIAN FEDERAL UNIVERSITY-MATHEMATICS & PHYSICS, 2016, 9 (04): : 518 - 523