Floating to Fixed-point Translation with its Application to Speech-based Emotion Recognition

被引:2
|
作者
Kabi, Bibek [1 ]
Sahoo, Subhasmita [2 ]
Samantaray, Amiya Kumar [3 ]
Routray, Aurobinda [2 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect Engn, Kharagpur, W Bengal, India
[3] Natl Inst Technol, Rourkela, India
关键词
Fixed-point arithmetic; hidden Markov model (HMM); mel-frequency cepstral coeffcients (MFCCs); quantization; range estimation; speech-based emotion recognition; wordlength optimization; OPTIMIZATION;
D O I
10.1109/EAIT.2014.57
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Speech-based emotion recognition is one of the latest challenges in speech processing. The algorithms are developed using floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are finally implemented in hand held devices which are required to consume less power, time and have a lower market price. Fixed-point arithmetic with proper determination of integer and fractional bitwidths can help in satisfying these requirements. Therefore we have made an attempt to develop a fixed-point speech-based emotion recognition system using Mel frequency cepstral coefficients (MFCCs) and hidden Markov model (HMM). Accurate range and precision analysis has been carried out to compute optimum integer and fractional wordlengths. The speech emotion engine has been evaluated using Berlin emotional speech database, EMO-DB. A speaker independent emotion recognition accuracy of 71.02% and 67.42% for floating-point and fixed-point formats with optimized wordlenghs respectively was achieved. Finite wordlength effect like quantization with range of relative errors and its effect on emotion recognition task has been analyzed.
引用
收藏
页码:21 / 26
页数:6
相关论文
共 50 条
  • [31] GENERATING AND PROTECTING AGAINST ADVERSARIAL ATTACKS FOR DEEP SPEECH-BASED EMOTION RECOGNITION MODELS
    Ren, Zhao
    Baird, Alice
    Han, Jing
    Zhang, Zixing
    Schuller, Bjoern
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7184 - 7188
  • [32] Improving Speech-Based Emotion Recognition by Using Psychoacoustic Modeling and Analysis-by-Synthesis
    Siegert, Ingo
    Lotz, Alicia Flores
    Egorow, Olga
    Wendemuth, Andreas
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 445 - 455
  • [33] MULTI-OBJECTIVE HEURISTIC FEATURE SELECTION FOR SPEECH-BASED MULTILINGUAL EMOTION RECOGNITION
    Brester, Christina
    Semenkin, Eugene
    Sidorov, Maxim
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (04) : 243 - 253
  • [34] Speech signal-based emotion recognition and its application to entertainment robots
    Song, Kai-Tai
    Han, Meng-Ju
    Wang, Shih-Chieh
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2014, 37 (01) : 14 - 25
  • [35] Dual fixed-point: An efficient alternative to floating-point computation
    Ewe, CT
    Cheung, PYK
    Constantinides, GA
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2004, 3203 : 200 - 208
  • [36] Automated floating-point to fixed-point conversion with the fixify environment
    Belanovic, P
    Rupp, M
    16TH INTERNATIONAL WORKSHOP ON RAPID SYSTEM PROTOTYPING, PROCEEDINGS: SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 2005, : 172 - 178
  • [37] Optimal fixed-point VLSI structure of a floating-point based digital filter design
    Wu, AY
    Hwang, KF
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D375 - D378
  • [38] An algorithm for converting floating-point computations to fixed-point in MATLAB based FPGA design
    Roy, S
    Banerjee, P
    41ST DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2004, 2004, : 484 - 487
  • [39] $10 floating-point DSP approaches fixed-point price
    Levy, M
    EDN, 1998, 43 (08) : 11 - 11
  • [40] An Investigation of Emotion Dynamics and Kalman Filtering for Speech-based Emotion Prediction
    Huang, Zhaocheng
    Epps, Julien
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3301 - 3305