Floating to Fixed-point Translation with its Application to Speech-based Emotion Recognition

Cited by: 2

Authors:

Kabi, Bibek ^{[1]}

Sahoo, Subhasmita ^{[2]}

Samantaray, Amiya Kumar ^{[3]}

Routray, Aurobinda ^{[2]}

Affiliations:

[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India

[2] Indian Inst Technol, Dept Elect Engn, Kharagpur, W Bengal, India

[3] Natl Inst Technol, Rourkela, India

Source:

2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT) | 2014

Keywords:

Fixed-point arithmetic; hidden Markov model (HMM); mel-frequency cepstral coefficients (MFCCs); quantization; range estimation; speech-based emotion recognition; wordlength optimization; OPTIMIZATION;

DOI:

10.1109/EAIT.2014.57

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Speech-based emotion recognition is one of the latest challenges in speech processing. Algorithms are typically developed in floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are ultimately deployed on handheld devices, which must consume little power, run quickly, and sell at a low market price. Fixed-point arithmetic with properly determined integer and fractional bitwidths can help satisfy these requirements. We therefore develop a fixed-point speech-based emotion recognition system using mel-frequency cepstral coefficients (MFCCs) and a hidden Markov model (HMM). Accurate range and precision analysis is carried out to compute the optimum integer and fractional wordlengths. The speech emotion engine is evaluated on the Berlin emotional speech database, EMO-DB. Speaker-independent emotion recognition accuracies of 71.02% and 67.42% were achieved for the floating-point format and the fixed-point format with optimized wordlengths, respectively. Finite-wordlength effects such as quantization, the resulting range of relative errors, and their effect on the emotion recognition task are analyzed.
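
The role of integer and fractional bitwidths mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the function names `integer_bits`, `quantize`, and `relative_error` are hypothetical, and the sketch assumes a signed two's-complement format with one sign bit, round-to-nearest quantization, and saturation on overflow.

```python
import math


def integer_bits(lo, hi):
    """Integer bits (excluding the sign bit) needed to cover the range [lo, hi].

    Assumption: signed two's-complement format, so i integer bits
    represent values in [-2**i, 2**i).
    """
    m = max(abs(lo), abs(hi))
    if m < 1:
        return 0
    return math.floor(math.log2(m)) + 1


def quantize(x, int_bits, frac_bits):
    """Quantize x to a fixed-point value with the given bitwidths.

    Rounds to the nearest representable step (ties toward +infinity)
    and saturates at the limits of the format.
    """
    scale = 1 << frac_bits
    q_min = -(1 << (int_bits + frac_bits))
    q_max = (1 << (int_bits + frac_bits)) - 1
    q = math.floor(x * scale + 0.5)
    q = max(q_min, min(q_max, q))
    return q / scale


def relative_error(x, x_q):
    """Relative quantization error of x_q with respect to a nonzero x."""
    return abs(x - x_q) / abs(x)


if __name__ == "__main__":
    # Hypothetical signal range, chosen only for illustration.
    i_bits = integer_bits(-3.2, 5.7)
    x_q = quantize(5.7, i_bits, 4)
    print(i_bits, x_q, relative_error(5.7, x_q))
```

In this sketch the integer bitwidth follows from the estimated range, while the fractional bitwidth sets the quantization step and hence the relative error; increasing `frac_bits` trades hardware cost for precision.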

Pages: 21-26

Number of pages: 6
