A Robust Pitch Extractor Based on DTW Lines and CASA with Application in Noisy Speech Recognition

Cited by: 0

Authors:

Morales-Cordovilla, Juan A. [1]

Cabanas-Molero, Pablo

Peinado, Antonio M. [1]

Sanchez, Victoria [1]

Affiliations:

[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain

Source:

ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES | 2012 / Vol. 328

Keywords:

pitch extractor; pitch line; CASA; DTW; noise; robust speech recognition;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a robust pitch extractor for Automatic Speech Recognition, based on selecting pitch lines from a tonegram (a representation of the different pitch energies at each frame time). First, the tonegram and its maximum-energy regions are extracted, and a Dynamic Time Warping (DTW) algorithm finds the most energetic trajectories, or pitch lines, within these regions. A second stage estimates the tonegram of the most energetic lines by applying Computational Auditory Scene Analysis (CASA) rules, which reject and group octave-related lines. Finally, the mean pitch of the speaker is estimated, and the final pitch is obtained by rejecting lines that lie far from this mean pitch. The proposed pitch extractor is evaluated in a novel way: by the word accuracy of a Missing Data recognizer on the Aurora-2 database.
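
The stages named in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no formulas, so the dynamic-programming search below is a simplified stand-in for the DTW line search, and every function name, parameter, and threshold (`best_pitch_line`, `max_jump`, `is_octave_related`, `tol`, `keep_near_mean`, `max_ratio`) is a hypothetical choice made only for illustration. The tonegram is assumed to be a list of frames, each holding one energy value per candidate pitch bin.

```python
def best_pitch_line(tonegram, max_jump=1):
    """Find the maximum-energy trajectory through a tonegram.

    Simplified stand-in for a DTW-style search (assumption): consecutive
    frames may differ by at most `max_jump` pitch bins.
    Returns (path, energy), with one bin index per frame.
    """
    n_bins = len(tonegram[0])
    score = list(tonegram[0])          # best accumulated energy ending in each bin
    back = []                          # back-pointers, one list per later frame
    for frame in tonegram[1:]:
        new_score, pointers = [], []
        for b in range(n_bins):
            lo, hi = max(0, b - max_jump), min(n_bins - 1, b + max_jump)
            prev = max(range(lo, hi + 1), key=lambda p: score[p])
            new_score.append(score[prev] + frame[b])
            pointers.append(prev)
        score = new_score
        back.append(pointers)
    end = max(range(n_bins), key=lambda b: score[b])
    path = [end]
    for pointers in reversed(back):    # trace the best trajectory backwards
        path.append(pointers[path[-1]])
    path.reverse()
    return path, score[end]


def is_octave_related(f_a, f_b, tol=0.05):
    """True when two pitches (Hz) are about one octave apart (hypothetical rule)."""
    ratio = max(f_a, f_b) / min(f_a, f_b)
    return abs(ratio - 2.0) <= tol * 2.0


def keep_near_mean(line_pitches, mean_pitch, max_ratio=1.5):
    """Keep line pitches within a factor `max_ratio` of the mean pitch (assumed threshold)."""
    return [f for f in line_pitches
            if mean_pitch / max_ratio <= f <= mean_pitch * max_ratio]


# Toy tonegram: 4 frames x 4 pitch bins, with energy drifting from bin 1 to bin 2.
example = [
    [0.1, 0.9, 0.2, 0.1],
    [0.1, 0.8, 0.3, 0.1],
    [0.1, 0.2, 0.9, 0.1],
    [0.1, 0.1, 0.8, 0.2],
]
path, energy = best_pitch_line(example)
print(path)  # [1, 1, 2, 2]
```

In this toy case the search follows the energy ridge as it moves from one pitch bin to its neighbor. The octave and mean-pitch helpers only show the kind of grouping and rejection test the abstract mentions; the actual CASA rules are not specified there.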


Pages: 197-206

Page count: 10
