FSM-Based Pronunciation Modeling using Articulatory Phonological Code

被引：0

作者：

Hu, Chi ^{[1
]}

Zhuang, Xiaodan ^{[1
]}

Hasegawa-Johnson, Mark ^{[1
]}

机构：

[1] Univ Illinois, Beckman Inst, Dept Elect & Comp Engn, Urbana, IL 61801 USA

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

articulatory phonology; speech production; speech gesture; finite state machine; SPEECH RECOGNITION;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

According to articulatory phonology, the gestural score is an invariant speech representation. Though the timing schemes, i.e., the onsets and offsets, of the gestural activations may vary, the ensemble of these activations tends to remain unchanged, informing the speech content. In this work, we propose a pronunciation modeling method that uses a finite state machine (FSM) to represent the invariance of a gestural score. Given the "canonical" gestural score (CGS) of a word with a known activation timing scheme, the plausible activation onsets and offsets are recursively generated and encoded as a weighted FSM. An empirical measure is used to prune out gestural activation timing schemes that deviate too much from the CGS. Speech recognition is achieved by matching the recovered gestural activations to the FSM-encoded gestural scores of different speech contents. We carry out pilot word classification experiments using synthesized data from one speaker. The proposed pronunciation modeling achieves over 90% accuracy for a vocabulary of 139 words with no training observations, outperforming direct use of the CGS.

引用

下载

页码：2274 / 2277

页数：4

共 50 条

[1] An FSM-based approach for malicious code detection using the self-relocation gene
Zhang, Yu
Li, Tao
Sun, Jia
Qin, Renchao
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 364 - +
[2] FSM-Based Object-Oriented Organization Modeling and Simulation
Merunka, Vojtech
ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, CAISE 2012, 2012, 112 : 398 - 412
[3] On FSM-based fault diagnosis
Pap, Z
Csopaki, G
Dibuz, S
TESTING OF COMMUNICATING SYSTEMS, PROCEEDINGS, 2005, 3502 : 159 - 174
[4] FSM-based power modeling of wireless protocols: The case of bluetooth
Negri, L
Sami, M
Macii, D
Terranegra, A
ISLPED '04: PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2004, : 369 - 374
[5] Articulatory feature-based pronunciation modeling
Livescu, Karen
Jyothi, Preethi
Fosler-Lussier, Eric
COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 212 - 232
[6] FSM-based Properties and Abstraction of Components
Syed Alwi, Syed Hussein
Encrenaz, Emmanuelle
PROCEEDINGS OF THE 2014 25TH IEEE INTERNATIONAL SYMPOSIUM ON RAPID SYSTEM PROTOTYPING (RSP): SHORTENING THE PATH FROM SPECIFICATION TO PROTOTYPE, 2014, : 37 - 43
[7] Articulatory Phonological Code for Word Classification
Zhuang, Xiaodan
Nam, Hosung
Hasegawa-Johnson, Mark
Goldstein, Louis
Saltzman, Elliot
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2723 - +
[8] Secure FSM-based arithmetic codes
Ziyabar, Hashem Moradmand
Sinaie, Mahnaz
Payandeh, Ali
Vakili, Vahid Tabataba
SIGNAL IMAGE AND VIDEO PROCESSING, 2014, 8 (07) : 1263 - 1272
[9] Secure FSM-based arithmetic codes
Hashem Moradmand Ziyabar
Mahnaz Sinaie
Ali Payandeh
Vahid Tabataba Vakili
Signal, Image and Video Processing, 2014, 8 : 1263 - 1272
[10] Using an SMT Solver for Checking the Completeness of FSM-Based Tests
Vinarskii, Evgenii
Laputenko, Andrey
Yevtushenko, Nina
TESTING SOFTWARE AND SYSTEMS, ICTSS 2020, 2020, 12543 : 289 - 295

← 1 2 3 4 5 →