BAG OF ARCS: NEW REPRESENTATION OF SPEECH SEGMENT FEATURES BASED ON FINITE STATE MACHINES

Cited by: 0
Authors
Watanabe, Shinji [1]
Kubo, Yotaro [1]
Oba, Takanobu [1]
Hori, Takaaki [1]
Nakamura, Atsushi [1]
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
Keywords
Speech segment feature; finite state machine; Bag Of Arcs (BOA); speaker recognition; utterance classification; MODEL;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper proposes a new feature representation for speech segments, Bag Of Arcs (BOA). In BOA, a speech segment is represented simply as a set of counts of the unique arcs in a finite state machine. Like the Bag Of Words (BOW) model, BOA disregards the order of arcs and thus models speech segments efficiently. A strong motivation for using BOA is that the representation is tightly connected to the output of a Weighted Finite State Transducer (WFST) based ASR decoder. BOA therefore directly represents elements of the search network of a WFST-based ASR decoder, and can include information about context-dependent HMM topologies, lexicons, and back-off smoothed n-gram networks. In addition, the BOA counts are accumulated directly from the WFST decoder output, so extracting the features requires neither additional overhead nor changes to the decoding algorithm. Consequently, the ASR decoder and post-processing can be combined without a separate step to extract word features from the decoder outputs and without re-compiling WFST networks. We show the effectiveness of the proposed approach for ASR post-processing applications in utterance classification experiments and in speaker adaptation experiments, achieving a 1% absolute improvement in WER over the baseline results. We also show examples of latent semantic analysis for BOA using latent Dirichlet allocation.
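The counting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `bag_of_arcs`, the integer arc IDs, and the toy path are hypothetical, and the sketch assumes the decoder output is already available as a sequence of arc IDs from a machine with a known number of arcs.

```python
from collections import Counter


def bag_of_arcs(arc_path, num_arcs):
    """Map a decoded path (a sequence of arc IDs) to a fixed-length count vector.

    Hypothetical helper: arc IDs are assumed to be integers in [0, num_arcs).
    """
    counts = Counter(arc_path)
    # The order of arcs along the path is discarded; only the counts remain.
    return [counts.get(arc_id, 0) for arc_id in range(num_arcs)]


# Toy example: a made-up path through a machine with 5 arcs.
path = [0, 2, 2, 4, 2, 0]
boa = bag_of_arcs(path, num_arcs=5)
print(boa)  # [2, 0, 3, 0, 1]
```

Because only counts are kept, any reordering of the same arcs yields the same vector, which mirrors the order-insensitivity of the Bag Of Words model that the abstract refers to.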
Pages: 4201-4204
Number of pages: 4
Related papers
  • [31] HORIZONTAL PARTITIONING OF PLA-BASED FINITE STATE MACHINES
    PAULIN, PG
    26TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, 1989, : 333 - 338
  • [32] Automation of industrial serial processes based on finite state machines
    Jamhour, A.
    Garcia, C.
    CHISA 2012, 2012, 42 : 186 - 196
  • [33] Highly testable finite state machines based on EXOR logic
    Kalay, Ugur
    Venkataramaiah, Nagesh
    Mishchenko, Alan
    Hall, Douglas V.
    Perkowski, Marek A.
    IEEE Pacific RIM Conference on Communications, Computers, and Signal Processing - Proceedings, 1999, : 440 - 443
  • [34] Finite state automaton based control system for walking machines
    Hussain, Razeen
    Zielinska, Teresa
    Hexel, Rene
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (03):
  • [35] SDL Ontology for Specifying Systems Based on Finite State Machines
    Babac, Marina Bagic
    Kunstic, Marijan
    TRENDS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2010, 71 : 509 - 517
  • [36] EASILY TESTABLE PLA-BASED FINITE STATE MACHINES
    DEVADAS, S
    MA, HKT
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1990, 9 (06) : 604 - 611
  • [37] Implementing finite state machines in a computer based teaching system
    Hacker, CH
    Sitte, R
    EDUCATION IN MICROELECTRONICS AND MEMS, 1999, 3894 : 110 - 117
  • [38] Testing finite state machines based on a structural coverage metric
    Goren, S
    Ferguson, FJ
    INTERNATIONAL TEST CONFERENCE 2002, PROCEEDINGS, 2002, : 773 - 780
  • [39] Maximum mutual information based acoustic-features representation of phonological features for speech recognition
    Omar, MK
    Hasegawa-Johnson, M
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 81 - 84
  • [40] A new algorithm for exact reduction of incompletely specified finite state machines
    Pena, JM
    Oliveira, AL
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 1999, 18 (11) : 1619 - 1632