Frame-level phoneme classification using inductive inference

被引:2
|
作者
Samouelian, A [1 ]
机构
[1] UNIV SYDNEY,DEPT ELECT ENGN,SPEECH TECHNOL RES LAB,SYDNEY,NSW 2006,AUSTRALIA
来源
COMPUTER SPEECH AND LANGUAGE | 1997年 / 11卷 / 03期
关键词
D O I
10.1006/csla.1997.0029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel approach to frame-level classification by the use of inductive inference (decision trees). The proposed system (Samouelian, 1994a) uses the C4.5 induction system (Quinlan, 1993, 1996) to capture the knowledge about the structure and characteristics of the speech signal explicitly from the database. The decision tree is generated automatically from the training speech database. The database contains labelled examples in the form of a feature vector and its corresponding label for each frame. The feature vector may consist of any number of different feature sets and the label may be at the phoneme, sub-word or word level. This approach allows the integration of features from existing signal processing techniques that are currently used in stochastic modelling such as hidden Markov models (HMMs), and acoustic-phonetic features, which have been the cornerstone of traditional knowledge-based techniques. The aim of this research is to demonstrate that induction systems can provide a viable alternative automatic speech recognition technique by allowing the combination of features from any of the above feature representations to achieve optimum classification. Using C4.5, the results on five experiments are reported. The first four experiments use a small corpus of Australian English consonants (plosives, liquids and nasals) and four different feature sets, and they report on frame-level classification results for speaker-dependent and independent modes. The fifth experiment uses the TIMIT database and the mel frequency cepstral coefficient (MFCC) feature set and reports on frame-level classification results for speaker-independent experiments on the training data and test data. (C) 1997 Academic Press Limited.
引用
收藏
页码:161 / 186
页数:26
相关论文
共 50 条
  • [21] Frame-level speech enhancement based on Wasserstein GAN
    Peng, Chuan
    Lan, Tian
    Li, Meng
    Li, Sen
    Liu, Qiao
    [J]. ELEVENTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2019, 11384
  • [22] Power quality disturbance classification using the inductive inference approach
    Abdel-Galil, TK
    Kamel, M
    Youssef, AM
    El-Saadany, EF
    Salama, MMA
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 2004, 19 (04) : 1812 - 1818
  • [23] Modeling frame-level errors in GSM wireless channels
    Ji, P
    Liu, BY
    Towsley, D
    Kurose, J
    [J]. GLOBECOM'02: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-3, CONFERENCE RECORDS: THE WORLD CONVERGES, 2002, : 2483 - 2487
  • [24] Inductive Inference and Partition Exchangeability in Classification
    Corander, Jukka
    Cui, Yaqiong
    Koski, Timo
    [J]. ALGORITHMIC PROBABILITY AND FRIENDS: BAYESIAN PREDICTION AND ARTIFICIAL INTELLIGENCE, 2013, 7070 : 91 - 105
  • [25] Exploiting detected visual objects for frame-level video filtering
    Du, Xingzhong
    Yin, Hongzhi
    Huang, Zi
    Yang, Yi
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2018, 21 (05): : 1259 - 1284
  • [26] A scalable frame-level pipelined architecture for FSBM motion estimation
    He, Wei-Feng
    Zhao, Meng-Lian
    Tsui, Chi-Ying
    Mao, Zhi-Gang
    [J]. 20TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS: TECHNOLOGY CHALLENGES IN THE NANOELECTRONICS ERA, 2007, : 830 - +
  • [27] NeXtVLAD: An Efficient Neural Network to Aggregate Frame-Level Features for Large-Scale Video Classification
    Lin, Rongcheng
    Xiao, Jing
    Fan, Jianping
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 206 - 218
  • [28] Frame-level bit allocation based on incremental PID algorithm and frame complexity estimation
    Shen, Liquan
    Liu, Zhi
    Zhang, Zhaoyang
    Shi, Xuli
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2009, 20 (01) : 28 - 34
  • [29] A frame-level measurement apparatus for performance testing of ATM equipment
    Angirsani, L
    Baccigalupi, A
    D'Angiolo, G
    [J]. IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001, : 1630 - 1635
  • [30] Frame-level global context modeling for detection and localization of abnormality
    Sharma, Manoj Kumar
    Kumar, Vikas
    Sheet, Debdoot
    Biswas, Prabir Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38345 - 38370