FAST WORD ACQUISITION IN AN NMF-BASED LEARNING FRAMEWORK

Cited by: 0
Authors
Driesen, Joris [1]
Van Hamme, Hugo [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT, B-3001 Louvain, Belgium
Keywords
Acoustic Sub-Word Generation; Unsupervised Learning; Vocabulary Acquisition; Machine Learning;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A speech recognition system that automatically learns word models for a small vocabulary from examples of its usage, without using prior linguistic information, can be of great use in cognitive robotics, human-machine interfaces, and assistive devices. In the case of assistive devices, the user's speech capabilities may also be affected. In this paper, we consider an NMF-based learning framework capable of doing this, and show experimentally that its learning rate depends crucially on how the speech data are represented. Higher-level units of speech, which hide some of the complex variability of the acoustics, are found to yield faster learning rates.
Pages: 5137-5140
Number of pages: 4