Semi-Coupled Dictionary based Automatic Bandwidth Extension Approach for Enhancing Children's ASR

被引:0
|
作者
Sreeram, Ganji [1 ]
Sinha, Rohit [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Speech bandwidth enhancement; sparse representation; semi-coupled dictionary;
D O I
10.21437/Interspeech.2016-798
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The work presented in this paper is motivated by our earlier work exploring sparse representation based approach for automatic bandwidth extension (ABWE) of speech signals. In that work, two dictionaries one for voiced and the other for unvoiced speech frames are created using KSVD algorithm on wideband data. Each of the atoms of these dictionaries is then decimated and interpolated by a factor of 2 to generate narrowband interpolated (NBI) dictionaries whose atoms have one-to-one correspondence with those of the WB dictionaries. The given narrowband speech frames are also interpolated to generated NBI targets and those are sparse coded over the NBI dictionaries. The resulting sparse codes are then applied to the WB dictionaries to estimate the WB target data. In this work, we extend the said approach by making use of an existing semi-coupled dictionary learning (SCDL) algorithm. Unlike the direct dictionary learning, the SCDL algorithm also learns a set of bidirectional transforms coupling the dictionaries more flexibly. The bandwidth enhanced speech obtained employing the SCDL approach and a modified high/low band gain adjustment yields significant improvements in terms of speech quality measures as well as in the context of children's mismatched speech recognition.
引用
收藏
页码:2577 / 2581
页数:5
相关论文
共 6 条
  • [1] Fast image super-resolution reconstruction algorithm based on semi-coupled dictionary
    Liu J.
    Chen D.
    Ma L.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (03): : 63 - 68
  • [2] Image Super-Resolution Reconstruction Based on fusion of K-SVD and Semi-Coupled Dictionary Learning
    Zhang, Xiu
    Zhou, Wei
    Duan, Zhemin
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [3] Sparsity-based inverse halftoning via semi-coupled multi-dictionary learning and structural clustering
    Zhang, Yan
    Zhang, Erhu
    Chen, Wanjun
    Chen, Yajun
    Duan, Jinghong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 72 : 43 - 53
  • [4] Super-resolution reconstruction based on non-local similarity and clustered semi-coupled dictionary learning
    Yang, Aiping
    Zhong, Tengfei
    He, Yuqing
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2015, 48 (01): : 87 - 94
  • [5] Semi-Coupled Dictionary Learning With Relaxation Label Space Transformation for Video-Based Person Re-Identification
    Sun, Lingchuan
    Jiang, Zhuqing
    Song, Hongchao
    Lu, Qishuo
    Men, Aidong
    IEEE ACCESS, 2018, 6 : 12587 - 12597
  • [6] Enhancing Children's Short Utterance Based ASV Using Data Augmentation Techniques and Feature Concatenation Approach
    Aziz, Shahid
    Shahnawazuddin, Syed
    SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 380 - 394