SEMI-NON-NEGATIVE MATRIX FACTORIZATION USING ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR VOICE CONVERSION

被引:0
|
作者
Aihara, Ryo [1 ]
Takiguchi, Tetsuya [1 ]
Ariki, Yasuo [1 ]
机构
[1] Kobe Univ, Grad Sch Syst Informat, Kobe, Hyogo, Japan
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
NMF; ADMM; Voice Conversion; Speech Synthesis; Sparse Representation; SPARSE REPRESENTATION; ALGORITHMS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice conversion (VC) is being widely researched in the field of speech processing because of increased interest in using such processing in applications such as personalized Text-To-Speech systems. A VC method using Non-negative Matrix Factorization (NMF) has been researched because of its natural sounding voice, however, huge memory usage and high computational times have been reported as problems. We present in this paper a new VC method using Semi-Non-negative Matrix Factorization (Semi-NMF) using the Alternating Direction Method of Multipliers (ADMM) in order to tackle the problems associated with NMF-based VC. Dictionary learning using Semi-NMF can create a compact dictionary, and ADMM enables faster convergence than conventional Semi-NMF. Experimental results show that our proposed method is 76 times faster than conventional NMF, and its conversion quality is almost the same as that of the conventional method.
引用
收藏
页码:5170 / 5174
页数:5
相关论文
共 50 条
  • [1] Alternating Direction Method of Multipliers for Convolutive Non-Negative Matrix Factorization
    Li, Yinan
    Wang, Ruili
    Fang, Yuqiang
    Sun, Meng
    Luo, Zhangkai
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (12) : 7735 - 7748
  • [2] Non-negative Matrix Factorization using Stable Alternating Direction Method of Multipliers for Source Separation
    Zhang, Shaofei
    Huang, Dongyan
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    Dong, Minghui
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 222 - 228
  • [3] Regularized Semi-non-negative Matrix Factorization for Hashing
    Chen, Yong
    Zhang, Hui
    Zhang, Xiaopeng
    Liu, Rui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (07) : 1823 - 1836
  • [4] ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR NON-NEGATIVE MATRIX FACTORIZATION WITH THE BETA-DIVERGENCE
    Sun, Dennis L.
    Fevotte, Cedric
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Speech Enhancement Using Non-negative Matrix Factorization Solved By Improved Alternating Direction Method Of Multipliers
    Qiao, Lin
    Zhang, Xiongwei
    Chen, Xushan
    Yang, Jibin
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 374 - 378
  • [6] Manhattan Nonnegative matrix factorization using the alternating direction method of multipliers
    Cao, Chan
    Tang, Shuyu
    Zhang, Nian
    Dai, Xiangguang
    Zhang, Wei
    Feng, Yuming
    Xiong, Jiang
    Liu, Jinkui
    Thompson, Lara
    2023 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE, ICACI, 2023,
  • [7] Regularized Non-negative Matrix Factorization Using Alternating Direction Method of Multipliers and Its Application to Source Separation
    Zhang, Shaofei
    Huang, Dongyan
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    Dong, Minghui
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1498 - 1502
  • [8] The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization
    Zhang, Qianmin
    Tao, Liang
    Zhou, Jian
    Wang, Huabin
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES FOR RAIL TRANSPORTATION: TRANSPORTATION, 2016, 378 : 259 - 267
  • [9] MULTIMODAL VOICE CONVERSION USING NON-NEGATIVE MATRIX FACTORIZATION IN NOISY ENVIRONMENTS
    Masaka, Kenta
    Aihara, Ryo
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [10] Multimodal voice conversion based on non-negative matrix factorization
    Kenta Masaka
    Ryo Aihara
    Tetsuya Takiguchi
    Yasuo Ariki
    EURASIP Journal on Audio, Speech, and Music Processing, 2015