Separation of voice and music by harmonic structure stability analysis

Authors
Zhang, YG [1]
Zhang, CS [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Separating voice from music is an interesting but difficult problem, and it is useful for other research areas such as audio content analysis. This paper studies the difference between voice and music signals and proposes that harmonic structure stability is the key difference between them. A separation algorithm based on this idea is presented: it learns the average harmonic structure of the music and then uses it to distinguish voice harmonic structures from music harmonic structures when separating the signals. Experimental results show that the algorithm can separate mixed signals, achieving not only a very high signal-to-noise ratio (SNR) but also rather good subjective audio quality.
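The abstract does not define "harmonic structure" or give the algorithm, so the following is only a minimal sketch of the stability notion under stated assumptions, not the authors' method. It assumes the harmonic structure of a frame is the vector of harmonic amplitudes in dB relative to the fundamental, that the fundamental frequency is already known, and that instability is the mean distance of each frame's structure from the average structure. The signals, the constants `SR`, `N`, `H`, and all function names are hypothetical. No separation step is implemented.

```python
import numpy as np

SR = 16000  # sample rate in Hz (assumed)
N = 2048    # frame length in samples (assumed)
H = 5       # number of harmonics tracked (assumed)

def synth_frame(f0, amps):
    """Synthesize one frame as a sum of harmonics of f0 with the given amplitudes."""
    t = np.arange(N) / SR
    return sum(a * np.sin(2 * np.pi * f0 * (h + 1) * t) for h, a in enumerate(amps))

def harmonic_structure(frame, f0):
    """Harmonic amplitudes in dB relative to the fundamental, for a known f0."""
    spec = np.abs(np.fft.rfft(frame * np.hanning(N)))
    bins = np.round(np.arange(1, H + 1) * f0 * N / SR).astype(int)
    # Take the largest magnitude within one bin of each expected harmonic.
    mags = np.array([spec[b - 1:b + 2].max() for b in bins])
    return 20 * np.log10(mags / mags[0])

def instability(structures):
    """Mean Euclidean distance (dB) of each frame's structure from the average."""
    avg = structures.mean(axis=0)  # the "average harmonic structure"
    return float(np.mean(np.linalg.norm(structures - avg, axis=1)))

pitches = [220, 247, 262, 294, 330, 349, 392, 440]  # Hz, one per frame
base = np.array([1.0, 0.5, 0.3, 0.2, 0.1])

# Toy "music": the pitch changes per frame, but harmonic amplitudes stay fixed.
music = np.array([harmonic_structure(synth_frame(f, base), f) for f in pitches])

# Toy "voice": the spectral tilt changes by k dB per harmonic in frame k.
voice = np.array([
    harmonic_structure(synth_frame(f, base * 10 ** (-k * np.arange(H) / 20)), f)
    for k, f in enumerate(pitches)
])

music_score = instability(music)
voice_score = instability(voice)
print(f"music instability: {music_score:.2f} dB, voice instability: {voice_score:.2f} dB")
```

In this toy setup the fixed-timbre source stays close to its average structure while the varying-timbre source does not, which is the kind of contrast a stability-based criterion could exploit.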
Pages: 562-565
Page count: 4
Related papers
  • [1] Study of Indian Classical Music by Singing Voice Analysis and Music Source Separation
    Ghisingh, Seema
    Sharma, Shivam
    Mittal, Vinay Kumar
    2017 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND NETWORKS (TEL-NET), 2017: 133-138
  • [2] ADAPTIVE FILTERING FOR MUSIC/VOICE SEPARATION EXPLOITING THE REPEATING MUSICAL STRUCTURE
    Liutkus, Antoine
    Rafii, Zafar
    Badeau, Roland
    Pardo, Bryan
    Richard, Gael
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012: 53-56
  • [3] Harmonic peaks method for voice separation
    Zhang, B
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998: 678-681
  • [4] THE REPRESENTATION OF HARMONIC STRUCTURE IN MUSIC - HIERARCHIES OF STABILITY AS A FUNCTION OF CONTEXT
    BHARUCHA, J
    KRUMHANSL, CL
    COGNITION, 1983, 13 (01): 63-102
  • [5] Single Channel Music Source Separation Based on Harmonic Structure Estimation
    Wang, Dongmei
    Huang, Qinghua
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009: 848-851
  • [6] A SIMPLE MUSIC/VOICE SEPARATION METHOD BASED ON THE EXTRACTION OF THE REPEATING MUSICAL STRUCTURE
    Rafii, Zafar
    Pardo, Bryan
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011: 221-224
  • [7] JOINT SINGING PITCH ESTIMATION AND VOICE SEPARATION BASED ON A NEURAL HARMONIC STRUCTURE RENDERER
    Nakano, Tomoyasu
    Yoshii, Kazuyoshi
    Wu, Yiming
    Nishikimi, Ryo
    Lin, Kin Wah Edward
    Goto, Masataka
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019: 160-164
  • [8] Music/voice separation based on the multi-repeating structure of Mel cepstrum coefficient
    ZHANG Tianqi
    XU Xin
    WU Wangjun
    LIU Yu
    Chinese Journal of Acoustics, 2015, 34 (04): 424-435
  • [9] Unsupervised single-channel music source separation by average harmonic structure modeling
    Duan, Zhiyao
    Zhang, Yungang
    Zhang, Changshui
    Shi, Zhenwei
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (04): 766-778
  • [10] Singing Voice Separation in Mono-Channel Music
    Chanrungutai, Angkana
    Ratanamahatana, Chotirat Ann
    2008 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, 2008: 256-261