Separation of voice and music by harmonic structure stability analysis

被引:0
|
作者
Zhang, YG [1 ]
Zhang, CS [1 ]
机构
[1] Tsing Hua Univ, Dept Automat, Beijing 100084, Peoples R China
来源
2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2 | 2005年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Separation of voice and music is an interesting but difficult problem. It is useful for many other researches such as audio content analysis. In this paper, the difference between voice and music signals is carefully studied. It is proposed that the Harmonic Structure Stability is the key difference between them. A separation algorithm based on this theory is proposed. The main idea is to learn the average harmonic structure of the music, and then separate signals by using it to distinguish voice and music harmonic structures. Experimental results show that the algorithm can separate mixed signals and obtains not only a very high Signal-to-Noise Ratio (SNR) but also a rather good subjective audio quality.
引用
收藏
页码:562 / 565
页数:4
相关论文
共 50 条
  • [31] Harmonic Structure Predicts the Enjoyment of Uplifting Trance Music
    Agres, Kat
    Herremans, Dorien
    Bigo, Louis
    Conklin, Darrell
    FRONTIERS IN PSYCHOLOGY, 2017, 7
  • [32] Neural networks for harmonic structure in music perception and action
    Bianco, R.
    Novembre, G.
    Keller, P. E.
    Kim, Seung-Goo
    Scharf, F.
    Friederici, A. D.
    Villringer, A.
    Sammler, D.
    NEUROIMAGE, 2016, 142 : 444 - 454
  • [33] Automatic music transcription based on harmonic structure information
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    不详
    Jisuanji Yanjiu yu Fazhan, 2006, 12 (2187-2192):
  • [34] Singing Voice Analysis Using Relative Harmonic Delays
    Sousa, Ricardo
    Ferreira, Anibal
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2008 - 2011
  • [35] REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation
    Rafii, Zafar
    Pardo, Bryan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 71 - 82
  • [36] A Tandem Algorithm for Singing Pitch Extraction and Voice Separation From Music Accompaniment
    Hsu, Chao-Ling
    Wang, DeLiang
    Jang, Jyh-Shing Roger
    Hu, Ke
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1482 - 1491
  • [37] Separation of Singing Voice from Music Accompaniment using Matrix Factorization Method
    Burute, Harshada
    Mane, P. B.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 166 - 171
  • [38] Music ability, quantitative EEG and acoustic voice analysis
    Viana, Wackermann P.
    2ND INTERNATIONAL CONGRESS ON NEUROBIOLOGY, PSYCHOPHARMACOLOGY & TREATMENT GUIDANCE, 2012, : 51 - 57
  • [39] Spectro-Temporal Modeling of Harmonic Magnitude Tracks for Music Source Separation
    Gunawan, David
    Sen, D.
    2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 190 - 195
  • [40] Music Genre Recognition Using Spectrograms with Harmonic-Percussive Sound Separation
    Aguiar, Rafael de Lima
    da Costa, Yandre Maldonado e Gomes
    Nanni, Loris
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,