Separation of voice and music by harmonic structure stability analysis

Cited by: 0

Authors:

Zhang, YG [1]

Zhang, CS [1]

Affiliation:

[1] Tsing Hua Univ, Dept Automat, Beijing 100084, Peoples R China

Source:

2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2 | 2005

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Separating voice from music is an interesting but difficult problem, and it is useful for related research areas such as audio content analysis. This paper studies the difference between voice and music signals and proposes that harmonic structure stability is the key difference between them. A separation algorithm based on this idea is presented: it learns the average harmonic structure of the music and then uses that structure to distinguish voice harmonic structures from music harmonic structures. Experimental results show that the algorithm can separate mixed signals, achieving not only a very high signal-to-noise ratio (SNR) but also rather good subjective audio quality.
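
The record gives no algorithmic detail, so the following is only a toy sketch of the idea the abstract names, not the authors' method. It assumes that per-frame harmonic amplitudes have already been extracted, treats the loudness-normalized amplitude vector as the "harmonic structure", and uses synthetic data. All function names and the threshold value are hypothetical.

```python
import numpy as np

def normalize_structures(amps):
    """Scale each frame's harmonic amplitudes to unit Euclidean norm."""
    amps = np.asarray(amps, dtype=float)
    norms = np.linalg.norm(amps, axis=1, keepdims=True)
    return amps / np.maximum(norms, 1e-12)

def average_structure(amps):
    """Mean normalized harmonic structure over all frames."""
    return normalize_structures(amps).mean(axis=0)

def instability(amps):
    """Mean distance of each frame's structure from the average (0 = stable)."""
    s = normalize_structures(amps)
    return float(np.linalg.norm(s - s.mean(axis=0), axis=1).mean())

def matches_music(amps, music_avg, threshold=0.2):
    """Per-frame flag: True where the structure lies near the learned average.

    The threshold is an arbitrary illustrative value.
    """
    s = normalize_structures(amps)
    return np.linalg.norm(s - music_avg, axis=1) < threshold

# Synthetic stand-ins (assumptions, not data from the paper):
rng = np.random.default_rng(0)
base = np.array([1.0, 0.5, 0.25, 0.125])         # fixed harmonic shape
loudness = rng.uniform(0.5, 2.0, size=(50, 1))   # loudness varies per frame
music = loudness * base                          # same shape in every frame
voice = rng.uniform(0.1, 1.0, size=(50, 4))      # shape changes per frame

music_avg = average_structure(music)
print("music instability:", instability(music))
print("voice instability:", instability(voice))
print("all music frames match:", bool(matches_music(music, music_avg).all()))
```

In this toy setting the "music" frames share one harmonic shape, so their instability is essentially zero, while the "voice" frames drift away from any single average. A real system would also need pitch tracking and harmonic extraction, which are outside the scope of this sketch.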


Pages: 562-565

Number of pages: 4

