Cover Song Recognition Based on MPEG-7 Audio Features

被引：0

作者：

Ponighzwa, Mochammad Faris ^{[1
]}

Sarno, R. Riyanarto ^{[1
]}

Sunaryono, Dwi ^{[1
]}

机构：

[1] Inst Teknol Sepuluh Nopember, Dept Informat, Surabaya, Indonesia

来源：

2017 3RD INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH) | 2017年

关键词：

cover song recognition; MPEG-7; KNN;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Lately, song industry has developed rapidly throughout the world. In the past, there were many applications which used song as their main themes, such as Shazam and Sound hound. Shazam and Sound hound could identify a song based on recorded one through the application. These applications work by matching the recorded song with an original song in the database. However, matching process is only based on the particular part of the spectrogram instead of an entire song's spectrogram. The disadvantages of this method arise though. This application could only identify the recorded original song. When application recorded a cover song, it cannot identify the title of the original song's since the spectrogram of a cover performance's and its original song's is entirely different. This paper exists to discuss how to recognize a cover song based on MPEG-7 standard ISO. KNN was used as classification method and combined with Audio Spectrum Projection and Audio Spectrum Flatness feature from MPEG-7 extraction. The result from this method identifies an original song from recorded cover of the original one. Result for experiment in this paper is about 75-80%, depends on testing data; whether the testing data is a dominant vocal song or dominant instrument song.

引用

页码：59 / 65

页数：7

共 50 条

[21] Classification of Music Mood Using MPEG-7 Audio Features and SVM with Confidence Interval
Sarno, Riyanarto
Ridoean, Johanes Andre
Sunaryono, Dwi
Wijaya, Dedy Rahman
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (05)
[22] Using MPEG-7 audio descriptors for music querying
Gruhne, M.
Dittmar, C.
APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIX, 2006, 6312
[23] Audio thumbnailing using MPEG-7 low level audio descriptors
Wellhausen, J
Höynck, M
INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 65 - 73
[24] Temporal audio segmentation using MPEG-7 descriptors
Wellhausen, J
Crysandt, H
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 : 380 - 387
[25] MPEG-7 sound-recognition tools
Casey, M
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (06) : 737 - 747
[26] How good are the visual MPEG-7 features?
Eidenberger, H
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 : 476 - 488
[27] MPEG-7
Department of Engineering Informatics, Faculty of Information and Communication Engineering, Osaka Electro-communication University, Osaka, Japan
Kyokai Joho Imeji Zasshi, 2007, 2 (176-178):
[28] Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features
Muhammad, Ghulam
Melhem, Moutasem
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3594 - 3598
[29] MPEG-7
Harald Kosch
Jörg Heuer
Informatik-Spektrum, 2003, 26 (2) : 105 - 107
[30] The MPEG-7 Multimedia Database System (MPEG-7 MMDB)
Doeller, Mario
Kosch, Harald
JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (09) : 1559 - 1580

← 1 2 3 4 5 →