Cover Song Recognition Based on MPEG-7 Audio Features

被引:0
|
作者
Ponighzwa, Mochammad Faris [1 ]
Sarno, R. Riyanarto [1 ]
Sunaryono, Dwi [1 ]
机构
[1] Inst Teknol Sepuluh Nopember, Dept Informat, Surabaya, Indonesia
关键词
cover song recognition; MPEG-7; KNN;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Lately, song industry has developed rapidly throughout the world. In the past, there were many applications which used song as their main themes, such as Shazam and Sound hound. Shazam and Sound hound could identify a song based on recorded one through the application. These applications work by matching the recorded song with an original song in the database. However, matching process is only based on the particular part of the spectrogram instead of an entire song's spectrogram. The disadvantages of this method arise though. This application could only identify the recorded original song. When application recorded a cover song, it cannot identify the title of the original song's since the spectrogram of a cover performance's and its original song's is entirely different. This paper exists to discuss how to recognize a cover song based on MPEG-7 standard ISO. KNN was used as classification method and combined with Audio Spectrum Projection and Audio Spectrum Flatness feature from MPEG-7 extraction. The result from this method identifies an original song from recorded cover of the original one. Result for experiment in this paper is about 75-80%, depends on testing data; whether the testing data is a dominant vocal song or dominant instrument song.
引用
收藏
页码:59 / 65
页数:7
相关论文
共 50 条
  • [21] Classification of Music Mood Using MPEG-7 Audio Features and SVM with Confidence Interval
    Sarno, Riyanarto
    Ridoean, Johanes Andre
    Sunaryono, Dwi
    Wijaya, Dedy Rahman
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (05)
  • [22] Using MPEG-7 audio descriptors for music querying
    Gruhne, M.
    Dittmar, C.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIX, 2006, 6312
  • [23] Audio thumbnailing using MPEG-7 low level audio descriptors
    Wellhausen, J
    Höynck, M
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 65 - 73
  • [24] Temporal audio segmentation using MPEG-7 descriptors
    Wellhausen, J
    Crysandt, H
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 : 380 - 387
  • [25] MPEG-7 sound-recognition tools
    Casey, M
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (06) : 737 - 747
  • [26] How good are the visual MPEG-7 features?
    Eidenberger, H
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 : 476 - 488
  • [27] MPEG-7
    Department of Engineering Informatics, Faculty of Information and Communication Engineering, Osaka Electro-communication University, Osaka, Japan
    Kyokai Joho Imeji Zasshi, 2007, 2 (176-178):
  • [28] Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features
    Muhammad, Ghulam
    Melhem, Moutasem
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3594 - 3598
  • [29] MPEG-7
    Harald Kosch
    Jörg Heuer
    Informatik-Spektrum, 2003, 26 (2) : 105 - 107
  • [30] The MPEG-7 Multimedia Database System (MPEG-7 MMDB)
    Doeller, Mario
    Kosch, Harald
    JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (09) : 1559 - 1580