A Note Based Query By Humming System using Convolutional Neural Network

被引:1
|
作者
Mostafa, Naziba [1 ]
Fung, Pascale [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
关键词
query by humming; humming transcription; CNN; raw audio;
D O I
10.21437/Interspeech.2017-1590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a note-based query by humming (QBH) system with Hidden Markov Model (HMM) and Convolutional Neural Network (CNN) since note-based systems are much more efficient than the traditional frame-based systems. A note-based QBH system has two main components: humming transcription and candidate melody retrieval. For humming transcription, we are the first to use a hybrid model using HMM and CNN. We use CNN for its ability to leam the features directly from raw audio data and for being able to model the locality and variability often present in a note and we use HMM for handling the variability across the time axis. For candidate melody retrieval. we use locality sensitive hashing to narrow down the candidates for retrieval and dynamic time warping and earth mover's distance for the final ranking of the selected candidates. We show that our HMM-CNN humming transcription system outperforms other state of the art humming transcription systems by similar to 2% using the transcription evaluation framework by Molina et. al and our overall query by humming system has a Mean Reciprocal Rank of 0.92 using the standard MIREX dataset, which is higher than other state of the art note-based query by humming systems.
引用
收藏
页码:3102 / 3106
页数:5
相关论文
共 50 条
  • [1] Query by humming system based on score
    Wang, Xiaofeng
    Zhou, Mingquan
    Geng, Guohua
    Guo, Hongbo
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2007, 19 (07): : 941 - 946
  • [2] Query by Humming by Using Locality Sensitive Hashing based on Combination of Pitch and Note
    Wang, Qiang
    Guo, Zhiyuan
    Liu, Gang
    Guo, Jun
    Lu, Yueming
    2012 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2012, : 302 - 307
  • [3] A Fast Query by Humming System Based on Notes
    Yang, Jingzhou
    Liu, Jia
    Zhang, Wei-Qiang
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2898 - 2901
  • [4] An implementation of web based query by humming system
    Chen, Lujia
    Hu, Bao-Gang
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1467 - 1470
  • [5] Query by humming with the vocalsearch system
    Birmingham, William
    Dannenberg, Roger
    Pardo, Bryan
    COMMUNICATIONS OF THE ACM, 2006, 49 (08) : 49 - 52
  • [6] Intelligent Query by Humming System
    Nam, Gi Pyo
    Park, Kang Ryoung
    Lee, Soek-Pil
    Lee, Eui Chul
    Kim, Moo-Young
    Kim, Kichul
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION TECHNOLOGIES & APPLICATIONS (ICUT 2009), 2009, : 480 - +
  • [7] Humming Note Segmentation Method in Query-byHumming
    Zhou, Shaojing
    Zhao, Zhijun
    Shi, Ping
    PROCEEDINGS OF 2016 10TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2016, : 25 - 29
  • [8] A query by humming system based on locality sensitive hashing indexes
    Guo, Zhiyuan
    Wang, Qiang
    Liu, Gang
    Guo, Jun
    SIGNAL PROCESSING, 2013, 93 (08) : 2229 - 2243
  • [9] Voice Recognition Based Security System Using Convolutional Neural Network
    Chandankhede, Pankaj H.
    Titarmare, Abhijit S.
    Chauhvan, Sarang
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, AND INTELLIGENT SYSTEMS (ICCCIS), 2021, : 738 - 743
  • [10] Query Classification Using Convolutional Neural Networks
    Zhang, Hanxiao
    Song, Wei
    Liu, Lizhen
    Du, Chao
    Zhao, Xinlei
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 441 - 444