A Note Based Query By Humming System using Convolutional Neural Network

被引:1
|
作者
Mostafa, Naziba [1 ]
Fung, Pascale [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
关键词
query by humming; humming transcription; CNN; raw audio;
D O I
10.21437/Interspeech.2017-1590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a note-based query by humming (QBH) system with Hidden Markov Model (HMM) and Convolutional Neural Network (CNN) since note-based systems are much more efficient than the traditional frame-based systems. A note-based QBH system has two main components: humming transcription and candidate melody retrieval. For humming transcription, we are the first to use a hybrid model using HMM and CNN. We use CNN for its ability to leam the features directly from raw audio data and for being able to model the locality and variability often present in a note and we use HMM for handling the variability across the time axis. For candidate melody retrieval. we use locality sensitive hashing to narrow down the candidates for retrieval and dynamic time warping and earth mover's distance for the final ranking of the selected candidates. We show that our HMM-CNN humming transcription system outperforms other state of the art humming transcription systems by similar to 2% using the transcription evaluation framework by Molina et. al and our overall query by humming system has a Mean Reciprocal Rank of 0.92 using the standard MIREX dataset, which is higher than other state of the art note-based query by humming systems.
引用
收藏
页码:3102 / 3106
页数:5
相关论文
共 50 条
  • [31] WEB-BASED CATARACT DETECTION SYSTEM USING DEEP CONVOLUTIONAL NEURAL NETWORK
    Yusuf, Musa
    Theophilous, Samuel
    Adejoke, Jadesola
    Hassan, Annah B.
    2019 2ND INTERNATIONAL CONFERENCE OF THE IEEE NIGERIA COMPUTER CHAPTER (NIGERIACOMPUTCONF), 2019, : 102 - 108
  • [32] Intelligent query by humming system based on score level fusion of multiple classifiers
    Gi Pyo Nam
    Thi Thu Trang Luong
    Hyun Ha Nam
    Park, Kang Ryoung
    Park, Sung-Joo
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
  • [33] A peer-to-peer music sharing system based on query-by-humming
    Wang, Jianrong
    Chang, Xinglong
    Zhao, Zheng
    Zhang, Yebin
    Shi, Qingwei
    NEXT-GENERATION COMMUNICATION AND SENSOR NETWORKS 2007, 2007, 6773
  • [34] A Spectrogram Image-Based Network Anomaly Detection System Using Deep Convolutional Neural Network
    Khan, Adnan Shahid
    Ahmad, Zeeshan
    Abdullah, Johari
    Ahmad, Farhan
    IEEE ACCESS, 2021, 9 : 87079 - 87093
  • [35] Retraction Note: An effective disease prediction system using incremental feature selection and temporal convolutional neural network
    S. Sandhiya
    U. Palani
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (Suppl 1) : 213 - 213
  • [36] Arabic handwriting recognition system using convolutional neural network
    Altwaijry, Najwa
    Al-Turaiki, Isra
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (07): : 2249 - 2261
  • [37] Intelligent query by humming system based on score level fusion of multiple classifiers
    Gi Pyo Nam
    Thi Thu Trang Luong
    Hyun Ha Nam
    Kang Ryoung Park
    Sung-Joo Park
    EURASIP Journal on Advances in Signal Processing, 2011
  • [38] Face Mask Detection System using Convolutional Neural Network
    Ibrahim, Alaa Adham
    Hashim, Yara Arjuman
    Omer, Truska Mustafa
    Ahmed, Rebin M.
    2022 8TH INTERNATIONAL ENGINEERING CONFERENCE ON SUSTAINABLE TECHNOLOGY AND DEVELOPMENT (IEC), 2022, : 7 - 11
  • [39] An Intelligent Smart Parking System Using Convolutional Neural Network
    Alsheikhy, Ahmed A.
    Shawly, Tawfeeq
    Said, Yahia F.
    Lahza, Husam
    JOURNAL OF SENSORS, 2022, 2022
  • [40] Intrusion Detection System Using Hybrid Convolutional Neural Network
    Samha, Amani K.
    Malik, Nidhi
    Sharma, Deepak
    Kavitha, S.
    Dutta, Papiya
    MOBILE NETWORKS & APPLICATIONS, 2023,