A Note Based Query By Humming System using Convolutional Neural Network

被引:1
|
作者
Mostafa, Naziba [1 ]
Fung, Pascale [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
关键词
query by humming; humming transcription; CNN; raw audio;
D O I
10.21437/Interspeech.2017-1590
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a note-based query by humming (QBH) system with Hidden Markov Model (HMM) and Convolutional Neural Network (CNN) since note-based systems are much more efficient than the traditional frame-based systems. A note-based QBH system has two main components: humming transcription and candidate melody retrieval. For humming transcription, we are the first to use a hybrid model using HMM and CNN. We use CNN for its ability to leam the features directly from raw audio data and for being able to model the locality and variability often present in a note and we use HMM for handling the variability across the time axis. For candidate melody retrieval. we use locality sensitive hashing to narrow down the candidates for retrieval and dynamic time warping and earth mover's distance for the final ranking of the selected candidates. We show that our HMM-CNN humming transcription system outperforms other state of the art humming transcription systems by similar to 2% using the transcription evaluation framework by Molina et. al and our overall query by humming system has a Mean Reciprocal Rank of 0.92 using the standard MIREX dataset, which is higher than other state of the art note-based query by humming systems.
引用
收藏
页码:3102 / 3106
页数:5
相关论文
共 50 条
  • [41] Smart staff attendance system using Convolutional Neural Network
    Natesan, P.
    Gothai, E.
    Rajalaxmi, R. R.
    Karthikeyan, K. V. Mohana
    Muthukumar, V
    Naveen, R. M.
    2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [42] Arabic handwriting recognition system using convolutional neural network
    Najwa Altwaijry
    Isra Al-Turaiki
    Neural Computing and Applications, 2021, 33 : 2249 - 2261
  • [43] Query by Humming System Through Multiscale Music Entropy
    Nagavi, Trisiladevi C.
    Bhajantri, Nagappa U.
    INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 139 - 150
  • [44] Arabic handwriting recognition system using convolutional neural network
    Altwaijry, Najwa
    Al-Turaiki, Isra
    Neural Computing and Applications, 2021, 33 (07): : 2249 - 2261
  • [45] Autonomous football exercise system based on convolutional neural network
    Hang, Xiaochuan
    Cao, Dan
    REVISTA INTERNACIONAL DE MEDICINA Y CIENCIAS DE LA ACTIVIDAD FISICA Y DEL DEPORTE, 2022, 22 (85): : 231 - 249
  • [46] Design of Face Recognition System Based on Convolutional Neural Network
    Tao, Kezhu
    He, Yonglu
    Chen, Caihong
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5403 - 5406
  • [47] Automatic Hospital Inspection System Based on Convolutional Neural Network
    Ao, Bangqian
    Lin, Yuan
    Gao, Zhiwu
    Han, Ye
    Zhang, Nanqing
    Proceedings - 2022 International Conference on Information Technology, Communication Ecosystem and Management, ITCEM 2022, 2022, : 51 - 55
  • [48] Smart Parking System Based on Convolutional Neural Network Models
    Zhang, Wenjin
    Yan, Jason
    Yu, Cui
    2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 561 - 566
  • [49] Convolutional Neural Network-based UWB System Localization
    Doan Tan Anh Nguyen
    Lee, Han-Gyeol
    Joung, Jingon
    Jeong, Eui-Rim
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 488 - 490
  • [50] Reaction diffusion system prediction based on convolutional neural network
    Li, Angran
    Chen, Ruijia
    Farimani, Amir Barati
    Zhang, Yongjie Jessica
    SCIENTIFIC REPORTS, 2020, 10 (01)