An ensemble of deep transfer learning models for handwritten music symbol recognition

被引:0
|
作者
Ashis Paul
Rishav Pramanik
Samir Malakar
Ram Sarkar
机构
[1] Jadavpur University,Department of Computer Science and Engineering
[2] Asutosh College,Department of Computer Science
来源
关键词
Music symbol recognition; Ensemble learning; Support vector machine; HOMUS dataset;
D O I
暂无
中图分类号
学科分类号
摘要
In ancient times, there was no system to record or document music. A basic notation system to write European music was formulated around 14th century in the Baroque period which slowly evolved into the standard notation system that we have today. Later, the musical pieces from the classical and post-classical period of European music were documented as scores using this standard European staff notations. These notations are used by most of the modern genres of music due to their versatility. Hence, it is very important to develop a method that can store such music sheets containing handwritten music scores digitally. Optical music recognition (OMR) is a system that automatically interprets the scanned handwritten music scores. In this work, we have proposed a classifier ensemble of deep transfer learning models with support vector machine (SVM) as the aggregator for handwritten music symbol recognition. We have applied three pre-trained deep learning models, namely ResNet50, GoogleNet and DenseNet161 (each trained on ImageNet), and fine-tuned on our target datasets i.e., music symbol image datasets. The proposed ensemble technique can capture a more complex association of the base classifiers, thus improving the overall performance. We have evaluated the proposed model on five publicly available standard datasets, namely Handwritten Online Music Symbols (HOMUS), Capitan_Score_Uniform, Capitan_Score_Non-uniform, Rebelo_real and Fornés, and achieved state-of-the-art results for all these datasets. Additionally, we have evaluated our model on publicly available two non-music symbols datasets, namely CMATERdb 2.1.2 containing 120 handwritten Bangla city names and CMATERdb 3.1.1 dataset containing handwritten Bangla numerals to validate its effectiveness on diversified datasets. The source code of this present work is available at https://github.com/ashis0013/Music-Symbol-Recognition.
引用
收藏
页码:10409 / 10427
页数:18
相关论文
共 50 条
  • [21] Handwritten Text Recognition using Deep Learning
    Nikitha, A.
    Geetha, J.
    JayaLakshmi, D. S.
    [J]. 2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS ON ELECTRONICS, INFORMATION, COMMUNICATION & TECHNOLOGY (RTEICT-2020), 2020, : 388 - 392
  • [22] Handwritten digits recognition using transfer learning
    Azawi, Nidhal
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 106
  • [23] Learning Symbol Relation Tree for Online Handwritten Mathematical Expression Recognition
    Thanh-Nghia Truong
    Hung Tuan Nguyen
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    [J]. PATTERN RECOGNITION, ACPR 2021, PT II, 2022, 13189 : 307 - 321
  • [24] Ensemble of Deep Models for Event Recognition
    Ahmad, Kashif
    Mekhalfi, Mohamed Lamine
    Conci, Nicola
    Melgani, Farid
    De Natale, Francesco
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (02)
  • [25] Multilingual handwritten numeral recognition using a robust deep network joint with transfer learning
    Fateh, Amirreza
    Fateh, Mansoor
    Abolghasemi, Vahid
    [J]. INFORMATION SCIENCES, 2021, 581 : 479 - 494
  • [26] Enhancing Marathi Handwritten Character Recognition Using Ensemble Learning
    Chikmurge, Diptee Vishwanath
    Raghunathan, Shriram
    [J]. TRAITEMENT DU SIGNAL, 2023, 40 (01) : 327 - 334
  • [27] Deep Learning Ensemble for Melanoma Recognition
    Osowski, Stanislaw
    Les, Tomasz
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [28] Music Genre Recognition using Deep Neural Networks and Transfer Learning
    Ghosal, Deepanway
    Kolekar, Maheshkumar H.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2087 - 2091
  • [29] Bangla Handwritten Digit Recognition Approach with an Ensemble of Deep Residual Networks
    Mamun, Mamunur Rahaman
    Al Nazi, Zabir
    Yusuf, Md. Salah Uddin
    [J]. 2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [30] Chemical Symbol Feature Set for Handwritten Chemical Symbol Recognition
    Tang, Peng
    Hui, Siu Cheung
    Fu, Chi-Wing
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2014, 8621 : 312 - 322