Exploring the two-dimensional nature of music notation for score recognition with end-to-end approaches

被引:9
|
作者
Rios-Vila, Antonio [1 ]
Calvo-Zaragoza, Jorge [1 ]
Inesta, Jose M. [1 ]
机构
[1] Univ Alicante, UI Comp Res, Alicante, Spain
关键词
D O I
10.1109/ICFHR2020.2020.00044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical Music Recognition workflows perform several steps to retrieve the content in music score images, being symbol recognition one of the key stages. State-of-the-art approaches for this stage currently address the coding of the output symbols as if they were plain text characters. However, music symbols have a two-dimensional nature that is ignored in these approaches. In this paper, we explore alternative output representations to perform music symbol recognition with state-of-the-art end-to-end neural technologies. We propose and describe new output representations which take into account the mentioned two-dimensional nature. We seek answers to the question of whether it is possible to obtain better recognition results in both printed and handwritten music scores. In this analysis, we compare the results given using three output encodings and two neural approaches. We found that one of the proposed encodings outperforms the results obtained by the standard one. This permits us to conclude that it is interesting to keep researching on this topic to improve end-to-end music score recognition.
引用
收藏
页码:193 / 198
页数:6
相关论文
共 50 条
  • [1] Decoupling music notation to improve end-to-end Optical Music Recognition
    Alfaro-Contreras, Maria
    Rios-Vila, Antonio
    Valero-Mas, Jose J.
    Inesta, Jose M.
    Calvo-Zaragoza, Jorge
    [J]. PATTERN RECOGNITION LETTERS, 2022, 158 : 157 - 163
  • [2] Exploiting the Two-Dimensional Nature of Agnostic Music Notation for Neural Optical Music Recognition
    Alfaro-Contreras, Maria
    Valero-Mas, Jose J.
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [3] Exploring Two Approaches for an End-to-End Scientific Analysis Workflow
    Dodelson, Scott
    Kent, Steve
    Kowalkowski, Jim
    Paterno, Marc
    Sehrish, Saba
    [J]. 21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [4] End-to-end optical music recognition for pianoform sheet music
    Rios-Vila, Antonio
    Rizo, David
    Inesta, Jose M.
    Calvo-Zaragoza, Jorge
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (03) : 347 - 362
  • [5] End-to-end optical music recognition for pianoform sheet music
    Antonio Ríos-Vila
    David Rizo
    José M. Iñesta
    Jorge Calvo-Zaragoza
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 347 - 362
  • [6] Data Augmentation for End-to-End Optical Music Recognition
    Lopez-Gutierrez, Juan C.
    Valero-Mas, Jose J.
    Castellanos, Francisco J.
    Calvo-Zaragoza, Jorge
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 59 - 73
  • [7] End-to-end Music-mixed Speech Recognition
    Woo, Jeongwoo
    Mimura, Masato
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 800 - 804
  • [8] On the Use of Transformers for End-to-End Optical Music Recognition
    Rios-Vila, Antonio
    Inesta, Jose M.
    Calvo-Zaragoza, Jorge
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022), 2022, 13256 : 470 - 481
  • [9] EXPLORING NEURAL TRANSDUCERS FOR END-TO-END SPEECH RECOGNITION
    Battenberg, Eric
    Chen, Jitong
    Child, Rewon
    Coates, Adam
    Gaur, Yashesh
    Li, Yi
    Liu, Hairong
    Satheesh, Sanjeev
    Sriram, Anuroop
    Zhu, Zhenyao
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 206 - 213
  • [10] A Two-Dimensional Architecture for End-to-End Resource Management in Virtual Network Environments
    Wang, Ning
    Zhang, Yan
    Serrat, Joan
    Luis Gorricho, Juan
    Guo, Tao
    Hu, Zheng
    Zhang, Ping
    [J]. IEEE NETWORK, 2012, 26 (05): : 8 - 14