PHRASE-BASED RAGA RECOGNITION USING VECTOR SPACE MODELING

被引:0
|
作者
Gulati, Sankalp [1 ]
Serra, Joan [2 ]
Ishwar, Vignesh [1 ]
Senturk, Sertan [1 ]
Serra, Xavier [1 ]
机构
[1] Univ Pompeu Fabra, Mus Technol Grp, Barcelona, Spain
[2] Telefon Res, Barcelona, Spain
关键词
Raga recognition; raga motifs; melodic phrases; Indian art music; Carnatic music;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic raga recognition is one of the fundamental computational tasks in Indian art music. Motivated by the way seasoned listeners identify ragas, we propose a raga recognition approach based on melodic phrases. Firstly, we extract melodic patterns from a collection of audio recordings in an unsupervised way. Next, we group similar patterns by exploiting complex networks concepts and techniques. Drawing an analogy to topic modeling in text classification, we then represent audio recordings using a vector space model. Finally, we employ a number of classification strategies to build a predictive model for raga recognition. To evaluate our approach, we compile a music collection of over 124 hours, comprising 480 recordings and 40 ragas. We obtain 70% accuracy with the full 40-raga collection, and up to 92% accuracy with its 10-raga subset. We show that phrase-based raga recognition is a successful strategy, on par with the state of the art, and sometimes outperforms it. A by-product of our approach, which arguably is as important as the task of raga recognition, is the identification of raga-phrases. These phrases can be used as a dictionary of semantically-meaningful melodic units for several computational tasks in Indian art music.
引用
收藏
页码:66 / 70
页数:5
相关论文
共 50 条
  • [1] A vector-space dynamic feature for phrase-based statistical machine translation
    Costa-jussa, Marta R.
    Banchs, Rafael E.
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2011, 37 (02) : 139 - 154
  • [2] A vector-space dynamic feature for phrase-based statistical machine translation
    Marta R. Costa-jussà
    Rafael E. Banchs
    [J]. Journal of Intelligent Information Systems, 2011, 37 : 139 - 154
  • [3] Development of a Phrase-Based Speech-Recognition Test Using Synthetic Speech
    Ibelings, Saskia
    Brand, Thomas
    Ruigendijk, Esther
    Holube, Inga
    [J]. TRENDS IN HEARING, 2024, 28
  • [4] Using Prosodic Phrase-Based VQVAE on Audio ALBERT for Speech Emotion Recognition
    Hsu, Jia-Hao
    Wu, Chung-Hsien
    Yang, Tsung-Hsien
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 415 - 419
  • [5] Free-text medical document retrieval via phrase-based vector space model
    Mao, WL
    Chu, WW
    [J]. AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 489 - 493
  • [6] Leveraging External Knowledge for Phrase-based Topic Modeling
    Xu, Mingyang
    Yang, Ruixin
    Ranshous, Stephen
    Li, Shijie
    Samatova, Nagiza F.
    [J]. 2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 29 - 32
  • [7] Improving semistatic compression via phrase-based modeling
    Brisaboa, Nieves R.
    Farina, Antonio
    Navarro, Gonzalo
    Parama, Jose R.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (04) : 545 - 559
  • [8] The phrase-based vector space model for automatic retrieval of free-text medical documents
    Mao, Wenlei
    Chu, Wesley W.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 61 (01) : 76 - 92
  • [9] Phrase-based correction model for improving handwriting recognition accuracies
    Farooq, Faisal
    Jose, Damien
    Govindaraju, Venu
    [J]. PATTERN RECOGNITION, 2009, 42 (12) : 3271 - 3277
  • [10] Statistical phrase-based translation
    Koehn, P
    Och, FJ
    Marcu, D
    [J]. HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2003, : 127 - 133