A cross-modal method of labeling music tags

被引:0
|
作者
Jia-Lien Hsu
Yen-Fu Li
机构
[1] Fu Jen Catholic University,Department of Computer Science and Information Engineering
来源
关键词
Cross-modal; (MFCCs); (GMM); Tag labeling; Music information retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we discuss various features of music objects in two kinds of domain. Among these features, Mel-frequency cepstral coefficients (MFCCs) are further discussed and described by Gaussian mixture model (GMM). Also, the similarity between GMMs are investigated accordingly. Then, we employ the multimedia graph as a cross-modal method to associate MFCCs and genre tags of music objects. By applying link analysis algorithm in the graph, we label appropriate genre tags for target music objects. Also, we perform experiments to show performance, effectiveness, and parameter setting of our approach.
引用
收藏
页码:521 / 541
页数:20
相关论文
共 50 条
  • [1] A cross-modal method of labeling music tags
    Hsu, Jia-Lien
    Li, Yen-Fu
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (03) : 521 - 541
  • [2] A cross-modal crowd counting method combining CNN and cross-modal transformer
    Zhang, Shihui
    Wang, Wei
    Zhao, Weibo
    Wang, Lei
    Li, Qunpeng
    [J]. IMAGE AND VISION COMPUTING, 2023, 129
  • [3] Composing with Cross-modal Correspondences: Music and Odors in Concert
    Crisinel, Anne-Sylvie
    Jacquier, Caroline
    Deroy, Ophelia
    Spence, Charles
    [J]. CHEMOSENSORY PERCEPTION, 2013, 6 (01) : 45 - 52
  • [4] Cross-Modal Descriptions of Music in the Latin Middle Ages
    Hentschel, Frank
    [J]. ARCHIV FUR MUSIKWISSENSCHAFT, 2023, 80 (04): : 252 - 287
  • [5] The effect of expertise in music reading: cross-modal competence
    Drai-Zerbib, Veronique
    Baccino, Thierry
    [J]. JOURNAL OF EYE MOVEMENT RESEARCH, 2013, 6 (05):
  • [6] LEARNING CONTEXTUAL TAG EMBEDDINGS FOR CROSS-MODAL ALIGNMENT OF AUDIO AND TAGS
    Favory, Xavier
    Drossos, Konstantinos
    Virtanen, Tuomas
    Serra, Xavier
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 596 - 600
  • [7] MOSA: Music Motion With Semantic Annotation Dataset for Cross-Modal Music Processing
    Huang, Yu-Fen
    Moran, Nikki
    Coleman, Simon
    Kelly, Jon
    Wei, Shun-Hwa
    Chen, Po-Yin
    Huang, Yun-Hsin
    Chen, Tsung-Ping
    Kuo, Yu-Chia
    Wei, Yu-Chi
    Li, Chih-Hsuan
    Huang, Da-Yu
    Kao, Hsuan-Kai
    Lin, Ting-Wei
    Su, Li
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4157 - 4170
  • [8] Cross-Modal Music Retrieval and Applications An overview of key methodologies
    Mueller, Meinard
    Arzt, Andreas
    Balke, Stefan
    Dorfer, Matthias
    Widmer, Gerhard
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2019, 36 (01) : 52 - 62
  • [9] Emotional expression in speech and music - Evidence of cross-modal similarities
    Juslin, PN
    Laukka, P
    [J]. EMOTIONS INSIDE OUT: 130 YEARS AFTER DARWIN'S THE EXPRESSION OF THE EMOTIONS IN MAN AND ANIMALS, 2003, 1000 : 279 - 282
  • [10] Exploiting Temporal Dependencies for Cross-modal Music Piece Identification
    Carvalho, Luis
    Widmer, Gerhard
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 386 - 390