A cross-modal method of labeling music tags

Cited by: 0

Authors:

Jia-Lien Hsu

Yen-Fu Li

Affiliation:

[1] Fu Jen Catholic University, Department of Computer Science and Information Engineering

Source:

Multimedia Tools and Applications | 2012 / Vol. 58

Keywords:

Cross-modal; Mel-frequency cepstral coefficients (MFCCs); Gaussian mixture model (GMM); Tag labeling; Music information retrieval

DOI:

Not available

Abstract:

In this paper, we discuss various features of music objects in two kinds of domains. Among these features, Mel-frequency cepstral coefficients (MFCCs) are examined in more detail and modeled with a Gaussian mixture model (GMM), and the similarity between GMMs is investigated accordingly. We then employ a multimedia graph as a cross-modal method to associate the MFCCs with the genre tags of music objects. By applying a link analysis algorithm to the graph, we label target music objects with appropriate genre tags. We also report experiments on the performance, effectiveness, and parameter settings of our approach.
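The graph-based labeling step described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the abstract does not name the link analysis algorithm, so a random walk with restart is assumed here, and the node names, edge weights (standing in for GMM similarity scores), and the `restart` parameter are all hypothetical.

```python
from collections import defaultdict

def random_walk_with_restart(edges, start, restart=0.15, iters=100):
    """Return steady-state visiting scores for a walk that restarts at `start`.

    Assumption: this is one common link analysis algorithm, chosen for
    illustration; the source abstract does not specify which one is used.
    """
    # Build a symmetric weighted adjacency map from (u, v, weight) triples.
    adj = defaultdict(dict)
    for u, v, w in edges:
        adj[u][v] = w
        adj[v][u] = w
    nodes = list(adj)
    score = {n: 1.0 if n == start else 0.0 for n in nodes}
    for _ in range(iters):
        nxt = {n: 0.0 for n in nodes}
        for u in nodes:
            total = sum(adj[u].values())
            # Spread the non-restart share of u's score over its neighbors.
            for v, w in adj[u].items():
                nxt[v] += (1 - restart) * score[u] * w / total
        # The restart share always returns to the query node.
        nxt[start] += restart
        score = nxt
    return score

# Hypothetical toy graph: two tagged songs, one untagged query song.
# Song-to-song weights stand in for a similarity between MFCC-based GMMs.
edges = [
    ("song_a", "tag:rock", 1.0),
    ("song_b", "tag:jazz", 1.0),
    ("query", "song_a", 0.9),  # query sounds much like song_a
    ("query", "song_b", 0.2),  # query sounds little like song_b
]

scores = random_walk_with_restart(edges, start="query")
tag_scores = {n: s for n, s in scores.items() if n.startswith("tag:")}
best_tag = max(tag_scores, key=tag_scores.get)
print(best_tag)
```

In this sketch the query inherits the tag of the acoustically closest song, because more of the walk's probability mass flows through the heavier similarity edge.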

Pages: 521-541

Page count: 20

Related records (50 in total):

[1] A cross-modal method of labeling music tags
Hsu, Jia-Lien
Li, Yen-Fu
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 58 (03) : 521 - 541
[2] A cross-modal crowd counting method combining CNN and cross-modal transformer
Zhang, Shihui
Wang, Wei
Zhao, Weibo
Wang, Lei
Li, Qunpeng
[J]. IMAGE AND VISION COMPUTING, 2023, 129
[3] Composing with Cross-modal Correspondences: Music and Odors in Concert
Crisinel, Anne-Sylvie
Jacquier, Caroline
Deroy, Ophelia
Spence, Charles
[J]. CHEMOSENSORY PERCEPTION, 2013, 6 (01) : 45 - 52
[4] Cross-Modal Descriptions of Music in the Latin Middle Ages
Hentschel, Frank
[J]. ARCHIV FUR MUSIKWISSENSCHAFT, 2023, 80 (04) : 252 - 287
[5] The effect of expertise in music reading: cross-modal competence
Drai-Zerbib, Veronique
Baccino, Thierry
[J]. JOURNAL OF EYE MOVEMENT RESEARCH, 2013, 6 (05)
[6] LEARNING CONTEXTUAL TAG EMBEDDINGS FOR CROSS-MODAL ALIGNMENT OF AUDIO AND TAGS
Favory, Xavier
Drossos, Konstantinos
Virtanen, Tuomas
Serra, Xavier
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021 : 596 - 600
[7] MOSA: Music Motion With Semantic Annotation Dataset for Cross-Modal Music Processing
Huang, Yu-Fen
Moran, Nikki
Coleman, Simon
Kelly, Jon
Wei, Shun-Hwa
Chen, Po-Yin
Huang, Yun-Hsin
Chen, Tsung-Ping
Kuo, Yu-Chia
Wei, Yu-Chi
Li, Chih-Hsuan
Huang, Da-Yu
Kao, Hsuan-Kai
Lin, Ting-Wei
Su, Li
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4157 - 4170
[8] Cross-Modal Music Retrieval and Applications An overview of key methodologies
Mueller, Meinard
Arzt, Andreas
Balke, Stefan
Dorfer, Matthias
Widmer, Gerhard
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2019, 36 (01) : 52 - 62
[9] Emotional expression in speech and music - Evidence of cross-modal similarities
Juslin, PN
Laukka, P
[J]. EMOTIONS INSIDE OUT: 130 YEARS AFTER DARWIN'S THE EXPRESSION OF THE EMOTIONS IN MAN AND ANIMALS, 2003, 1000 : 279 - 282
[10] Exploiting Temporal Dependencies for Cross-modal Music Piece Identification
Carvalho, Luis
Widmer, Gerhard
[J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021 : 386 - 390
