Learning Multifaceted Self-Similarity for Musical Structure Analysis

Citations: 0
Authors
Chen, Tsung-Ping [1]
Su, Li [2]
Yoshii, Kazuyoshi [1]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
DOI
10.1109/APSIPAASC58517.2023.10317473
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a data-driven music structure analysis (MSA) method that performs segmentation and clustering of musical sections for a music signal. Since the intra-section homogeneity and inter-section difference are important clues for MSA, most studies on MSA have focused on self-similarity matrices (SSMs) computed from various acoustic features of a music signal. The performance of this approach, however, might be limited because the acoustic features used for computing SSMs are designed manually, and multiple SSMs are often integrated in a heuristic manner. To overcome these limitations, we propose a method that learns latent features useful for MSA with a stack of convolution-augmented multi-head self-attention (CAMHSA) layers that compute and fuse multiple self-attention maps representing multifaceted self-similarity. The estimated features are then clustered into an appropriate number of sections with a Gaussian mixture model (GMM). In the segmentation and clustering tasks, the proposed method outperformed baseline methods based on hand-crafted SSMs. In particular, it achieved state-of-the-art performance on the segmentation task. We found that the internal attention maps represent the section boundaries at the fine and coarse levels.
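As a rough illustration of the two kinds of similarity map the abstract contrasts, the sketch below computes a hand-crafted cosine SSM and a single-head self-attention map for a toy feature sequence. It is not the paper's CAMHSA layer: the learned projections, the convolution augmentation, the multiple heads, and the GMM clustering are all omitted, and the function names `cosine_ssm` and `attention_map` and the toy `frames` data are assumptions made for this example only.

```python
import math


def cosine_ssm(features):
    """Cosine self-similarity matrix of a list of feature vectors."""
    norms = [math.sqrt(sum(x * x for x in f)) for f in features]
    n = len(features)
    ssm = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            dot = sum(a * b for a, b in zip(features[i], features[j]))
            ssm[i][j] = dot / (norms[i] * norms[j])
    return ssm


def attention_map(features):
    """Single-head self-attention weights: row-wise softmax of scaled
    dot products. Identity query/key projections are assumed here; a
    trained model would learn them."""
    d = len(features[0])
    rows = []
    for q in features:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d)
                  for k in features]
        m = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        rows.append([e / total for e in exps])
    return rows


# Toy "song": three frames of one section followed by three of another.
frames = [[1.0, 0.1], [0.9, 0.2], [1.0, 0.0],
          [0.1, 1.0], [0.0, 0.9], [0.2, 1.0]]
ssm = cosine_ssm(frames)
att = attention_map(frames)
```

In both maps, frames from the same toy section score higher against each other than against frames from the other section, which is the block structure that segmentation and clustering rely on.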
Pages: 165-172
Number of pages: 8