Multilingual I-Vector based Statistical Modeling for Music Genre Classification

Cited by: 1
Authors
Dai, Jia [1]
Xue, Wei
Liu, Wenju
Affiliation
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
Keywords
i-vector; multilingual; music genre classification; statistical feature
DOI
10.21437/Interspeech.2017-74
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In music signal processing, modeling long-time features represents the time-series characteristics of the music signal better than modeling each short-time frame independently. As a typical long-time modeling strategy, the identification vector (i-vector) uses statistical modeling to represent the audio signal at the segment level. It can better capture the important elements of the music signal, and these elements may benefit music signal classification. In this paper, the i-vector based statistical feature for music genre classification is explored. In addition, to learn enough important elements of the music signal, a new multilingual i-vector feature is proposed based on the multilingual model. The experimental results show that the multilingual i-vector based models achieve better classification performance than conventional methods based on short-time modeling.
Pages: 459-463
Page count: 5
Related papers
50 in total
  • [1] Application of I-Vector in Speech and Music Classification
    Zhang, Hao
    Yang, Xu-Kui
    Zhang, Wei-Qiang
    Zhang, Wen-Lin
    Liu, Jia
    [J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2016, : 1 - 5
  • [2] I-VECTOR BASED LANGUAGE MODELING FOR QUERY REPRESENTATION
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5211 - 5215
  • [3] "Multilingual" Deep Neural Network For Music Genre Classification
    Dai, Jia
    Liu, Wenju
    Ni, Chongjia
    Dong, Like
    Yang, Hong
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2907 - 2911
  • [4] PLDA in i-vector based underwater acoustic signals classification
    Song, Yongqiang
    Liu, Feng
    Shen, Tongsheng
    [J]. SHIPS AND OFFSHORE STRUCTURES, 2024, 19 (03) : 366 - 374
  • [5] 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation
    Karafiat, Martin
    Baskar, Murali Karthick
    Matejka, Pavel
    Vesely, Karel
    Grezl, Frantisek
    Burget, Lukas
    Cernocky, Jan Hanza
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 719 - 723
  • [6] I-VECTOR BASED LANGUAGE MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Lee, Hung-Shin
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] Feature Vector Design for Music Genre Classification
    da Silva Muniz, Victor Hugo
    de Oliveira e Souza Filho, Joao Baptista
    [J]. 2021 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2021,
  • [8] Supervised i-vector Modeling - Theory and Applications
    Ramoji, Shreyas
    Ganapathy, Sriram
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1091 - 1095
  • [9] Active Learning Music Genre Classification Based on Support Vector Machine
    Deng G.
    Ko Y.C.
    [J]. Advances in Multimedia, 2022, 2022
  • [10] i-Vector with sparse representation classification for speaker verification
    Kua, Jia Min Karen
    Epps, Julien
    Ambikairajah, Eliathamby
    [J]. SPEECH COMMUNICATION, 2013, 55 (05) : 707 - 720