Multi-label Dysfluency Classification

Cited by: 0
Authors
Jouaiti, Melanie [1]
Dautenhahn, Kerstin [1]
Affiliation
[1] Univ Waterloo, Elect & Comp Engn Dept, 20 Univ Ave, Waterloo, ON N2L3G1, Canada
Keywords
Dysfluency classification; Transfer learning; Multi-label classification; Speech; Children
DOI
10.1007/978-3-031-20980-2_25
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Stuttering is a neurodevelopmental disorder that affects 1% of the population. Dysfluency classification remains an open research question, with no consensus on which feature representation or which classifier to use. Another issue, neglected so far, is how to handle audio samples that contain more than one type of dysfluency. Research has mostly considered only single-label problems, in part because substantial multi-label datasets were lacking. However, the FluencyBank and SEP-28K datasets are now available and contain multi-label data, which should pave the way for more research that takes this aspect into account. In this paper, we give an overview of different ways to handle multi-label classification and compare them while fine-tuning the ResNet50 network to perform multi-label dysfluency classification. We show that fine-tuning ResNet50, independently of the label representation, performs better than current state-of-the-art results.
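The abstract does not state which label representations the paper compares. As a minimal sketch of two common options for a clip with several dysfluency types, the code below builds a multi-hot target (one 0/1 entry per type) and a label-powerset target (one class id per combination of types). The label names in `LABELS` and both function names are illustrative assumptions, not taken from the paper or from the datasets' annotation schemes.

```python
# Illustrative label set (an assumption, not the paper's or a dataset's list).
LABELS = ["block", "prolongation", "repetition", "interjection"]


def to_multi_hot(clip_labels, labels=LABELS):
    """Return one 0/1 entry per dysfluency type present in the clip."""
    present = set(clip_labels)
    unknown = present - set(labels)
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    return [1 if name in present else 0 for name in labels]


def to_powerset_class(clip_labels, labels=LABELS):
    """Return a single class id for the whole combination of labels.

    The multi-hot vector is read as a binary number, so every distinct
    combination of labels maps to a distinct integer.
    """
    class_id = 0
    for bit in to_multi_hot(clip_labels, labels):
        class_id = class_id * 2 + bit
    return class_id


if __name__ == "__main__":
    clip = ["block", "interjection"]  # a clip with two dysfluency types
    print(to_multi_hot(clip))
    print(to_powerset_class(clip))
```

A multi-hot target is typically paired with one sigmoid output per label, whereas a powerset target turns the task into ordinary single-label classification over up to 2^n combinations, most of which may be rare or absent in the data.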
Pages: 290-301
Number of pages: 12