Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification

被引:0
|
作者
Spangher, Alexander [1 ]
May, Jonathan [1 ]
Shiang, Sz-rung [2 ]
Deng, Lingjia [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90089 USA
[2] Bloomberg, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As labeling schemas evolve over time, small differences can render datasets following older schemas unusable. This prevents researchers from building on top of previous annotation work and results in the existence, in discourse learning in particular, of many small class-imbalanced datasets. In this work, we show that a multitask learning approach can combine discourse datasets from similar and diverse domains to improve discourse classification. We show an improvement of 4.9% Micro F1-score over current state-of-the-art benchmarks on the NewsDiscourse dataset, one of the largest discourse datasets recently published, due in part to label correlations across tasks, which improve performance for under-represented classes. We also offer an extensive review of additional techniques proposed to address resource-poor problems in NLP, and show that none of these approaches can improve classification accuracy in our setting(1).
引用
收藏
页码:498 / 517
页数:20
相关论文
共 50 条
  • [1] A survey of class-imbalanced semi-supervised learning
    Gui, Qian
    Zhou, Hong
    Guo, Na
    Niu, Baoning
    [J]. MACHINE LEARNING, 2024, 113 (08) : 5057 - 5086
  • [2] A semi-supervised resampling method for class-imbalanced learning
    Jiang, Zhen
    Zhao, Lingyun
    Lu, Yu
    Zhan, Yongzhao
    Mao, Qirong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 221
  • [3] Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding
    Guo, Lan-Zhe
    Li, Yu-Feng
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Assembly Quality Detection Based on Class-Imbalanced Semi-Supervised Learning
    Lu, Zichen
    Jiang, Jiabin
    Cao, Pin
    Yang, Yongying
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [5] ABC: Auxiliary Balanced Classifier for Class-Imbalanced Semi-Supervised Learning
    Lee, Hyuck
    Shin, Seungjae
    Kim, Heeyoung
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] ABAE: Auxiliary Balanced AutoEncoder for class-imbalanced semi-supervised learning
    Tang, Qianying
    Wei, Xiang
    Su, Qi
    Zhang, Shunli
    [J]. PATTERN RECOGNITION LETTERS, 2024, 182 : 118 - 124
  • [7] Mixed Re-Sampled Class-Imbalanced Semi-Supervised Learning for Skin Lesion Classification
    Tian, Ye
    Zhang, Liguo
    Shen, Linshan
    Yin, Guisheng
    Chen, Lei
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 28 (01): : 195 - 211
  • [8] OCI-SSL: Open Class-Imbalanced Semi-Supervised Learning With Contrastive Learning
    Zhou, Yuting
    Gao, Can
    Zhou, Jie
    Ding, Weiping
    Shen, Linlin
    Lai, Zhihui
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1 - 14
  • [9] Class-imbalanced Unsupervised and Semi-Supervised Domain Adaptation for Histopathology Images
    Hosseini, S. Maryam
    Shafique, Abubakr
    Babaie, Morteza
    Tizhoosh, H. R.
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [10] Semi-Supervised and Class-Imbalanced Open Set Medical Image Recognition
    Xu, Yiqian
    Wang, Ruofan
    Zhao, Rui-Wei
    Xiao, Xingxing
    Feng, Rui
    [J]. IEEE ACCESS, 2024, 12 : 122852 - 122877