A Curriculum Learning Approach for Multi-Domain Text Classification Using Keyword Weight Ranking

被引:1
|
作者
Yuan, Zilin [1 ]
Li, Yinghui [1 ]
Li, Yangning [1 ]
Zheng, Hai-Tao [1 ,2 ]
He, Yaobin [3 ,4 ]
Liu, Wenqiang [5 ]
Huang, Dongxiao [5 ]
Wu, Bei [5 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Pengcheng Lab, Shenzhen 518055, Peoples R China
[3] Smart City Res Inst CETC, Shenzhen 518055, Peoples R China
[4] Natl Ctr Appl Math Shenzhen, Shenzhen 518055, Peoples R China
[5] Tencent Inc, Interact Entertainment Grp, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-domain text classification; curriculum learning; keyword weight ranking;
D O I
10.3390/electronics12143040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification is a well-established task in NLP, but it has two major limitations. Firstly, text classification is heavily reliant on domain-specific knowledge, meaning that a classifier that is trained on a given corpus may not perform well when presented with text from another domain. Secondly, text classification models require substantial amounts of annotated data for training, and in certain domains, there may be an insufficient quantity of labeled data available. Consequently, it is essential to explore methods for efficiently utilizing text data from various domains to improve the performance of models across a range of domains. One approach for achieving this is through the use of multi-domain text classification models that leverage adversarial training to extract domain-shared features among all domains as well as the specific features of each domain. After observing the varying distinctness of domain-specific features, our paper introduces a curriculum learning approach using a ranking system based on keyword weight to enhance the effectiveness of multi-domain text classification models. The experimental data from Amazon reviews and FDU-MTL datasets show that our method significantly improves the efficacy of multi-domain text classification models adopting adversarial learning and reaching state-of-the-art outcomes on these two datasets.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [1] DaCon: Multi-Domain Text Classification Using Domain Adversarial Contrastive Learning
    Dai, Yingjun
    El-Roby, Ahmed
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 40 - 52
  • [2] Learning Multi-Domain Adversarial Neural Networks for Text Classification
    Ding, Xiao
    Shi, Qiankun
    Cai, Bibo
    Liu, Ting
    Zhao, Yanyan
    Ye, Qiang
    IEEE ACCESS, 2019, 7 : 40323 - 40332
  • [3] Co-Regularized Adversarial Learning for Multi-Domain Text Classification
    Wu, Yuan
    Inkpen, Diana
    El-Roby, Ahmed
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [4] Dual Adversarial Co-Learning for Multi-Domain Text Classification
    Wu, Yuan
    Guo, Yuhong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6438 - 6445
  • [5] A Multi-domain Text Classification Method Based on Recurrent Convolution Multi-task Learning
    Xie Jinbao
    Li Jiahui
    Kang Shouqiang
    Wang Qingyan
    Wang Yujing
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (08) : 2395 - 2403
  • [6] Multi-domain Text-to-Speech Synthesis by Automatic Text Classification
    Alias, Francesc
    Socoro, Joan Claudi
    Sevillano, Xavier
    Iriondo, Ignasi
    Gonzalvo, Xavier
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1304 - 1307
  • [7] Text classification adapted for Multi-domain Text-To-Speech Synthesis
    Alias, Francesc
    Gonzalvo, Xavier
    Sevillano, Xavier
    Claudi Socoro, Joan
    Antonio Montero, Jose
    Garcia, David
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 267 - 274
  • [8] Multi-domain Network Service Placement Optimization Using Curriculum Reinforcement Learning
    Shahbazi, Arzhang
    Cherrared, Sihem
    Guillemin, Fabrice
    2023 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS, NFV-SDN, 2023, : 21 - 26
  • [9] MAXIMUM BATCH FROBENIUS NORM FOR MULTI-DOMAIN TEXT CLASSIFICATION
    Wu, Yuan
    Inkpen, Diana
    El-Roby, Ahmed
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3763 - 3767
  • [10] A ROBUST CONTRASTIVE ALIGNMENT METHOD FOR MULTI-DOMAIN TEXT CLASSIFICATION
    Li, Xuefeng
    Lei, Hao
    Wang, Liwen
    Dong, Guanting
    Zhao, Jinzheng
    Liu, Jiachi
    Xu, Weiran
    Zhang, Chunyun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7827 - 7831