A Curriculum Learning Approach for Multi-Domain Text Classification Using Keyword Weight Ranking

被引:1
|
作者
Yuan, Zilin [1 ]
Li, Yinghui [1 ]
Li, Yangning [1 ]
Zheng, Hai-Tao [1 ,2 ]
He, Yaobin [3 ,4 ]
Liu, Wenqiang [5 ]
Huang, Dongxiao [5 ]
Wu, Bei [5 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[2] Pengcheng Lab, Shenzhen 518055, Peoples R China
[3] Smart City Res Inst CETC, Shenzhen 518055, Peoples R China
[4] Natl Ctr Appl Math Shenzhen, Shenzhen 518055, Peoples R China
[5] Tencent Inc, Interact Entertainment Grp, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-domain text classification; curriculum learning; keyword weight ranking;
D O I
10.3390/electronics12143040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification is a well-established task in NLP, but it has two major limitations. Firstly, text classification is heavily reliant on domain-specific knowledge, meaning that a classifier that is trained on a given corpus may not perform well when presented with text from another domain. Secondly, text classification models require substantial amounts of annotated data for training, and in certain domains, there may be an insufficient quantity of labeled data available. Consequently, it is essential to explore methods for efficiently utilizing text data from various domains to improve the performance of models across a range of domains. One approach for achieving this is through the use of multi-domain text classification models that leverage adversarial training to extract domain-shared features among all domains as well as the specific features of each domain. After observing the varying distinctness of domain-specific features, our paper introduces a curriculum learning approach using a ranking system based on keyword weight to enhance the effectiveness of multi-domain text classification models. The experimental data from Amazon reviews and FDU-MTL datasets show that our method significantly improves the efficacy of multi-domain text classification models adopting adversarial learning and reaching state-of-the-art outcomes on these two datasets.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [21] Multi-Task Multi-Domain Learning for Digital Staining and Classification of Leukocytes
    Tomczak, Agnieszka
    Ilic, Slobodan
    Marquardt, Gaby
    Engel, Thomas
    Forster, Frank
    Navab, Nassir
    Albarqouni, Shadi
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2897 - 2910
  • [22] Multi-Domain Alias Matching Using Machine Learning
    Ashcroft, Michael
    Johansson, Fredrik
    Kaati, Lisa
    Shrestha, Amendra
    2016 THIRD EUROPEAN NETWORK INTELLIGENCE CONFERENCE (ENIC 2016), 2016, : 77 - 84
  • [23] An algorithm for multi-domain website classification
    Ullah M.A.
    Tahrin A.
    Marjan S.
    International Journal of Web-Based Learning and Teaching Technologies, 2020, 15 (04) : 57 - 65
  • [24] Collaborative Multi-Domain Sentiment Classification
    Wu, Fangzhao
    Huang, Yongfeng
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 459 - 468
  • [25] Semi-supervised Multi-domain Learning for Medical Image Classification
    Chavhan, Ruchika
    Banerjee, Biplab
    Das, Nibaran
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION, RTIP2R 2022, 2023, 1704 : 22 - 33
  • [26] Fuzzy Semantic Classification of Multi-Domain E-Learning Concept
    Ahmed, Rafeeq
    Ahmad, Tanvir
    Almutairi, Fadiyah M.
    Qahtani, Abdulrahman M.
    Alsufyani, Abdulmajeed
    Almutiry, Omar
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (05): : 2206 - 2215
  • [27] Manipulation Classification for JPEG Images Using Multi-Domain Features
    Yu, In-Jae
    Nam, Seung-Hun
    Ahn, Wonhyuk
    Kwon, Myung-Joon
    Lee, Heung-Kyu
    IEEE ACCESS, 2020, 8 : 210837 - 210854
  • [28] Manipulation Classification for JPEG Images Using Multi-Domain Features
    Yu, In-Jae
    Nam, Seung-Hun
    Ahn, Wonhyuk
    Kwon, Myung-Joon
    Lee, Heung-Kyu
    Lee, Heung-Kyu (heunglee@kaist.ac.kr), 1600, Institute of Electrical and Electronics Engineers Inc. (08): : 210837 - 210854
  • [29] Transfer Learning for the Multilingual and Multi-Domain Classification of Messages Relating to Crises
    Sanchez, Cinthia
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2708 - 2708
  • [30] Fuzzy Semantic Classification of Multi-Domain E-Learning Concept
    Rafeeq Ahmed
    Tanvir Ahmad
    Fadiyah M. Almutairi
    Abdulrahman M. Qahtani
    Abdulmajeed Alsufyani
    Omar Almutiry
    Mobile Networks and Applications, 2021, 26 : 2206 - 2215