Zero-shot Topic Classification via Automatic Tagging on Chinese Text Datasets

被引:0
|
作者
Cai, Xinyi [1 ]
Tian, Jiao [1 ]
Yu, Ke [1 ]
Xiao, Hongwang [2 ]
Zhang, Kai [1 ]
Tsai, Pei -Wei [1 ]
机构
[1] Swinburne Univ Technol, Melbourne, Australia
[2] Beijing Acad Artificial Intelligence BAAD, Beijing, Peoples R China
关键词
Topic Classification; Data Scarcity; Zero-shot Learning; Transformer-based Structure; Automatic Tagging; Chinese Text Datasets;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data scarcity problem is often encountered for topic classification in many real-world applications. Zero-shot classification aims to deal with this problem by conducting a classification without any previously labelled data. However, only a few studies work on zero-shot topic classification on Chinese text. In this paper, we focus on providing an automatic tagging structure for zero-shot topic classification, which adopts labelled data for training based on a transformer-based model from external corpuses. Moreover, we show the effectiveness of fine-tuning large dataset in a downstream task, where the training data labels are not aligned with the test data labels in advance. Our experiments shows that the results outperform the performance of the benchmark approaches on two standard Chinese text datasets for the zero-shot setting.
引用
收藏
页码:482 / 488
页数:7
相关论文
共 50 条
  • [21] Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing
    Liu, Han
    Zhao, Siyang
    Zhang, Xiaotong
    Zhang, Feng
    Wang, Wei
    Ma, Fenglong
    Chen, Hongyang
    Yu, Hong
    Zhang, Xianchao
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18644 - 18652
  • [22] Cost Effective Annotation Framework Using Zero-Shot Text Classification
    Kasthuriarachchy, Buddhika
    Chetty, Madhu
    Shatte, Adrian
    Walls, Darren
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [23] Using Pseudo-Labelled Data for Zero-Shot Text Classification
    Wang, Congcong
    Nulty, Paul
    Lillis, David
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286 : 35 - 46
  • [24] Prompt-based Zero-shot Text Classification with Conceptual Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Nguyen, Anh
    De, Suparna
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-SRW 2023, VOL 4, 2023, : 30 - 38
  • [25] A weakly supervised textual entailment approach to zero-shot text classification
    Pamies, Marc
    Llop, Joan
    Multari, Francesco
    Duran-Silva, Nicolau
    Parra-Rojas, Cesar
    Gonzalez-Agirre, Aitor
    Massucci, Francesco Alessandro
    Villegas, Marta
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 286 - 296
  • [26] HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition
    Zhang, Yuyi
    Zhu, Yuanzhi
    Peng, Dezhi
    Zhang, Peirong
    Yang, Zhenhua
    Yang, Zhibo
    Jin, Lianwen
    [J]. PATTERN RECOGNITION, 2025, 158
  • [27] CLZT: A Contrastive Learning Based Framework for Zero-Shot Text Classification
    Li, Kun
    Lin, Meng
    Hu, Songlin
    Li, Ruixuan
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II, 2022, : 623 - 630
  • [28] Zero-Shot Text Classification with Semantically Extended Graph Convolutional Network
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8352 - 8359
  • [29] ProZe: Explainable and Prompt-Guided Zero-Shot Text Classification
    Harrando, Ismail
    Reboud, Alison
    Schleider, Thomas
    Ehrhart, Thibault
    Troncy, Raphael
    [J]. IEEE INTERNET COMPUTING, 2022, 26 (06) : 69 - 77
  • [30] Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification
    Xenouleas, Stratos
    Tsoukara, Alexia
    Panagiotakis, Giannis
    Chalkidis, Ilias
    Androutsopoulos, Ion
    [J]. PROCEEDINGS OF THE 12TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE, SETN 2022, 2022,