Zero-shot Topic Classification via Automatic Tagging on Chinese Text Datasets

被引:0
|
作者
Cai, Xinyi [1 ]
Tian, Jiao [1 ]
Yu, Ke [1 ]
Xiao, Hongwang [2 ]
Zhang, Kai [1 ]
Tsai, Pei -Wei [1 ]
机构
[1] Swinburne Univ Technol, Melbourne, Australia
[2] Beijing Acad Artificial Intelligence BAAD, Beijing, Peoples R China
关键词
Topic Classification; Data Scarcity; Zero-shot Learning; Transformer-based Structure; Automatic Tagging; Chinese Text Datasets;
D O I
10.1109/ISPA-BDCloud-SocialCom-SustainCom57177.2022.00068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data scarcity problem is often encountered for topic classification in many real-world applications. Zero-shot classification aims to deal with this problem by conducting a classification without any previously labelled data. However, only a few studies work on zero-shot topic classification on Chinese text. In this paper, we focus on providing an automatic tagging structure for zero-shot topic classification, which adopts labelled data for training based on a transformer-based model from external corpuses. Moreover, we show the effectiveness of fine-tuning large dataset in a downstream task, where the training data labels are not aligned with the test data labels in advance. Our experiments shows that the results outperform the performance of the benchmark approaches on two standard Chinese text datasets for the zero-shot setting.
引用
收藏
页码:482 / 488
页数:7
相关论文
共 50 条
  • [41] Generalized Zero-Shot Video Classification via Generative Adversarial Networks
    Hong, Mingyao
    Li, Guorong
    Zhang, Xinfeng
    Huang, Qingming
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2419 - 2426
  • [42] Zero-Shot Image Classification via Coupled Discriminative Dictionary Learning
    Liu, Lehui
    Wu, Songsong
    Chen, Runqing
    Zhou, Mengquan
    [J]. INTELLIGENT COMPUTING, NETWORKED CONTROL, AND THEIR ENGINEERING APPLICATIONS, PT II, 2017, 762 : 363 - 372
  • [43] Deep Multiple Instance Learning for Zero-Shot Image Tagging
    Rahman, Shafin
    Khan, Salman
    [J]. COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 530 - 546
  • [44] Automatic Machine Translation Evaluation in Many Languages via Zero-Shot Paraphrasing
    Thompson, Brian
    Post, Matt
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 90 - 121
  • [45] Knowledge-embedded Prompt Learning for Zero-shot Social Media Text Classification
    Li, Jingyi
    Chen, Qi
    Wang, Wei
    Wu, Fangyu
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING, SMARTCOMP, 2023, : 222 - 224
  • [46] PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification
    Wang, Yau-Shian
    Chi, Ta-Chung
    Zhang, Ruohong
    Yang, Yiming
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14897 - 14911
  • [47] Generalised Zero-shot Learning for Entailment-based Text Classification with Externa Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Anh Nguyen
    De, Suparna
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2022), 2022, : 19 - 25
  • [48] Improving Cross-lingual Text Classification with Zero-shot Instance-Weighting
    Li, Irene
    Sen, Prithviraj
    Zhu, Huaiyu
    Li, Yunyao
    Radev, Dragomir
    [J]. REPL4NLP 2021: PROCEEDINGS OF THE 6TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, 2021, : 1 - 7
  • [49] Generating Visual Representations for Zero-Shot Classification
    Bucher, Maxime
    Herbin, Stephane
    Jurie, Frederic
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2666 - 2673
  • [50] Gaze Embeddings for Zero-Shot Image Classification
    Karessli, Nour
    Akata, Zeynep
    Schiele, Bernt
    Bulling, Andreas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6412 - 6421