Zero-Shot Turkish Text Classification

被引:0
|
作者
Birim, Ahmet [1 ]
Erden, Mustafa [1 ]
Arslan, Levent M. [1 ,2 ]
机构
[1] Sestek, Istanbul, Turkey
[2] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey
关键词
zero-shot; text classification; natural language processing;
D O I
10.1109/SIU53274.2021.9477864
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The method frequently used for text classification is supervised modeling with a large training set with labels. In some cases, we may not have labeled data. Modeling in the absence of labeled data for target classes is called zero-shot modeling. For zero-shot text classification natural language inference is utilized which is another branch of natural language processing. In order to measure the effectiveness of zero-shot classification, a reference is generated with BERT method which contrarily uses labeled data. Two Turkish data sets are employed. TTC-3600 dataset is composed of 6 news categories. In this study we have generated a new Twitter sentiment dataset TTD, which is composed of weakly labeled Twitter messages in 2 classes as positive and negative depending on contained emoticons. The results show that without using any labeled data we can obtain classification accuracies of %65.5 and %91 for TTC-3600 and TTD respectively.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Unified benchmark for zero-shot Turkish text classification
    celik, Emrecan
    Dalyan, Tugba
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [2] Retrieval Augmented Zero-Shot Text Classification
    Abdullahi, Tassallah
    Singh, Ritambhara
    Eickhoff, Carsten
    [J]. PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024, 2024, : 195 - 203
  • [3] Extreme Zero-Shot Learning for Extreme Text Classification
    Xiong, Yuanhao
    Chang, Wei-Cheng
    Hsieh, Cho-Jui
    Yu, Hsiang-Fu
    Dhillon, Inderjit
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5455 - 5468
  • [4] Learn to Adapt for Generalized Zero-Shot Text Classification
    Zhang, Yiwen
    Yuan, Caixia
    Wang, Xiaojie
    Bai, Ziwei
    Liu, Yongbin
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 517 - 527
  • [5] Generalized Zero-Shot Text Classification for ICD Coding
    Song, Congzheng
    Zhang, Shanghang
    Sadoughi, Najmeh
    Xie, Pengtao
    Xing, Eric
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4018 - 4024
  • [6] Issues with Entailment-based Zero-shot Text Classification
    Ma, Tingting
    Yao, Jin-Ge
    Lin, Chin-Yew
    Zhao, Tiejun
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 786 - 796
  • [7] ZeroBERTo: Leveraging Zero-Shot Text Classification by Topic Modeling
    Alcoforado, Alexandre
    Ferraz, Thomas Palmeira
    Gerber, Rodrigo
    Bustos, Enzo
    Oliveira, Andre Seidel
    Veloso, Bruno Miguel
    Siqueira, Fabio Levy
    Reali Costa, Anna Helena
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 125 - 136
  • [8] Integrating Semantic Knowledge to Tackle Zero-shot Text Classification
    Zhang, Jingqing
    Lertvittayakumjorn, Piyawat
    Guo, Yike
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1031 - 1040
  • [9] Zero-Shot Text Classification with Semantically Extended Textual Entailment
    Liu, Tengfei
    Hu, Yongli
    Chen, Puman
    Sun, Yanfeng
    Yin, Baocai
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] Cost Effective Annotation Framework Using Zero-Shot Text Classification
    Kasthuriarachchy, Buddhika
    Chetty, Madhu
    Shatte, Adrian
    Walls, Darren
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,