Semi-supervised learning for question classification in CQA

被引:1
|
作者
Yiyang Li
Lei Su
Jun Chen
Liwei Yuan
机构
[1] Kunming University of Science and Technology,School of Information Engineering and Automation
来源
Natural Computing | 2017年 / 16卷
关键词
Community Q&A; Semi-supervised learning; Ensemble learning; Question classification;
D O I
暂无
中图分类号
学科分类号
摘要
In a community question answering (CQA) system, the new questions are appeared endlessly which have no tags. And the questions must be marked as some labels. Therefore, the question classification is very important for CQA. In the traditional task of question classification, a mass of labeled questions are required. In the real world, it is effortless to obtain a large number of unlabeled question samples and the vast labeled question samples are fairly expensive to obtain. Therefore, how to utilize the unlabeled samples to improve the question classification accuracy has been the core question of the question classification. In this paper, a kind of semi-supervised question classification method based on ensemble learning is proposed. Firstly, several classifiers are combined as one, i.e. ensemble classifier. The ensemble classifier is trained firstly to utilize a small number of labeled question samples. Secondly, the trained preliminary classifier gives each of the unlabeled question samples a pseudo label. Then, the ensemble classifier is trained again to use the labeled question samples and a large number of unlabeled question samples which have pseudo labels. Finally, to verify the effectiveness of the method through the experiments on question samples of 15 classes extracted from the community question answering system. The experiments demonstrate that the method could effectively utilize a large number of unlabeled question samples to improve the question classification accuracy.
引用
收藏
页码:567 / 577
页数:10
相关论文
共 50 条
  • [21] Extreme semi-supervised learning for multiclass classification
    Chen, Chuangquan
    Gan, Yanfen
    Vong, Chi-Man
    NEUROCOMPUTING, 2020, 376 : 103 - 118
  • [22] Semi-Supervised Text Classification With Universum Learning
    Liu, Chien-Liang
    Hsaio, Wen-Hoar
    Lee, Chia-Hoang
    Chang, Tao-Hsing
    Kuo, Tsung-Hsun
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) : 462 - 473
  • [23] Participatory Learning based Semi-supervised Classification
    Deng, Chao
    Guo, Mao-Zu
    Liu, Yang
    Li, Hai-Feng
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 207 - 216
  • [24] News Classification with Semi-Supervised and Active Learning
    Guo C.
    Chao Y.
    Data Analysis and Knowledge Discovery, 2022, 6 (04) : 28 - 38
  • [25] Malware Classification Based on Semi-Supervised Learning
    Ding, Yu
    Zhang, XiaoYu
    Li, BinBin
    Xing, Jian
    Qiang, Qian
    Qi, ZiSen
    Guo, MengHan
    Jia, SiYu
    Wang, HaiPing
    SCIENCE OF CYBER SECURITY, SCISEC 2022, 2022, 13580 : 287 - 301
  • [26] Semi-supervised learning for Bayesian pattern classification
    Center, JL
    Bayesian Inference and Maximum Entropy Methods in Science and Engineering, 2005, 803 : 517 - 524
  • [27] Spectral Kernel Learning for Semi-Supervised Classification
    Liu, Wei
    Qian, Buyue
    Cui, Jingyu
    Liu, Jianzhuang
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1150 - 1155
  • [28] Deep graph learning for semi-supervised classification
    Lin, Guangfeng
    Kang, Xiaobing
    Liao, Kaiyang
    Zhao, Fan
    Chen, Yajun
    PATTERN RECOGNITION, 2021, 118
  • [29] A NOVEL SEMI-SUPERVISED LEARNING FOR SMS CLASSIFICATION
    Ahmed, Ishtiaq
    Guan, Donghai
    Chung, Teachoong
    PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2014, : 856 - 861
  • [30] Semi-supervised Metric Learning for Image Classification
    Hu, Jiwei
    Sun, ChenSheng
    Kin Man Lam
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT II, 2010, 6298 : 728 - 735