Semi-supervised learning for question classification in CQA

被引:1
|
作者
Yiyang Li
Lei Su
Jun Chen
Liwei Yuan
机构
[1] Kunming University of Science and Technology,School of Information Engineering and Automation
来源
Natural Computing | 2017年 / 16卷
关键词
Community Q&A; Semi-supervised learning; Ensemble learning; Question classification;
D O I
暂无
中图分类号
学科分类号
摘要
In a community question answering (CQA) system, the new questions are appeared endlessly which have no tags. And the questions must be marked as some labels. Therefore, the question classification is very important for CQA. In the traditional task of question classification, a mass of labeled questions are required. In the real world, it is effortless to obtain a large number of unlabeled question samples and the vast labeled question samples are fairly expensive to obtain. Therefore, how to utilize the unlabeled samples to improve the question classification accuracy has been the core question of the question classification. In this paper, a kind of semi-supervised question classification method based on ensemble learning is proposed. Firstly, several classifiers are combined as one, i.e. ensemble classifier. The ensemble classifier is trained firstly to utilize a small number of labeled question samples. Secondly, the trained preliminary classifier gives each of the unlabeled question samples a pseudo label. Then, the ensemble classifier is trained again to use the labeled question samples and a large number of unlabeled question samples which have pseudo labels. Finally, to verify the effectiveness of the method through the experiments on question samples of 15 classes extracted from the community question answering system. The experiments demonstrate that the method could effectively utilize a large number of unlabeled question samples to improve the question classification accuracy.
引用
收藏
页码:567 / 577
页数:10
相关论文
共 50 条
  • [1] Semi-supervised learning for question classification in CQA
    Li, Yiyang
    Su, Lei
    Chen, Jun
    Yuan, Liwei
    NATURAL COMPUTING, 2017, 16 (04) : 567 - 577
  • [2] Using semi-supervised learning for question classification
    Tri, Nguyen Thanh
    Le, Nguyen Minh
    Shimazu, Akira
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 31 - +
  • [3] Semi-Supervised Learning for ECG Classification
    Rodrigues, Rui
    Couto, Paula
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [4] Augmentation Learning for Semi-Supervised Classification
    Frommknecht, Tim
    Zipf, Pedro Alves
    Fan, Quanfu
    Shvetsova, Nina
    Kuehne, Hilde
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 85 - 98
  • [5] Semi-Supervised Learning for Classification with Uncertainty
    Zhang, Rui
    Liu, Tong-bo
    Zheng, Ming-wen
    MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 3584 - 3590
  • [6] Question classification based on co-training style semi-supervised learning
    Yu, Zhengtao
    Su, Lei
    Li, Lina
    Zhao, Quan
    Mao, Cunli
    Guo, Jianyi
    PATTERN RECOGNITION LETTERS, 2010, 31 (13) : 1975 - 1980
  • [7] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [8] Semi-supervised tensor learning for image classification
    Zhang, Jianguang
    Han, Yahong
    Jiang, Jianmin
    MULTIMEDIA SYSTEMS, 2017, 23 (01) : 63 - 73
  • [9] A Semi-Supervised Learning Algorithm for Data Classification
    Kuo, Cheng-Chien
    Shieh, Horng-Lin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (05)
  • [10] Semi-supervised tensor learning for image classification
    Jianguang Zhang
    Yahong Han
    Jianmin Jiang
    Multimedia Systems, 2017, 23 : 63 - 73