Questions Feature Extraction and Semi-supervised Classification Based on Terms Relevance

被引:0
|
作者
Zhao, Quan [1 ]
Yu, Zhengtao [1 ]
Su, Lei [2 ]
Guo, Jianyi [1 ]
Mao, Yu [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming, Peoples R China
[2] Yunnan Univ, Software Coll, Kunming, Peoples R China
关键词
question classification; term relationship; literal similarity; supervised learning; semi-supervised learning; Co-training;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question classification, an important component of question answering systems, has a direct impact on the answer extraction accuracy. In this paper, a question classification method is proposed by combined the question feature extracting of term relevance with semi-supervised classification. In detail, the method extracts structure terms in interrogative sentences as the feature space through statistical means, and calculates the relevance among terms by literal similarity method, besides, feature vectors of question classification are obtained by using term similarity relationship to build the questions' feature value in feature space. And then, utilizing unlabeled samples classify questions with the help of Co-training style and semi-supervised learning algorithm. Experimented on 20,000 questions in Yunnan tourism domain, the results show that more remarkable effects have been achieved by adopting the method above. The classification accuracy rate reaches 82.34%, which is higher than the TFIDF feature extraction methods and supervised learning methods by 15.4 percentage points and 1.4 percentage points separately.
引用
收藏
页码:518 / 521
页数:4
相关论文
共 50 条
  • [31] Semi-supervised local feature selection for data classification
    Li, Zechao
    Tang, Jinhui
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (09)
  • [32] Semi-Supervised Feature Transformation for Tissue Image Classification
    Watanabe, Kenji
    Kobayashi, Takumi
    Wada, Toshikazu
    PLOS ONE, 2016, 11 (12):
  • [33] Semi-supervised local feature selection for data classification
    Zechao LI
    Jinhui TANG
    Science China(Information Sciences), 2021, 64 (09) : 127 - 138
  • [34] Efficient Semi-Supervised Feature Selection: Constraint, Relevance, and Redundancy
    Benabdeslem, Khalid
    Hindawi, Mohammed
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (05) : 1131 - 1143
  • [35] Ant Based Semi-supervised Classification
    Halder, Anindya
    Ghosh, Susmita
    Ghosh, Ashish
    SWARM INTELLIGENCE, 2010, 6234 : 376 - +
  • [36] Semi-supervised learning for classification on Chinese drug treatment questions
    Wang, Xinyuan
    Ren, Jiangtao
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 991 - 994
  • [37] Structured optimal graph based sparse feature extraction for semi-supervised learning
    Liu, Zhonghua
    Lai, Zhihui
    Ou, Weihua
    Zhang, Kaibing
    Zheng, Ruijuan
    SIGNAL PROCESSING, 2020, 170
  • [38] Graph convolutional network-based semi-supervised feature classification of volumes
    He, Xiangyang
    Yang, Shuoliu
    Tao, Yubo
    Dai, Haoran
    Lin, Hai
    JOURNAL OF VISUALIZATION, 2022, 25 (02) : 379 - 393
  • [39] Graph convolutional network-based semi-supervised feature classification of volumes
    Xiangyang He
    Shuoliu Yang
    Yubo Tao
    Haoran Dai
    Hai Lin
    Journal of Visualization, 2022, 25 : 379 - 393
  • [40] POLSAR IMAGE CLASSIFICATION BASED-ON SEMI-SUPERVISED POLARIMETRIC FEATURE SELECTION
    Huang, Xiayuan
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 196 - 200