Questions Feature Extraction and Semi-supervised Classification Based on Terms Relevance

被引:0
|
作者
Zhao, Quan [1 ]
Yu, Zhengtao [1 ]
Su, Lei [2 ]
Guo, Jianyi [1 ]
Mao, Yu [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming, Peoples R China
[2] Yunnan Univ, Software Coll, Kunming, Peoples R China
关键词
question classification; term relationship; literal similarity; supervised learning; semi-supervised learning; Co-training;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question classification, an important component of question answering systems, has a direct impact on the answer extraction accuracy. In this paper, a question classification method is proposed by combined the question feature extracting of term relevance with semi-supervised classification. In detail, the method extracts structure terms in interrogative sentences as the feature space through statistical means, and calculates the relevance among terms by literal similarity method, besides, feature vectors of question classification are obtained by using term similarity relationship to build the questions' feature value in feature space. And then, utilizing unlabeled samples classify questions with the help of Co-training style and semi-supervised learning algorithm. Experimented on 20,000 questions in Yunnan tourism domain, the results show that more remarkable effects have been achieved by adopting the method above. The classification accuracy rate reaches 82.34%, which is higher than the TFIDF feature extraction methods and supervised learning methods by 15.4 percentage points and 1.4 percentage points separately.
引用
收藏
页码:518 / 521
页数:4
相关论文
共 50 条
  • [21] A semi-supervised approach for extracting TCM clinical terms based on feature words
    Liangliang Liu
    Xiaojing Wu
    Hui Liu
    Xinyu Cao
    Haitao Wang
    Hongwei Zhou
    Qi Xie
    BMC Medical Informatics and Decision Making, 20
  • [22] A semi-supervised approach for extracting TCM clinical terms based on feature words
    Liu, Liangliang
    Wu, Xiaojing
    Liu, Hui
    Cao, Xinyu
    Wang, Haitao
    Zhou, Hongwei
    Xie, Qi
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
  • [23] Geodesic based semi-supervised multi-manifold feature extraction
    Fan, Mingyu
    Zhang, Xiaoqin
    Lin, Zhouchen
    Zhang, Zhongfei
    Bao, Hujun
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 852 - 857
  • [24] Semi-supervised Image Classification Learning Based on Random Feature Subspace
    Liu Li
    Zhang Huaxiang
    Hu Xiaojun
    Sun Feifei
    PATTERN RECOGNITION (CCPR 2014), PT I, 2014, 483 : 237 - 242
  • [25] Mass Classification in Mammogram with Semi-Supervised Relief Based Feature Selection
    Liu, Xiaoming
    Liu, Jun
    Feng, Zhilin
    Xu, Xin
    Tang, J.
    FIFTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2013), 2014, 9069
  • [26] GMDH-based semi-supervised feature selection for customer classification
    Xiao, Jin
    Cao, Hanwen
    Jiang, Xiaoyi
    Gu, Xin
    Xie, Ling
    KNOWLEDGE-BASED SYSTEMS, 2017, 132 : 236 - 248
  • [27] A novel feature selection based semi-supervised method for image classification
    Tahir, M. A.
    Smith, J. E.
    Caleb-Solly, P.
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 484 - 493
  • [28] Semi-supervised feature learning for hyperspectral image classification
    Zhang, Pengfei
    Cao, Liujuan
    Wang, Cheng
    Li, Jonathan
    2ND ISPRS INTERNATIONAL CONFERENCE ON COMPUTER VISION IN REMOTE SENSING (CVRS 2015), 2016, 9901
  • [29] Image Classification via Semi-supervised Feature Extraction with Out-of-Sample Extension
    Dornaika, F.
    El Traboulsi, Y.
    Cases, B.
    Assoum, A.
    ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT 1, 2014, 8887 : 182 - 192
  • [30] Semi-supervised local feature selection for data classification
    Zechao Li
    Jinhui Tang
    Science China Information Sciences, 2021, 64