Classifier combination approach for question classification for Bengali question answering system

被引:3
|
作者
Banerjee, Somnath [1 ]
Naskar, Sudip Kumar [1 ]
Rosso, Paolo [2 ]
Bndyopadhyay, Sivaji [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[2] Univ Politecn Valencia, PRHLT Res Ctr, Valencia, Spain
关键词
Bengali question classification; question classification; classifier combinations;
D O I
10.1007/s12046-019-1224-8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Question classification (QC) is a prime constituent of an automated question answering system. The work presented here demonstrates that a combination of multiple models achieves better classification performance than those obtained with existing individual models for the QC task in Bengali. We have exploited state-of-the-art multiple model combination techniques, i.e., ensemble, stacking and voting, to increase QC accuracy. Lexical, syntactic and semantic features of Bengali questions are used for four well-known classifiers, namely Naive Bayes, kernel Naive Bayes, Rule Induction and Decision Tree, which serve as our base learners. Single-layer question-class taxonomy with 8 coarse-grained classes is extended to two-layer taxonomy by adding 69 fine-grained classes. We carried out the experiments both on single-layer and two-layer taxonomies. Experimental results confirmed that classifier combination approaches outperform single-classifier classification approaches by 4.02% for coarse-grained question classes. Overall, the stacking approach produces the best results for fine-grained classification and achieves 87.79% of accuracy. The approach presented here could be used in other Indo-Aryan or Indic languages to develop a question answering system.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A statistical classification approach to question answering using web data
    Whittaker, E
    Furui, S
    Klakow, D
    [J]. 2005 International Conference on Cyberworlds, Proceedings, 2005, : 421 - 428
  • [32] Advances in question classification for open-domain question answering
    School of Computer Science and Technology, Anhui University of Technology, Maanshan
    Anhui
    243002, China
    不详
    Jiangsu
    210023, China
    [J]. Tien Tzu Hsueh Pao, 8 (1627-1636):
  • [33] Corpus-based question classification in question answering systems
    Tomas, David
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 155 - 156
  • [34] QAAN: Question Answering Attention Network for Community Question Classification
    Wang, Yuntao
    Huang, Weiqing
    [J]. 2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [35] Question and Answer Classification in Czech Question Answering Benchmark Dataset
    Kusnirakova, Dasa
    Medved, Marek
    Horak, Ales
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 701 - 706
  • [36] Exploring Answer Information for Question Classification in Community Question Answering
    Wang, Jian
    Lin, Hongfei
    Dong, Hualei
    Xiong, Daping
    Yang, Zhihao
    [J]. JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2018, 31 (1-2) : 67 - 84
  • [37] BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering
    He, Jie
    Lok, Simon Chi U.
    Gutierrez-Basulto, Victor
    Pan, Jeff Z.
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 376 - 387
  • [38] Deep learning based Bengali question answering system using semantic textual similarity
    Arijit Das
    Diganta Saha
    [J]. Multimedia Tools and Applications, 2022, 81 : 589 - 613
  • [39] A Classification-based Approach to Question Answering in Discussion Boards
    Hong, Liangjie
    Davison, Brian D.
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 171 - 178
  • [40] A machine learning approach to introspection in a question answering system
    Czuba, K
    Prager, J
    Chu-Carroll, J
    [J]. PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 265 - 272