Ontology-Based Naive Bayes Short Text Classification Method for a Small Dataset

被引:0
|
作者
Sangounpao, Ketkaew [1 ]
Muenchaisri, Pornsiri [1 ]
机构
[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Bangkok, Thailand
关键词
requirements engineering; ontology; accounting domain knowledge; short text classification; small dataset; multi-classification; traditional classification;
D O I
10.1109/snpd.2019.8935711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Content less than two hundred words like comments or review statements is known as a short text. Short text classification is useful for automatically categorizing sentence into predefined group. There are several traditional short text classification methods by using bag-of-words with k nearest neighbors (k-NN), Naive Bayes, Maximum entropy, support vector machines (SVMs), and an algorithm based on statistics and rules. The deep learning method is outperformed other methods on classification of short text with normal size of dataset. Some researches classify requirements into functional and non-functional requirements. There is no research on multiclassification of functional requirements with a small dataset particularly for an accounting field. This paper presents an approach to classify short text for a small dataset into multiple categories of functional requirements on the accounting domain. The proposed approach uses an ontology to construct bag-of-words and uses Naive Bayes to classify for small dataset. The experiment is conducted using four hundred of datasets with 5-folds and 10-folds cross validation. The result shows that the method can correctly classify more than 80%. Additionally, comparisons between the ontology-based Naive Bayes method and other methods are investigated.
引用
收藏
页码:53 / 58
页数:6
相关论文
共 50 条
  • [41] Discrimination-based feature selection for multinomial naive Bayes text classification
    Zhu, Jingbo
    Wang, Huizhen
    Zhang, Xijuan
    [J]. COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 149 - +
  • [42] Semantic Text Classification with Tensor Space Model-based Naive Bayes
    Kim, Han-joon
    Kim, Jiyun
    Kim, Jinseog
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 4206 - 4210
  • [43] Personality Classification Based on Twitter Text Using Naive Bayes, KNN and SVM
    Pratama, Bayu Yudha
    Sarno, Riyanarto
    [J]. 2015 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2015, : 170 - 174
  • [44] Using Naive Bayes Method to Classify Text-based Email
    Kang, LanLan
    Chen, Ruey-Shun
    Chen, Yeh-Cheng
    Cao, WenLiang
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 94 - 98
  • [45] A Novel Approach for Ontology-based Dimensionality Reduction for Web Text Document Classification
    Elhadad, Mohamed K.
    Badran, Khaled M.
    Salama, Gouda I.
    [J]. 2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 373 - 378
  • [46] Large Scale Text Classification using Map Reduce and Naive Bayes Algorithm for Domain Specified Ontology Building
    Santoso, Joan
    Yuniarno, Eko Mulyanto
    Hariadi, Mochamad
    [J]. 2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL I, 2015, : 428 - 432
  • [47] Assessment of text coherence using an ontology-based relatedness measurement method
    Giray, Gorkem
    Unalir, Murat Osman
    [J]. EXPERT SYSTEMS, 2020, 37 (03)
  • [48] Patent Text Classification Based on Naive Bayesian Method
    Xiao, Lizhong
    Wang, Guangzhong
    Liu, Yuan
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2018, : 57 - 60
  • [49] Automatic Concepts Classification based on Bloom's Taxonomy using Text Analysis and the Naive Bayes Classifier Method
    Nafa, Fatema
    Othman, Salem
    Khan, Javed
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION, VOL 1 (CSEDU), 2016, : 391 - 396
  • [50] Text classification based on a combination of ontology with statistical method
    Yu, Feng
    Zheng, De-Quan
    Zhao, Tie-Jun
    Li, Sheng
    Yu, Hao
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1042 - +