An automatic document classifier system based on Naive Bayes Classifier and Ontology

被引:7
|
作者
Chang, Yi-Hsing [1 ]
Huang, Hsiu-Yi [1 ]
机构
[1] So Taiwan Univ, Dept Informat Management, Taipei, Taiwan
关键词
Naive bayes Classifier; ontology; formal concept analysis; document classification;
D O I
10.1109/ICMLC.2008.4620948
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
An Automatic Document Classifier System based on Ontology and the Naive Bayes Classifier is proposed in this paper. The main concept is to first establish a keyword synonymous table by experts for narrowing down the range and getting the consistency of keywords. The Formal Concept Analysis is then used for establishing knowledge ontology through the complex categories and attributes relation. Finally, the ontology is applied to a Naive Bayes Classifier to get the automatic document classifier system. In this system, 319 documents divided into 11 categories are used to assess the effectiveness of classification, where 224 and 95 documents are the training and testing documents respectively, and the F1-measure is as the assessment criteria. The experimental results show that nine from 11 categories reaches 80% effectiveness of the documents classification, whereas the other two categories reached over 60% effectiveness of the documents classification. In sum, the average effectiveness of document classification in 11 categories is about 89%. Thus, the automatic classifier system can indeed reach the effectiveness of document classification.
引用
收藏
页码:3144 / 3149
页数:6
相关论文
共 50 条
  • [1] Applying Naive Bayes Classifier to Document Clustering
    Ji, Jie
    Zhao, Qiangfu
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2010, 14 (06) : 624 - 630
  • [2] A METHOD FOR DETECTING DOCUMENT ORIENTATION BY USING NAIVE BAYES CLASSIFIER
    Deng, Xue
    Guo, Jun
    Chen, Youguang
    Liu, Xiaoping
    [J]. 2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 429 - 432
  • [3] A Naive Bayes Classifier Based on Neighborhood Granulation
    Fu, Xingyu
    Chen, Yingyue
    Yao, Zhiyuan
    Chen, Yumin
    Zeng, Nianfeng
    [J]. ROUGH SETS, IJCRS 2022, 2022, 13633 : 132 - 142
  • [4] A Focused Crawler Based on Naive Bayes Classifier
    Wang, Wenxian
    Chen, Xingshu
    Zou, Yongbin
    Wang, Haizhou
    Dai, Zongkun
    [J]. 2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 517 - 521
  • [5] Naive Bayes Classifier Based Partitioner for MapReduce
    Chen, Lei
    Lu, Wei
    Bao, Ergude
    Wang, Liqiang
    Xing, Weiwei
    Cai, Yuanyuan
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (05) : 778 - 786
  • [6] Threshold-based Naive Bayes classifier
    Romano, Maurizio
    Contu, Giulia
    Mola, Francesco
    Conversano, Claudio
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (02) : 325 - 361
  • [7] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    [J]. GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [8] Automatic Exudate Detection with Improved Naive-Bayes Classifier
    Harangi, Balazs
    Antal, Balint
    Hajdu, Andras
    [J]. 2012 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2012,
  • [9] Design of agricultural ontology based on levy flight distributed optimization and Naive Bayes classifier
    Rajendran, Deepa
    Vigneshwari, S.
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2021, 46 (03):
  • [10] Building Naive Bayes Document Classifier Using Word Clusters Based on Bootstrap Averaging
    Wang Yuanzhe
    Zhang Qiang
    Bai Liyuan
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 202 - +