Ontology-based automatic classification and ranking for web documents

Cited by: 9
Authors
Fang, Jun [1]
Guo, Lei [1]
Wang, XiaoDong [1]
Yang, Ning [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Control & Networks Lab, Xian, Peoples R China
DOI
10.1109/FSKD.2007.432
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The process of web document classification involves calculating similarities between documents and categories by using the information extracted from them. In recent years, ontology-based web document classification has been introduced to address two weaknesses of traditional machine learning algorithms: the need for classifier training and the failure to consider semantic relations between words. However, previous work on ontology-based web document classification overlooks two important issues: automatic ontology construction and the ranking of classified documents. To solve these problems, this paper proposes an ontology-based web document classification and ranking method. First, weighted term sets are extracted from web documents, and an ontology is built using an effective ontology construction method that clarifies and augments an existing ontology. Then, the similarity score between documents and the ontology is computed based on WordNet using the Earth Mover's Distance (EMD) method. Finally, web documents are assigned to categories according to the similarity score, and a simple ranking method is used to sort the documents within the same category. The experimental results show that the classification algorithm achieves better precision and recall than the adaptive KNN method and is competitive with the SVM method; the ranking method also performs well.
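The three steps in the abstract (weighted term extraction, similarity scoring, category assignment with ranking) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the toy categories, and the sample documents are all hypothetical, term weights are plain normalized frequencies, and the similarity function is a simple weighted-overlap score standing in for the WordNet-based EMD computation, which the abstract does not specify in detail.

```python
from collections import Counter, defaultdict


def weighted_terms(text):
    """Return {term: weight}, where weights are normalized term frequencies.

    Assumption: the paper's actual weighting scheme is not given in the abstract.
    """
    tokens = [t for t in text.lower().split() if t.isalpha()]
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: n / total for term, n in counts.items()} if total else {}


def similarity(doc_terms, category_terms):
    """Stand-in score: summed minimum weight over shared terms (0.0 to 1.0).

    This is NOT Earth Mover's Distance and does not use WordNet; it only
    plays the role of "a similarity score between document and category".
    """
    return sum(
        min(weight, category_terms[term])
        for term, weight in doc_terms.items()
        if term in category_terms
    )


def classify_and_rank(docs, categories):
    """Assign each document to its best-scoring category, then sort each
    category's documents by descending score."""
    ranked = defaultdict(list)
    for doc_id, text in docs.items():
        terms = weighted_terms(text)
        scores = {name: similarity(terms, cat) for name, cat in categories.items()}
        best = max(scores, key=scores.get)
        ranked[best].append((doc_id, scores[best]))
    for name in ranked:
        ranked[name].sort(key=lambda pair: pair[1], reverse=True)
    return dict(ranked)


# Hypothetical category term sets (in the paper these come from the ontology).
categories = {
    "sports": {"football": 0.5, "team": 0.3, "goal": 0.2},
    "finance": {"bank": 0.5, "stock": 0.3, "market": 0.2},
}

# Hypothetical documents.
docs = {
    "d1": "football team wins",
    "d2": "stock market bank report",
    "d3": "football goal goal goal",
}

result = classify_and_rank(docs, categories)
for category, items in result.items():
    print(category, [doc_id for doc_id, _ in items])
```

In this toy run, each document lands in the category whose term set it overlaps most, and documents sharing a category are ordered by that same score. A document with no overlap at all would still be assigned to the first category listed, so a real system would likely need a minimum-score threshold.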
Pages: 627-631
Number of pages: 5