Applying semantic links for classifying Web pages

Citations: 0
Authors
Choi, B [1]
Guo, Q [1]
Affiliations
[1] Louisiana Tech Univ, Coll Engn & Sci, Ruston, LA 71272 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic hypertext classification is an essential technique for organizing the vast number of Web pages and HTML documents on the Internet. One of the problems in classifying Web pages is that they are usually short and contain too little text to clearly identify their category. Text classification mechanisms that analyze only the contents of the document itself are therefore relatively ineffective on short Web pages. This paper proposes a new hypertext classification mechanism that addresses the problem by analyzing not only the Web page itself but also the Web pages it links to through the URLs contained within it. The URLs are treated as semantic links. The hypothesis is that the linked Web pages contain related information that helps identify the category of the Web page. Experimental results show that the proposed approach could increase the accuracy by 35% over the approach of analyzing only the Web page itself.
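The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which classifier or features the paper uses, so the keyword-overlap scorer, the function names (parse, tokens, classify, classify_with_links), the in-memory PAGES dictionary standing in for fetched pages, and the CATEGORY_TERMS table are all assumptions made for illustration.

```python
from collections import Counter
from html.parser import HTMLParser
import re


class PageParser(HTMLParser):
    """Collects text chunks and <a href> targets from an HTML string."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.chunks.append(data)


def parse(html):
    """Return (text, links) for one HTML document."""
    parser = PageParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks), parser.links


def tokens(text):
    """Lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def classify(words, category_terms):
    """Stand-in classifier (an assumption): pick the category whose
    term set matches the most words, or None if nothing matches."""
    counts = Counter(words)
    scores = {cat: sum(counts[t] for t in terms)
              for cat, terms in category_terms.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def classify_with_links(url, pages, category_terms):
    """Classify a page using its own text plus the text of linked pages."""
    text, links = parse(pages[url])
    words = tokens(text)
    for link in links:
        if link in pages:  # hypothetical lookup in place of an HTTP fetch
            linked_text, _ = parse(pages[link])
            words += tokens(linked_text)
    return classify(words, category_terms)


# Toy data, invented for this sketch.
PAGES = {
    "http://example.org/home": (
        '<p>Welcome to my page.</p>'
        '<a href="http://example.org/a">A</a>'
        '<a href="http://example.org/b">B</a>'
    ),
    "http://example.org/a": "<p>Football scores and match results.</p>",
    "http://example.org/b": "<p>League football fixtures.</p>",
}
CATEGORY_TERMS = {
    "sports": {"football", "match", "league"},
    "finance": {"stock", "market", "bank"},
}

home_text, _ = parse(PAGES["http://example.org/home"])
page_only = classify(tokens(home_text), CATEGORY_TERMS)
with_links = classify_with_links("http://example.org/home", PAGES, CATEGORY_TERMS)
print(page_only, with_links)
```

In this toy case the short home page gives the scorer nothing to work with on its own, while the text of the two linked pages supplies enough category terms to label it. A real system would replace the dictionary lookup with page retrieval and the overlap scorer with a trained text classifier.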
Pages: 148-153
Page count: 6
Related papers
50 in total
  • [1] CLASSIFYING WEB PAGES BY GENRE
    Mason, Jane E.
    Shepherd, Michael
    Duffy, Jack
    [J]. WEBIST 2009: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2009, : 651 - 658
  • [2] CLASSIFYING WEB PAGES WITH VISUAL FEATURES
    de Boer, Viktor
    van Someren, Maarten
    Lupascu, Tiberiu
    [J]. WEBIST 2010: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGY, VOL 1, 2010, : 245 - 252
  • [3] Semantic partitioning of web pages
    Vadrevu, S
    Gelgi, F
    Davulcu, H
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 107 - 118
  • [4] Recycling course web pages for the semantic web
    Motz, Regina
    Sosa, Raquel
    Rodriguez, Andrea
    [J]. LA-WEB 06: FOURTH LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2006, : 82 - +
  • [5] Classifying web pages using adaptive ontology
    Noh, S
    Seo, H
    Choi, J
    Choi, K
    Jung, G
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 2144 - 2149
  • [6] Bridging the WWW to the Semantic Web by automatic semantic tagging of Web pages
    Yang, HC
    [J]. Fifth International Conference on Computer and Information Technology - Proceedings, 2005, : 238 - 242
  • [7] Semantic analysis of web pages using web patterns
    Kudelka, Milos
    Snasel, Vaclav
    Lehecka, Ondrej
    El-Qawasmeh, Eyas
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 329 - +
  • [8] Semantic Annotation of Web Pages Using Web Patterns
    Kudelka, Milos
    Snasel, Vaclav
    Lehecka, Ondrej
    El-Qawasmeh, Eyas
    Pokorny, Jaroslav
    [J]. ADVANCED INTERNET BASED SYSTEMS AND APPLICATIONS, 2009, 4879 : 280 - +
  • [9] Representing and classifying arguments on the Semantic Web
    Rahwan, Iyad
    Banihashemi, Bita
    Reed, Chris
    Walton, Douglas
    Abdallah, Sherief
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2011, 26 (04): : 487 - 511
  • [10] Recognition of pornographic web pages by classifying texts and images
    Hu, Weiming
    Wu, Ou
    Chen, Zhouyao
    Fu, Zhouyu
    Maybank, Steve
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (06) : 1019 - 1034