Classification of Durian Characteristics for Semantic Representation from Web Documents

Cited by: 0
Authors
Abu Bakar, Zainab [1 ]
Ismail, Khairul Nurmazianna [1 ]
Affiliation
[1] Fac Comp & Math Sci, Dept Comp Sci, Shah Alam, Selangor, Malaysia
Keywords
semantic; Durian; HTML; RDF;
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
The Web contains an enormous amount of information represented in a variety of document structures. This information is scattered and redundant. Search engines are currently the main means of retrieving it, yet even the most popular search engines cannot satisfy user queries. Semantic technology can alleviate this problem. In this paper, only relevant HTML web documents on durian, also known as the king of fruits, are selected. The characteristics of durian are extracted from those HTML documents. These characteristics are then used in a semantic representation and stored along with their Uniform Resource Identifiers (URIs) in the Resource Description Framework (RDF). The RDF provides the ontology link to many other web documents on durian. An experiment on 40 HTML documents yields eleven new characteristics of durian that can be represented in RDF for a semantic search engine.
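The pipeline the abstract describes (extract characteristics from an HTML page, then store them with the page URI as RDF) can be illustrated with a minimal sketch. This is not the authors' method: the sample HTML, the `<dt>`/`<dd>` layout, the `example.org` URIs, the vocabulary namespace, and the characteristic names are all assumptions made up for illustration. It uses only the Python standard library and emits triples in N-Triples syntax.

```python
from html.parser import HTMLParser

# Hypothetical namespace, page URL, and markup; none of these come from the paper.
VOCAB = "http://example.org/durian/vocab#"
SAMPLE_URL = "http://example.org/pages/durian.html"
SAMPLE_HTML = """
<html><body>
<dl>
  <dt>Flesh colour</dt><dd>Yellow</dd>
  <dt>Aroma</dt><dd>Strong</dd>
</dl>
</body></html>
"""


class CharacteristicParser(HTMLParser):
    """Collect (name, value) pairs from <dt>/<dd> elements."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._tag = None   # "dt" or "dd" while inside one, else None
        self._name = None  # most recent <dt> text awaiting its <dd>

    def handle_starttag(self, tag, attrs):
        if tag in ("dt", "dd"):
            self._tag = tag

    def handle_endtag(self, tag):
        if tag in ("dt", "dd"):
            self._tag = None

    def handle_data(self, data):
        text = data.strip()
        if not text or self._tag is None:
            return
        if self._tag == "dt":
            self._name = text
        elif self._name is not None:
            self.pairs.append((self._name, text))
            self._name = None


def to_ntriples(subject_uri, pairs):
    """Turn (name, value) pairs into N-Triples lines about subject_uri."""
    lines = []
    for name, value in pairs:
        # Build a predicate URI from the characteristic name, e.g. "FleshColour".
        predicate = VOCAB + "".join(w.capitalize() for w in name.split())
        # Escape backslashes and double quotes inside the literal.
        literal = value.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f'<{subject_uri}> <{predicate}> "{literal}" .')
    return lines


parser = CharacteristicParser()
parser.feed(SAMPLE_HTML)
triples = to_ntriples(SAMPLE_URL, parser.pairs)
for line in triples:
    print(line)
```

Each output line ties one extracted characteristic to the URI of the page it came from, which is the property that lets a semantic search engine link back to the source documents. A real system would need extraction rules suited to the actual page layouts and an agreed vocabulary rather than the placeholder namespace used here.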
Pages: 111-115
Number of pages: 5