An intelligent system for semantic information retrieval information from textual web documents

被引:0
|
作者
Karthik, Mukundan [1 ]
Marikkannan, Mariappan [2 ]
Kannan, Arputharaj [1 ]
机构
[1] Anna Univ, Coll Engn, Dept Comp Sci, Madras 600025, Tamil Nadu, India
[2] IRTT, Dept Comp Sci & Engn, Erode 638316, India
来源
关键词
semantic relations; SEMINRET algorithm; text mining; Resources Description Framework (RDF); information extraction (IE); part-of-speech (POS) tag intelligent information retrieval;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text data, which are represented as free text in World Wide Web (WWW), are inherently unstructured and hence it becomes difficult to directly process the text data by computer programs. There has been great interest in text mining techniques recently for helping users to quickly gain knowledge from the Web. Text mining technologies usually involve tasks such as text refining which transforms free text into an intermediate representation form which is machine-processable and knowledge distillation which deduces patterns or knowledge from the intermediate form. These text representation methodologies consider documents as bags of words and ignore the meanings and ideas their authors want to convey. As terms are treated as individual items in such simplistic representations, terms lose their semantic relations and texts lose their original meanings. In this paper, we propose a system that overcomes the limitations of the existing technologies to retrieve the information from the knowledge discovered through data mining based on the detailed meanings of the text. For this, we propose a Knowledge representation technique, which uses Resources Description Framework (RDF) meta-data to represent the semantic relations, which are extracted from textual web document using natural language processing techniques. The main objective of the creation of RDF metadata in this system is to have flexibility for easy retrieval of the semantic information effectively. We also propose an effective SEMantic INformation RETrieval algorithm called SEMINRET algorithm. The experimental results obtained from this system show that the computations of Precision and Recall in RDF databases are highly accurate when compared to XML databases. Moreover, it is observed from our experiments that the document retrieval from the RDF database is more efficient than the document retrieval using XML databases. http://download.springer.com/static/pdf/268/chp%253A10.1007%252F978-3-540-85303-9_13.pdf?originUrl=http%3A%2F%2Flink.springer.com%2Fchapter%2F10.1007%2F978-3-540-85303-9_13 & acl=%2Fstatic%2Fpdf%2F268%2Fchp%25253A10.1007%25252F978-3-540-85303-9_13.pdf%3ForiginUrl%3Dhttp%253A%252F%252Flink.springer.com%252Fchapter%252F10.1007%252F978-3-540-85303-9_13*similar to hmac=b6bcdacdbafdb16067bb029d6343c741b0e1bde41f7f0fc60d5ece47de553083
引用
收藏
页码:135 / +
页数:3
相关论文
共 50 条
  • [1] Intelligent support for information retrieval of web documents
    Koval, R
    Návrat, P
    [J]. COMPUTING AND INFORMATICS, 2002, 21 (05) : 509 - 528
  • [2] Semantic web-oriented intelligent information retrieval system
    Li, Wenjie
    Zhang, Xiaohuan
    Wei, Xiaofei
    [J]. BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 357 - 361
  • [3] Location based Semantic Information Retrieval from Web Documents using Web Crawler
    Archana, A. B.
    Kumar, Jalesh
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2015, : 370 - 375
  • [4] A natural language interface for information retrieval on semantic web documents
    Quaresma, P
    Rodrigues, IP
    [J]. ADVANCES IN WEB INTELLIGENCE, 2003, 2663 : 142 - 154
  • [5] A Novel Imaging Approach of Web Documents Based on Semantic Inclusion of Textual and Non - Textual Information
    Zachariasova, Martina
    Kamencay, Patrik
    Hudec, Robert
    Benco, Miroslav
    Matuska, Slavomir
    [J]. 2014 AASRI CONFERENCE ON CIRCUIT AND SIGNAL PROCESSING (CSP 2014), 2014, 9 : 31 - 36
  • [6] Semantic information retrieval on the web
    Sezer, Ebru
    Yazici, Adnan
    Yarimagan, Unal
    [J]. ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4243 : 158 - 167
  • [7] Ontology based Fuzzy Classification of Web Documents for Semantic Information Retrieval
    Joshi, Kajal
    Verma, Ashish
    Kandpal, Ankita
    Garg, Shalini
    Chauhan, Rashmi
    Goudar, R. H.
    [J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2013, : 1 - 5
  • [8] Universal information retrieval system in semantic Web environment
    Yoo, JM
    Myaeng, SH
    Jin, Y
    Lee, MH
    [J]. Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 348 - 353
  • [9] Multilingual and multimedia Information Retrieval from Web documents
    Gatius, M
    Bertran, M
    Rodriguez, H
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 20 - 24
  • [10] Semantic system for intelligent searching of information by context for the Web
    Aguilar, Jose
    [J]. CIENCIA E INGENIERIA, 2011, 32 (03): : 141 - 150