An intelligent system for semantic information retrieval information from textual web documents

被引:0
|
作者
Karthik, Mukundan [1 ]
Marikkannan, Mariappan [2 ]
Kannan, Arputharaj [1 ]
机构
[1] Anna Univ, Coll Engn, Dept Comp Sci, Madras 600025, Tamil Nadu, India
[2] IRTT, Dept Comp Sci & Engn, Erode 638316, India
来源
关键词
semantic relations; SEMINRET algorithm; text mining; Resources Description Framework (RDF); information extraction (IE); part-of-speech (POS) tag intelligent information retrieval;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text data, which are represented as free text in World Wide Web (WWW), are inherently unstructured and hence it becomes difficult to directly process the text data by computer programs. There has been great interest in text mining techniques recently for helping users to quickly gain knowledge from the Web. Text mining technologies usually involve tasks such as text refining which transforms free text into an intermediate representation form which is machine-processable and knowledge distillation which deduces patterns or knowledge from the intermediate form. These text representation methodologies consider documents as bags of words and ignore the meanings and ideas their authors want to convey. As terms are treated as individual items in such simplistic representations, terms lose their semantic relations and texts lose their original meanings. In this paper, we propose a system that overcomes the limitations of the existing technologies to retrieve the information from the knowledge discovered through data mining based on the detailed meanings of the text. For this, we propose a Knowledge representation technique, which uses Resources Description Framework (RDF) meta-data to represent the semantic relations, which are extracted from textual web document using natural language processing techniques. The main objective of the creation of RDF metadata in this system is to have flexibility for easy retrieval of the semantic information effectively. We also propose an effective SEMantic INformation RETrieval algorithm called SEMINRET algorithm. The experimental results obtained from this system show that the computations of Precision and Recall in RDF databases are highly accurate when compared to XML databases. Moreover, it is observed from our experiments that the document retrieval from the RDF database is more efficient than the document retrieval using XML databases. http://download.springer.com/static/pdf/268/chp%253A10.1007%252F978-3-540-85303-9_13.pdf?originUrl=http%3A%2F%2Flink.springer.com%2Fchapter%2F10.1007%2F978-3-540-85303-9_13 & acl=%2Fstatic%2Fpdf%2F268%2Fchp%25253A10.1007%25252F978-3-540-85303-9_13.pdf%3ForiginUrl%3Dhttp%253A%252F%252Flink.springer.com%252Fchapter%252F10.1007%252F978-3-540-85303-9_13*similar to hmac=b6bcdacdbafdb16067bb029d6343c741b0e1bde41f7f0fc60d5ece47de553083
引用
收藏
页码:135 / +
页数:3
相关论文
共 50 条
  • [41] Research on effective Web information retrieval based on Semantic Web
    Xiao Min
    Xiong Qianxing
    Wang Chunhua
    Pu Qiumei
    [J]. DCABES 2007 PROCEEDINGS, VOLS I AND II, 2007, : 903 - 905
  • [42] Intelligent mining of safety hazard information from construction documents using semantic similarity and information entropy
    Tian, Dan
    Li, Mingchao
    Shen, Yang
    Han, Shuai
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [43] A WWW intelligent information retrieval system utilizing a semantic associative search method
    Kiyoki, Y
    Chen, X
    Kitagawa, T
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, 1998, : 93 - 102
  • [44] Information retrieval from spoken documents
    Fapso, M
    Smrz, P
    Schwarz, P
    Szöke, I
    Schwarz, M
    Cernocky, J
    Karafiát, M
    Burget, L
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 410 - 416
  • [45] Information Retrieval from Documents: A Survey
    M. Mitra
    B.B. Chaudhuri
    [J]. Information Retrieval, 2000, 2 (2-3): : 141 - 163
  • [46] An intelligent information retrieval system model
    Chen, JJ
    Liu, LZ
    Song, HT
    Yu, XL
    [J]. PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2500 - 2503
  • [47] Intelligent Retrieval System on Legal Information
    Le, Hoang H.
    Cong-Thanh Nguyen
    Ngo, Thinh P.
    Vinh, Phu V.
    Nguyen, Binh T.
    Huynh, Anh T.
    Nguyen, Hien D.
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I, 2023, 13995 : 97 - 108
  • [48] Hyper-Textual Language Model for Web Information Retrieval
    Xie, Ying
    Raghavan, Vijay V.
    Young, Andrew
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 68 - +
  • [49] Term frequency occurrences on web pages for textual information retrieval
    Sivapathasundaram, Karthika
    Cheng, Xiaochun
    Petridis, Miltos
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 585 - 590
  • [50] Semantic and Keyword Based Web Techniques in Information Retrieval
    Mala, Vajenti
    Lobiyal, D. K.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 23 - 26