Intelligent support for information retrieval of web documents

被引:0
|
作者
Koval, R [1 ]
Návrat, P [1 ]
机构
[1] Slovak Univ Technol Bratislava, Dept Comp Sci & Engn, Bratislava 81219, Slovakia
关键词
intelligent information retrieval; suffix tree clustering algorithm; click-stream analysis; web tool; search agent;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main goal of this research was to investigate the means of intelligent support for retrieval of web documents. We have proposed the architecture of the web tool system - Trillian, which discovers the interests of users without their interaction and uses them for autonomous searching of related web content. Discovered pages are suggested to the user. The discovery of user interests is based on analysis of documents visited by the users previously. We have created a module for completely transparent tracking of the user's movement on the web, which logs both visited URLs and contents of web pages. The post analysis step is based on a variant of the suffix tree clustering algorithm. We primarily focus on overall Trillian architecture design and the process of discovering topics of interests. We have implemented an experimental prototype of Trillian and evaluated the quality, speed and usefulness of the proposed system. We have shown that clustering is a feasible technique for extraction of interests from web documents. We consider the proposed architecture to be quite promising and suitable for future extensions.
引用
收藏
页码:509 / 528
页数:20
相关论文
共 50 条
  • [1] An intelligent system for semantic information retrieval information from textual web documents
    Karthik, Mukundan
    Marikkannan, Mariappan
    Kannan, Arputharaj
    [J]. COMPUTATIONAL FORENSICS, PROCEEDINGS, 2008, 5158 : 135 - +
  • [2] SemCrawl: Framework for Crawling Ontology Annotated Web Documents for Intelligent Information Retrieval
    Dhingra, Vandana
    Bhatia, Komal Kumar
    [J]. INTELLIGENT DISTRIBUTED COMPUTING, 2015, 321 : 213 - 223
  • [3] Web service for intelligent retrieval of normative information
    Bravo-Aranda, G.
    Herndadez-Rodriguez, F.
    Fernandez-de-la-Puente, A.
    [J]. EWORK AND EBUSINESS IN ARCHITECTURE, ENGINEERIN G AND CONSTRUCTION, 2006, : 543 - +
  • [4] Multilingual and multimedia Information Retrieval from Web documents
    Gatius, M
    Bertran, M
    Rodriguez, H
    [J]. 15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 20 - 24
  • [5] Artificial Intelligent Information Retrieval Using Assigning Context of Documents
    Liu Yong-Min
    Cheng Shu
    [J]. NSWCTC 2009: INTERNATIONAL CONFERENCE ON NETWORKS SECURITY, WIRELESS COMMUNICATIONS AND TRUSTED COMPUTING, VOL 2, PROCEEDINGS, 2009, : 592 - +
  • [6] Intelligent Interface for Web Information Retrieval with Document Understanding
    Khokale, Rahul S.
    Atique, Mohammad
    [J]. HUMAN-COMPUTER INTERACTION: APPLICATIONS AND SERVICES, PT III, 2014, 8512 : 21 - 31
  • [7] Distributed visual reasoning for intelligent information retrieval on the Web
    Lee, C
    Chen, YT
    [J]. INTERACTING WITH COMPUTERS, 2000, 12 (05) : 445 - 467
  • [8] A natural language interface for information retrieval on semantic web documents
    Quaresma, P
    Rodrigues, IP
    [J]. ADVANCES IN WEB INTELLIGENCE, 2003, 2663 : 142 - 154
  • [9] An automatic classification technique and tool for information retrieval of web documents
    Di Martino, B
    Mazzocca, N
    Squeglia, A
    Mazzeo, A
    [J]. CONCURRENT ENGINEERING: ENHANCED INTEROPERABLE SYSTEMS, 2003, : 1043 - 1050
  • [10] Information granulation for Web based information retrieval support systems
    Yao, JT
    Yao, YY
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: TOOLS AND TECHNOLOGY V, 2003, 5098 : 138 - 146