Rhetorical structure theory for content-based indexing and retrieval of Web documents

被引:1
|
作者
Marir, F [1 ]
Haouam, K [1 ]
机构
[1] Univ N London, Sch Informat & Multimedia Technol, London N7 8DB, England
关键词
document indexing and retrieval; rhetorical structure theory;
D O I
10.1109/ITRE.2004.1393667
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of information available on the Internet is currently growing at an incredible rate. However, the lack of efficient indexing is still a major barrier to effective information retrieval on the web. This paper presents the design of a technique for content-based indexing and retrieval of relevant documents from a large collection of documents such as the Internet. The technique aims at improving the quality of retrieval by capturing the semantics of the documents. It introduces a thematic relationship between parts of text using a linguistics theory called Rhetorical Structure Theory (RST) based on cue phrases to determine the set of rhetorical relations. Once these structures are determined, they can be saved into a database. We can then query that collection using not only keywords, as traditional Information retrieval systems, but also rhetorical relations. The indexing and retrieval technique described in this paper is under development and initial results on a small number of documents have been very successful.
引用
下载
收藏
页码:160 / 164
页数:5
相关论文
共 50 条
  • [31] System profiles in content-based image indexing and retrieval
    Esin Guldogan
    Moncef Gabbouj
    Signal, Image and Video Processing, 2010, 4 : 463 - 480
  • [32] AN INDEXING SCHEME FOR CONTENT-BASED RETRIEVAL OF IMAGES BY SHAPE
    Mocanu, Irina
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2006, 68 (01): : 25 - 34
  • [33] Indexing of baseball telecast for content-based video retrieval
    Kawashima, T
    Tateyama, K
    Iijima, T
    Aoki, Y
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 871 - 874
  • [34] Fast indexing and searching for content-based image retrieval
    You, J
    Shen, H
    VISUAL INFORMATION PROCESSING VII, 1998, 3387 : 212 - 218
  • [35] Ehipasiko: A content-based image indexing and retrieval system
    Teng, Shyh Wei
    Ting, Kai Ming
    Advances in Intelligent IT: Active Media Technology 2006, 2006, 138 : 436 - 437
  • [36] Automatic indexing of news video for the content-based retrieval
    Yang, MS
    Yoo, CJ
    Chang, OB
    INPUT/OUTPUT AND IMAGING TECHNOLOGIES, 1998, 3422 : 176 - 186
  • [37] An efficient indexing method for content-based image retrieval
    Feng, Deying
    Yang, Jie
    Liu, Congxin
    NEUROCOMPUTING, 2013, 106 : 103 - 114
  • [38] Content-based image indexing and retrieval in compressed domain
    Jiang, J
    ADVANCES IN MODELLING, ANIMATION AND RENDERING, 2002, : 39 - 64
  • [39] Content-based image indexing and retrieval with XML representations
    Azzam, IA
    Charlapally, AG
    Leung, CHC
    Harwood, JF
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 181 - 185
  • [40] System profiles in content-based image indexing and retrieval
    Guldogan, Esin
    Gabbouj, Moncef
    SIGNAL IMAGE AND VIDEO PROCESSING, 2010, 4 (04) : 463 - 480