Using Topic Identification in Chinese Information Retrieval

Authors
Yeh, Ching-Long [1 ]
Chen, Yi-Chun [1 ]
Affiliation
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei, Taiwan
Source
JOURNAL OF INTERNET TECHNOLOGY | 2009 / Vol. 10 / No. 02
Keywords
Natural Language Processing; Shallow Parsing; Topic Identification; Information Retrieval
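DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract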
Information retrieval identifies the documents in a text collection that are relevant to a given query. In current information retrieval systems, users can query with an unordered set of keywords, a question, or a sentence. The system retrieves a list of links to matching documents, ordered by the relevance of each document to the query. In this article, we examine the hypothesis that a discourse-level element, the topic, can contribute to the calculations used in information retrieval. Because zero anaphora occurs frequently in Chinese texts, topics are often omitted and do not appear in the surface text. We employ the key elements of the centering model of local discourse coherence to extract the structures of discourse segments. We propose a topic identification method that uses this local discourse structure to recover omitted topics and to identify the topics of the documents in the text collection. The topic information is then inserted into the text to create better indices. Experimental results are reported on a test collection taken from the Chinese Information Retrieval Benchmark, version 3.0.
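
The idea of recovering omitted topics and inserting them into the text before indexing can be illustrated with a minimal Python sketch. It is not the authors' method: the centering-based analysis is replaced here by a much cruder assumption, namely that a clause with an omitted topic inherits the most recent explicit topic. The function names `recover_topics` and `index_terms`, the `(topic, tokens)` clause representation, and the sample clauses are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def recover_topics(clauses):
    """Fill omitted topics (None) with the most recent explicit topic.

    Assumption: each clause is a (topic, tokens) pair, with topic set to
    None when it is omitted (zero anaphora). Carrying the last explicit
    topic forward is a simplification, not the centering model itself.
    """
    recovered = []
    current = None
    for topic, tokens in clauses:
        if topic is not None:
            current = topic
        recovered.append((current, tokens))
    return recovered


def index_terms(clauses):
    """Count term frequencies after inserting recovered topics into the text."""
    counts = Counter()
    for topic, tokens in recover_topics(clauses):
        terms = list(tokens)
        # Insert the topic only where it is missing from the surface text.
        if topic is not None and topic not in terms:
            terms.insert(0, topic)
        counts.update(terms)
    return counts


# Hypothetical pre-segmented clauses; the topic is omitted in the last two.
doc = [
    ("張三", ["張三", "買", "書"]),
    (None, ["回家"]),
    (None, ["讀", "書"]),
]

tf = index_terms(doc)
print(tf["張三"], tf["書"])
```

In this toy document the topic term appears once in the surface text but is counted in every clause it governs once the omissions are filled in, which is the kind of change to index statistics that the abstract refers to.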
Pages: 95-102
Page count: 8