Using Topic Identification in Chinese Information Retrieval

被引:0
|
作者
Yeh, Ching-Long [1 ]
Chen, Yi-Chun [1 ]
机构
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2009年 / 10卷 / 02期
关键词
Natural Language Processing; Shallow Parsing; Topic Identification; Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information retrieval is to identify documents, from text collections, which are relevant with respect to some query. In current information retrieval systems, users can query with an unordered set of keywords, a question or a sentence. A list of document links matching the query can be retrieved and ordered by relevancy between the query and the documents. In this article, we are concerned with a hypothesis that the discourse-level element, topic, could be used to contribute the calculations of information retrieval. Due to the phenomenon of zero anaphora frequently occurring in Chinese texts, the topics may be omitted and are not expressed on the surface text. The key elements of the centering model of local discourse coherence are employed to extract structures of discourse segments. We propose a topic identification method using the local discourse structure to recover the omissions of topics and identify the topics of documents in the text collection. Then the topic information is inserted into the text for creating better indices. The experiment results are demonstrated on a test collection which is taken from Chinese Information Retrieval Benchmark, version 3.0.
引用
收藏
页码:95 / 102
页数:8
相关论文
共 50 条
  • [21] Exploring Influence of Topic Segmentation on Information Retrieval Quality
    Shtekh, Gennady
    Kazakova, Polina
    Nikitinsky, Nikita
    Skachkov, Nikolay
    INTERNET SCIENCE (INSCI 2018), 2018, 11193 : 131 - 140
  • [22] Implicit feature identification in Chinese reviews using explicit topic mining model
    Xu, Hua
    Zhang, Fan
    Wang, Wei
    KNOWLEDGE-BASED SYSTEMS, 2015, 76 : 166 - 175
  • [23] Adaptive information retrieval: Introduction to the special topic issue of information processing and management
    Jose, Joemon M.
    Joho, Hideo
    van Rijsbergen, C. J.
    INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (06) : 1819 - 1821
  • [24] A Clustering Retrieval System of Chinese information
    Sha, Xin-Guang
    Liu, Yuan-Chao
    Liu, Ming
    Wang, Xiao-Long
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 255 - 260
  • [25] A Clustering Retrieval System of Chinese Information
    Sha, Sin-Guang
    Liu, Yuan-Chao
    Liu, Ming
    Wang, Xiao-Long
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2009, 12 (02): : 411 - 418
  • [26] Comparing representations in Chinese information retrieval
    Kwok, KL
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1997, : 34 - 41
  • [27] A Hybrid Chinese Information Retrieval Model
    Li, Zhihan
    Xu, Yue
    Geva, Shlomo
    ACTIVE MEDIA TECHNOLOGY, 2010, 6335 : 267 - 276
  • [28] Discovery of topic distribution through WWW information retrieval process
    Takama, Y
    Hirota, K
    IECON 2000: 26TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4: 21ST CENTURY TECHNOLOGIES AND INDUSTRIAL OPPORTUNITIES, 2000, : 1644 - 1647
  • [29] Information Retrieval based on Topic Map and Grid Agent Technologies
    Chen Jing
    Yu Ying
    2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 341 - +
  • [30] Applying Topic Segmentation to Document-Level Information Retrieval
    Shtekh, Gennady
    Kazakova, Polina
    Nikitinsky, Nikita
    Skachkov, Nikolay
    CEE-SECR'18: PROCEEDINGS OF THE 14TH CENTRAL AND EASTERN EUROPEAN SOFTWARE ENGINEERING CONFERENCE RUSSIA, 2018,