Using Topic Identification in Chinese Information Retrieval

被引:0
|
作者
Yeh, Ching-Long [1 ]
Chen, Yi-Chun [1 ]
机构
[1] Tatung Univ, Dept Comp Sci & Engn, Taipei, Taiwan
来源
JOURNAL OF INTERNET TECHNOLOGY | 2009年 / 10卷 / 02期
关键词
Natural Language Processing; Shallow Parsing; Topic Identification; Information Retrieval;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Information retrieval is to identify documents, from text collections, which are relevant with respect to some query. In current information retrieval systems, users can query with an unordered set of keywords, a question or a sentence. A list of document links matching the query can be retrieved and ordered by relevancy between the query and the documents. In this article, we are concerned with a hypothesis that the discourse-level element, topic, could be used to contribute the calculations of information retrieval. Due to the phenomenon of zero anaphora frequently occurring in Chinese texts, the topics may be omitted and are not expressed on the surface text. The key elements of the centering model of local discourse coherence are employed to extract structures of discourse segments. We propose a topic identification method using the local discourse structure to recover the omissions of topics and identify the topics of documents in the text collection. Then the topic information is inserted into the text for creating better indices. The experiment results are demonstrated on a test collection which is taken from Chinese Information Retrieval Benchmark, version 3.0.
引用
收藏
页码:95 / 102
页数:8
相关论文
共 50 条
  • [31] AUTOMATIC TOPIC DETECTION STRATEGY FOR INFORMATION RETRIEVAL IN SPOKEN DOCUMENT
    Jin, Shan
    Misra, Hemant
    Sikora, Thomas
    Jose, Joemon
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 300 - +
  • [32] Continual Learning of Long Topic Sequences in Neural Information Retrieval
    Gerald, Thomas
    Soulier, Laure
    ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 244 - 259
  • [33] Topic Models Ensembles for AD-HOC Information Retrieval
    Ormeno, Pablo
    Mendoza, Marcelo
    Valle, Carlos
    INFORMATION, 2021, 12 (09)
  • [35] Topic based language models for ad hoc information retrieval
    Azzopardi, L
    Girolami, M
    van Rijsbergen, CJ
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 3281 - 3286
  • [36] A topic-specific dictionary construction algorithm for information retrieval
    Xu, Jingfang
    Li, Xing
    Li, Yue
    Jisuanji Gongcheng/Computer Engineering, 2005, 31 (21): : 143 - 145
  • [37] Human motion retrieval using topic model
    Zhu, Mingyang
    Sun, Huaijiang
    Lan, Rongyi
    Li, Bin
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2012, 23 (05) : 469 - 476
  • [38] Assisting web document retrieval with topic identification in tourism domain
    Prasath, Rajendra
    Kumar, Vijai
    Sarkar, Sudeshna
    WEB INTELLIGENCE, 2015, 13 (01) : 31 - 41
  • [39] Using heterogeneous linguistic knowledge in local coherence identification for information retrieval
    Chan, SWK
    JOURNAL OF INFORMATION SCIENCE, 2000, 26 (05) : 313 - 328
  • [40] Disease Detection and Identification Using Sequence Data and Information Retrieval Methods
    Joshi, Sankranti
    Radhika, Pai M.
    Manohara, Pai M. M.
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 565 - 572