Monolingual Information Retrieval using Terrier: FIRE 2010 Experiments based on n-gram indexing

被引:1
|
作者
Vishwakarma, Santosh K. [1 ]
Lakhtaria, Karna Ljit I. [2 ]
Bhatnagar, Divya [3 ]
Sharma, Akhilesh K. [3 ]
机构
[1] Gyan Ganga Inst Technol & Sci, Jabalpur, Madhya Pradesh, India
[2] Auro Univ, Surat, Gujarat, India
[3] SPSU, Udaipur 313001, Rajasthan, India
关键词
Information Retrieval; N-gram; MAP; Pruning; Hindi Monolingual; TEXT;
D O I
10.1016/j.procs.2015.07.484
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
N-gram based indexing technique has been proved as a useful technique for efficient document retrieval. We applied the n-gram approach and performed experiments in Hindi language text collections. The experiments are performed on the dataset of FIRE 2010 Hindi text collections. We used the Terrier open search engine for experimental purpose. Our experiments state that 4-gram gives the best results among all n-grams of different length. The results show an increase in value of mean average precision. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:815 / 820
页数:6
相关论文
共 50 条
  • [1] An efficient document retrieval method using n-gram indexing
    Ogawa, Yasushi
    Matsuda, Toru
    [J]. Systems and Computers in Japan, 2002, 33 (02) : 54 - 63
  • [2] Application of variable length N-gram vectors to monolingual and bilingual information retrieval
    Gayo-Avello, D
    Alvarez-Gutiérrez, D
    Gayo-Avello, J
    [J]. MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 73 - 82
  • [3] Improving arabic information retrieval system using n-gram method
    Legal Informatics center, Lebanese University, Sami Solh Street-Bp5396/116, Lebanon
    不详
    不详
    [J]. WSEAS Trans. Comput., 1600, 4 (125-133):
  • [4] N-gram adaptation with dynamic interpolation coefficient using information retrieval technique
    Choi, Joon-Ki
    Oh, Yung-Hwan
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (09): : 2579 - 2582
  • [5] Techniques for gigabyte-scale n-gram based information retrieval on personal computers
    Miller, E
    Shen, D
    Liu, JL
    Nicholas, C
    Chen, T
    [J]. INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, PROCEEDINGS, 1999, : 1410 - 1416
  • [6] Searching Polyphonic Indonesian Folksongs Based on N-gram Indexing Technique
    Marsye, Aurora
    Adriani, Mirna
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 387 - 396
  • [7] Answering questions with an n-gram based passage retrieval engine
    Davide Buscaldi
    Paolo Rosso
    José Manuel Gómez-Soriano
    Emilio Sanchis
    [J]. Journal of Intelligent Information Systems, 2010, 34 : 113 - 134
  • [8] Answering questions with an n-gram based passage retrieval engine
    Buscaldi, Davide
    Rosso, Paolo
    Manuel Gomez-Soriano, Jose
    Sanchis, Emilio
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2010, 34 (02) : 113 - 134
  • [9] Advanced Information Extraction with n-gram based LSI
    Guven, Ahmet
    Bozkurt, O. Ozgur
    Kalipsiz, Oya
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 17, 2006, 17 : 13 - 18
  • [10] n-Gram-based indexing for Korean text retrieval
    Lee, JH
    Cho, HY
    Park, HR
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1999, 35 (04) : 427 - 441