Statistical vs. Rule-based stemming for monolingual French retrieval

Authors
Majumder, Prasenjit [1]
Mitra, Mandar [1]
Datta, Kalyankumar [2]
Affiliations
[1] Indian Stat Inst, CVPR Unit, Kolkata, India
[2] Jadavpur Univ, Dept EE, Kolkata, India
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
This paper describes our approach to the 2006 Adhoc Monolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter's stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, besides a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected) by about 9-10%, and the performance of the statistical stemmer is comparable with that of the rule-based stemmer.
Pages: 107 / +
Page count: 2