Statistical vs. Rule-based stemming for monolingual french retrieval

被引:0
|
作者
Majumder, Prasenjit [1 ]
Mitra, Mandar [1 ]
Datta, Kalyankumar [2 ]
机构
[1] Indian Stat Inst, CVPR Unit, Kolkata, India
[2] Jadavpur Univ, Dept EE, Kolkata, India
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes our approach to the 2006 Adhoc Monolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter's stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, besides a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected) by about 9-10%, and the performance of the statistical stemmer is comparable with that of the rule-based stemmer.
引用
收藏
页码:107 / +
页数:2
相关论文
共 50 条
  • [41] Rule-based vs. Training-based Extraction of Index Terms from Business Documents - How to Combine the Results
    Schuster, Daniel
    Hanke, Marcel
    Muthmann, Klemens
    Esser, Daniel
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [42] Rule-based safety vs adaptive safety: An articulation issue
    Falzon, Pierre
    HEALTHCARE SYSTEMS ERGONOMICS AND PATIENT SAFETY 2011: AN ALLIANCE BETWEEN PROFESSIONALS AND CITIZENS FOR PATIENT SAFETY AND QUALITY OF LIFE, 2011, : 16 - 21
  • [43] Incorporating statistical information of lexical dependency into a rule-based parser
    Roh, Yoon-Hyung
    Lee, Ki-Young
    Kim, Young-Gil
    PACLIC 23 - Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009, 2 : 493 - 500
  • [44] Boosting Statistical Tagger Accuracy with Simple Rule-Based Grammars
    Hulden, Mans
    Francom, Jerid
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2114 - 2117
  • [45] Multimedia data mining for building rule-based image retrieval systems
    Wang, DH
    Ma, XH
    2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 197 - 200
  • [46] EXPERIMENTAL INVESTIGATIONS OF UNCERTAINTY IN A RULE-BASED SYSTEM FOR INFORMATION-RETRIEVAL
    TONG, RM
    SHAPIRO, DG
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1985, 22 (03): : 265 - 282
  • [47] Risk-based vs Rule-based Electromagnetic Compatibility in Large Installations
    Leferink, Frank
    PROCEEDINGS OF THE 2018 IEEE 4TH GLOBAL ELECTROMAGNETIC COMPATIBILITY CONFERENCE (GEMCCON), 2018,
  • [48] Classification-Based Approach for Hybridizing Statistical and Rule-Based Machine Translation
    Park, Eun-Jin
    Kwon, Oh-Woog
    Kim, Kangil
    Kim, Young-Kil
    ETRI JOURNAL, 2015, 37 (03) : 541 - 550
  • [49] A rule-based approach to multiple statistical test analysis of binary data
    Molnau, WE
    Keats, JB
    IIE TRANSACTIONS, 1996, 28 (03) : 203 - 213
  • [50] Handwritten digit recognition using statistical and rule-based decision fusion
    Gorgevik, D
    Cakmakov, D
    Radevski, V
    11TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, PROCEEDINGS, 2002, : 131 - 135