Statistical vs. Rule-based stemming for monolingual French retrieval

Authors
Majumder, Prasenjit [1]
Mitra, Mandar [1]
Datta, Kalyankumar [2]
Affiliations
[1] Indian Stat Inst, CVPR Unit, Kolkata, India
[2] Jadavpur Univ, Dept EE, Kolkata, India
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper describes our approach to the 2006 Adhoc Monolingual Information Retrieval run for French. The goal of our experiment was to compare the performance of a proposed statistical stemmer with that of a rule-based stemmer, specifically the French version of Porter's stemmer. The statistical stemming approach is based on lexicon clustering, using a novel string distance measure. We submitted three official runs, in addition to a baseline run that uses no stemming. The results show that stemming significantly improves retrieval performance (as expected), by about 9-10%, and that the performance of the statistical stemmer is comparable to that of the rule-based stemmer.
Pages: 107 / +
Page count: 2