Ranking vs. Classification: a Case Study in Mining Organization Name Translation from Snippets

Cited by: 1
Authors
Yang, Muyun [1]
Shi, Zhenyong [1]
Li, Sheng [1]
Zhao, Tiejun [1]
Qi, HaoLiang [2]
Affiliations
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Dept Comp, Harbin 150001, Peoples R China
Keywords
snippet mining; organization name translation; ranking; classification; SVM
DOI
10.1109/IALP.2009.73
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Both classification and ranking strategies have been reported to work well for mining named entity (NE) translations from the snippets returned by web search engines. Taking organization names and their translations, the most challenging case, as an example, this paper conducts a contrastive study of the two strategies under the SVM framework. We empirically show that translation ranking achieves the best performance across various data settings, with a best Top-1 precision of up to 65.75%. We conclude that, compared with the classification strategy, the ranking strategy is better suited to such snippet-based translation mining, in which the imbalanced data problem prevails.
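To make the contrast in the abstract concrete, the following is a minimal sketch, not the paper's implementation. It assumes each query (an organization name) comes with a list of candidate translations, each represented by a feature vector and a 0/1 correctness label. The function names, the toy features, and the scoring functions are all hypothetical. The sketch shows how the same candidates become independent instances for a classifier versus within-query difference pairs for a pairwise ranker, and how Top-1 precision can be computed for any scoring function.

```python
def classification_instances(queries):
    """Classification view: every candidate is an independent (features, label) instance."""
    return [(feats, label) for candidates in queries for feats, label in candidates]


def pairwise_ranking_instances(candidates):
    """Pairwise ranking view for one query (an assumed Ranking-SVM-style transform).

    Each (correct, incorrect) candidate pair yields a difference vector labeled +1
    and its negation labeled -1, so the two classes are balanced by construction.
    """
    positives = [f for f, y in candidates if y == 1]
    negatives = [f for f, y in candidates if y == 0]
    pairs = []
    for p in positives:
        for n in negatives:
            diff = [a - b for a, b in zip(p, n)]
            pairs.append((diff, 1))
            pairs.append(([-d for d in diff], -1))
    return pairs


def top1_precision(queries, score):
    """Fraction of queries whose highest-scoring candidate is a correct translation."""
    hits = 0
    for candidates in queries:
        best_feats, best_label = max(candidates, key=lambda c: score(c[0]))
        hits += int(best_label == 1)
    return hits / len(queries)


# Toy data (hypothetical): two queries, each with a few candidates.
queries = [
    [([0.9, 0.1], 1), ([0.2, 0.5], 0), ([0.1, 0.3], 0)],
    [([0.4, 0.2], 0), ([0.7, 0.6], 1)],
]

flat = classification_instances(queries)
n_pos = sum(1 for _, y in flat if y == 1)
n_neg = sum(1 for _, y in flat if y == 0)
print("classification instances (pos, neg):", n_pos, n_neg)

pairs = pairwise_ranking_instances(queries[0])
print("pairwise instances for query 1:", len(pairs))

print("Top-1 precision, score = first feature:", top1_precision(queries, lambda f: f[0]))
print("Top-1 precision, score = second feature:", top1_precision(queries, lambda f: f[1]))
```

In this toy setting the classification view has more negative than positive instances, while the pairwise view produces equal numbers of +1 and -1 instances, which is one plausible reading of why ranking may cope better with imbalanced candidates. In practice the scoring function would come from a trained model rather than a single feature.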
Pages: 308-313
Number of pages: 6