Fractional Similarity: Cross-Lingual Feature Selection for Search

被引:0
|
作者
Jagarlamudi, Jagadeesh [1 ]
Bennett, Paul N. [2 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Microsoft Res, Redmond, WA 98052 USA
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Training data as well as supplementary data such as usage-based click behavior may abound in one search market (i.e., a particular region, domain, or language) and be much scarcer in another market. Transfer methods attempt to improve performance in these resource-scarce markets by leveraging data across markets. However, differences in feature distributions across markets can change the optimal model. We introduce a method called Fractional Similarity, which uses query-based variance within a market to obtain more reliable estimates of feature deviations across markets. An empirical analysis demonstrates that using this scoring method as a feature selection criterion in cross-lingual transfer improves relevance ranking in the foreign language and compares favorably to a baseline based on KL divergence.
引用
收藏
页码:226 / +
页数:3
相关论文
共 50 条
  • [31] Domain Specific Cross-Lingual Knowledge Linking Based on Similarity Flooding
    Pan, Liangming
    Wang, Zhigang
    Li, Juanzi
    Tang, Jie
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2016, 2016, 9983 : 426 - 438
  • [32] A resource-light method for cross-lingual semantic textual similarity
    Glavas, Goran
    Franco-Salvador, Marc
    Ponzetto, Simone P.
    Rosso, Paolo
    KNOWLEDGE-BASED SYSTEMS, 2018, 143 : 1 - 9
  • [33] CROSS-LINGUAL TRANSFER FOR SPEECH PROCESSING USING ACOUSTIC LANGUAGE SIMILARITY
    Wu, Peter
    Shi, Jiatong
    Zhong, Yifan
    Watanabe, Shinji
    Black, Alan W.
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1050 - 1057
  • [34] Cross-lingual document similarity estimation and dictionary generation with comparable corpora
    Stajner, Tadej
    Mladenic, Dunja
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 58 (03) : 729 - 743
  • [35] Cross-lingual document similarity estimation and dictionary generation with comparable corpora
    Tadej Štajner
    Dunja Mladenić
    Knowledge and Information Systems, 2019, 58 : 729 - 743
  • [36] News Across Languages - Cross-Lingual Document Similarity and Event Tracking
    Rupnik, Jan
    Muhic, Andrej
    Leban, Gregor
    Skraba, Primoz
    Fortuna, Blaz
    Grobelnik, Marko
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 55 : 283 - 316
  • [37] Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification
    Wu, Hanqian
    Wang, Zhike
    Qing, Feng
    Li, Shoushan
    ELECTRONICS, 2021, 10 (03) : 1 - 14
  • [38] CROSS-LINGUAL FRAME SELECTION METHOD FOR POLYGLOT SPEECH SYNTHESIS
    Chen, Chia-Ping
    Huang, Yi-Chin
    Wu, Chung-Hsien
    Lee, Kuan-De
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4521 - 4524
  • [39] Cross-Lingual Blog Analysis by Cross-Lingual Comparison of Characteristic Terms and Blog Posts
    Nakasaki, Hiroyuki
    Kawaba, Mariko
    Utsuro, Takehito
    Fukuhara, Tomohiro
    Nakagawa, Hiroshi
    Kando, Noriko
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON UNIVERSAL COMMUNICATION, 2008, : 105 - +
  • [40] DiTTO : A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
    Kumar, Shanu
    Soujanya, Abbaraju
    Dandapat, Sandipan
    Sitaram, Sunayana
    Choudhury, Monojit
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 385 - 406