Language-Agnostic Representation Learning for Product Search on E-Commerce Platforms

被引:17
|
作者
Ahuja, Aman [1 ,2 ]
Rao, Nikhil [2 ]
Katariya, Sumeet [2 ]
Subbian, Karthik [2 ]
Reddy, Chandan K. [1 ]
机构
[1] Virginia Tech, Arlington, VA 22203 USA
[2] Amazon, Palo Alto, CA USA
关键词
Product search; deep learning; E-commerce; multi-task learning; cross-lingual models;
D O I
10.1145/3336191.3371852
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Product search forms an indispensable component of any e-commerce service, and helps customers find products of their interest from a large catalog on these websites. When products that are irrelevant to the search query are surfaced, it leads to a poor customer experience, thus reducing user trust and increasing the likelihood of churn. While identifying and removing such results from product search is crucial, doing so is a burdensome task that requires large amounts of human annotated data to train accurate models. This problem is exacerbated when products are cross-listed across countries that speak multiple languages, and customers specify queries in multiple languages and from different cultural contexts. In this work, we propose a novel multi-lingual multi-task learning framework, to jointly train product search models on multiple languages, with limited amount of training data from each language. By aligning the query and product representations from different languages into a language-independent vector space of queries and products, respectively, the proposed model improves the performance over baseline search models in any given language. We evaluate the performance of our model on real data collected from a leading e-commerce service. Our experimental evaluation demonstrates up to 23% relative improvement in the classification F1-score compared to the state-of-the-art baseline models.
引用
收藏
页码:7 / 15
页数:9
相关论文
共 50 条
  • [1] LaTeX-Numeric: Language-agnostic Text attribute eXtraction for E-commerce Numeric Attributes
    Mehta, Kartik
    Oprea, Ioana
    Rasiwasia, Nikhil
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, : 272 - 279
  • [2] Gated Heterogeneous Graph Representation Learning for Shop Search in E-Commerce
    Niu, Xichuan
    Li, Bofang
    Li, Chenliang
    Xiao, Rong
    Sun, Haochuan
    Wang, Honggang
    Deng, Hongbo
    Chen, Zhenzhong
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2165 - 2168
  • [3] Taxation on duopoly e-commerce platforms and their search environments
    Sangita Poddar
    Tanmoyee Banerjee(Chatterjee)
    Swapnendu Banerjee
    [J]. SN Business & Economics, 3 (8):
  • [4] Consumer Search and Product Returns in E-Commerce
    Janssen, Maarten
    Williams, Cole
    [J]. AMERICAN ECONOMIC JOURNAL-MICROECONOMICS, 2024, 16 (02) : 387 - 419
  • [5] Building Language-Agnostic Grounded Language Learning Systems
    Kery, Caroline
    Pillai, Nisha
    Matuszek, Cynthia
    Ferraro, Francis
    [J]. 2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2019,
  • [6] Deep Learning Based Sentiment Aware Ranking for E-commerce Product Search
    Jbene, Mourad
    Tigani, Smail
    [J]. ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 87 - 97
  • [7] Learning and Transferring IDs Representation in E-commerce
    Zhao, Kui
    Li, Yuechuan
    Shuai, Zhaoqian
    Yang, Cheng
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1031 - 1039
  • [8] Duplicate product record detection engine for e-commerce platforms
    Albayrak, Osman Semih
    Aytekin, Tevfik
    Kalayci, Tolga Ahmet
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193
  • [9] AliISA: Creating an Interactive Search Experience in E-commerce Platforms
    Xiao, Fei
    Wang, Zhen
    Huang, Haikuan
    Huang, Jun
    Chen, Xi
    Deng, Hongbo
    Qiu, Minghui
    Gong, Xiaoli
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1305 - 1308
  • [10] Language-Agnostic Knowledge Representation for a Truly Multilingual Semantic Web
    Jain, Sarika
    Kysliak, Anastasiia
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2022, 13 (01)