COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at Amazon

被引:0
|
作者
Yu, Changlong [1 ]
Liu, Xin [1 ]
Maia, Jefferson [1 ]
Li, Yang [1 ]
Cao, Tianyu [1 ]
Gao, Yifan [1 ]
Song, Yangqiu [2 ]
Goutam, Rahul [1 ]
Zhang, Haiyang [1 ]
Yin, Bing [1 ]
Li, Zheng [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
10.1145/3626246.3653398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications of large-scale knowledge graphs in the e-commerce platforms can improve shopping experience for their customers. While existing e-commerce knowledge graphs (KGs) integrate a large volume of concepts or product attributes, they fail to discover user intentions, leaving the gap with how people think, behave, and interact with surrounding world. In this work, we present COSMO, a scalable system to mine user-centric commonsense knowledge from massive behaviors and construct industry-scale knowledge graphs to empower diverse online services. In particular, we describe a pipeline for collecting high-quality seed knowledge assertions that are distilled from large language models (LLMs) and further refined by critic classifiers trained over human-in-the-loop annotated data. Since those generations may not always align with human preferences and contain noises, we then describe how we adopt instruction tuning to finetune an efficient language model (COSMO-LM) for faithful e-commerce commonsense knowledge generation at scale. COSMO-LM effectively expands our knowledge graph to 18 major categories at Amazon, producing millions of high-quality knowledge with only 30k annotated instructions. Finally COSMO has been deployed in Amazon search applications such as search navigation. Both offline and online A/B experiments demonstrate our proposed system achieves significant improvement. Furthermore, these experiments highlight the immense potential of commonsense knowledge extracted from instruction-finetuned large language models.
引用
收藏
页码:148 / 160
页数:13
相关论文
共 50 条
  • [1] Large-scale Visual Search and Similarity for E-Commerce
    Anand, Gaurav
    Wang, Siyun
    Ni, Karl
    APPLICATIONS OF MACHINE LEARNING 2021, 2021, 11843
  • [2] Ontology management for large-scale e-commerce applications
    Lee, J
    Goodwin, R
    DEEC 2005: International Workshop on Data Engineering Issues in E-Commerce, Proceedings, 2005, : 7 - 15
  • [3] Online E-Commerce Fraud: A Large-scale Detection and Analysis
    Weng, Haiqin
    Li, Zhao
    Ji, Shouling
    Chu, Chen
    Lu, Haifeng
    Du, Tianyu
    He, Qinming
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1435 - 1440
  • [4] Study of Large-scale Enterprise Strategic Decision Support System in E-commerce Environment
    Jiang, Yuantao
    2011 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL, AND SYSTEMS SCIENCES, AND ENGINEERING (CESSE 2011), 2011, : 528 - 531
  • [5] Building Large-Scale Deep Learning System for Entity Recognition in E-Commerce Search
    Wen, Musen
    Vasthimal, Deepak Kumar
    Lu, Alan
    Wang, Tian
    Guo, Aimin
    BDCAT'19: PROCEEDINGS OF THE 6TH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2019, : 149 - 154
  • [6] Heterogeneous Embedding Propagation for Large-scale E-Commerce User Alignment
    Zheng, Vincent W.
    Sha, Mo
    Li, Yuchen
    Yang, Hongxia
    Fang, Yuan
    Zhang, Zhenjie
    Tan, Kian-Lee
    Chang, Kevin Chen-Chuan
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1434 - 1439
  • [7] Large-scale Robust Online Matching and Its Application in E-commerce
    Jin, Rong
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1351 - 1351
  • [8] Large-scale Fake Click Detection for E-commerce Recommendation Systems
    Li, Jingdong
    Li, Zhao
    Huang, Jiaming
    Zhang, Ji
    Wang, Xiaoling
    Lu, Xingjian
    Zhou, Jingren
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2595 - 2606
  • [9] CFSF: On Cloud-Based Recommendation for Large-Scale E-commerce
    Hu, Long
    Lin, Kai
    Hassan, Mohammad Mehedi
    Alamri, Atif
    Alelaiwi, Abdulhameed
    MOBILE NETWORKS & APPLICATIONS, 2015, 20 (03): : 380 - 390
  • [10] CFSF: On Cloud-Based Recommendation for Large-Scale E-commerce
    Long Hu
    Kai Lin
    Mohammad Mehedi Hassan
    Atif Alamri
    Abdulhameed Alelaiwi
    Mobile Networks and Applications, 2015, 20 : 380 - 390