COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at Amazon

被引:0
|
作者
Yu, Changlong [1 ]
Liu, Xin [1 ]
Maia, Jefferson [1 ]
Li, Yang [1 ]
Cao, Tianyu [1 ]
Gao, Yifan [1 ]
Song, Yangqiu [2 ]
Goutam, Rahul [1 ]
Zhang, Haiyang [1 ]
Yin, Bing [1 ]
Li, Zheng [1 ]
机构
[1] Amazon, Palo Alto, CA 94303 USA
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
关键词
D O I
10.1145/3626246.3653398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Applications of large-scale knowledge graphs in the e-commerce platforms can improve shopping experience for their customers. While existing e-commerce knowledge graphs (KGs) integrate a large volume of concepts or product attributes, they fail to discover user intentions, leaving the gap with how people think, behave, and interact with surrounding world. In this work, we present COSMO, a scalable system to mine user-centric commonsense knowledge from massive behaviors and construct industry-scale knowledge graphs to empower diverse online services. In particular, we describe a pipeline for collecting high-quality seed knowledge assertions that are distilled from large language models (LLMs) and further refined by critic classifiers trained over human-in-the-loop annotated data. Since those generations may not always align with human preferences and contain noises, we then describe how we adopt instruction tuning to finetune an efficient language model (COSMO-LM) for faithful e-commerce commonsense knowledge generation at scale. COSMO-LM effectively expands our knowledge graph to 18 major categories at Amazon, producing millions of high-quality knowledge with only 30k annotated instructions. Finally COSMO has been deployed in Amazon search applications such as search navigation. Both offline and online A/B experiments demonstrate our proposed system achieves significant improvement. Furthermore, these experiments highlight the immense potential of commonsense knowledge extracted from instruction-finetuned large language models.
引用
收藏
页码:148 / 160
页数:13
相关论文
共 50 条
  • [41] FLOPPIES: A Framework for Large-Scale Ontology Population of Product Information from Tabular Data in E-commerce Stores
    Nederstigt, Lennart J.
    Aanen, Steven S.
    Vandic, Damir
    Frasincar, Flavius
    DECISION SUPPORT SYSTEMS, 2014, 59 : 296 - 311
  • [42] The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service
    Chen, Meng
    Liu, Ruixue
    Shen, Lei
    Yuan, Shaozu
    Zhou, Jingyan
    Wu, Youzheng
    He, Xiaodong
    Zhou, Bowen
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 459 - 466
  • [43] Securing the Deep Fraud Detector in Large-Scale E-Commerce Platform via Adversarial Machine Learning Approach
    Guo, Qingyu
    Li, Zhao
    An, Bo
    Hui, Pengrui
    Huang, Jiaming
    Zhang, Long
    Zhao, Mengchen
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 616 - 626
  • [44] DC-GNN: Decoupled Graph Neural Networks for Improving and Accelerating Large-Scale E-commerce Retrieval
    Feng, Chenchen
    He, Yu
    Wen, Shiyang
    Liu, Guojun
    Wang, Liang
    Xu, Jian
    Zheng, Bo
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 32 - 40
  • [45] Effect of e-commerce popularization on farmland abandonment in rural China: Evidence from a large-scale household survey
    Wang, Yahui
    Yang, Aoxi
    Li, Yuanqing
    Yang, Qingyuan
    LAND USE POLICY, 2023, 135
  • [46] Detecting the Internet Water Army via Comprehensive Behavioral Features Using Large-scale E-commerce Reviews
    Guo, Bo
    Wang, Hao
    Yu, Zhaojun
    Sun, Yu
    2017 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (IEEE CITS), 2017, : 88 - 92
  • [47] Knowledge Generation Using Sentiment Classification Involving Machine Learning on E-Commerce
    Ghosh, Swarup Kr
    Dey, Sowvik
    Ghosh, Anupam
    INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2019, 6 (02) : 74 - 90
  • [48] Towards Knowledge-Based Personalized Product Description Generation in E-commerce
    Chen, Qibin
    Lin, Junyang
    Zhang, Yichang
    Yang, Hongxia
    Zhou, Jingren
    Tang, Jie
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 3040 - 3050
  • [49] Relation labeling in product knowledge graphs with large language models for e-commerce
    Chen, Jiao
    Ma, Luyi
    Li, Xiaohan
    Xu, Jianpeng
    Cho, Jason H. D.
    Nag, Kaushiki
    Korpeoglu, Evren
    Kumar, Sushant
    Achan, Kannan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 5725 - 5743
  • [50] A knowledge-based decision-support-system for e-commerce
    Kassel, S
    Grebenstein, K
    Tittmann, C
    EUROMEDIA '2005: 11TH ANNUAL EUROMEDIA CONFERENCE, 2005, : 146 - 150