EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

被引:0
|
作者
Li, Yangning [1 ,4 ]
Ma, Shirong [1 ]
Wang, Xiaobin [3 ]
Huang, Shen [3 ]
Jiang, Chengyue [2 ]
Zheng, Hai-Tao [1 ,4 ]
Xie, Pengjun [3 ]
Huang, Fei [3 ]
Jiang, Yong [3 ]
机构
[1] Tsinghua Univ, SIGS, Beijing, Peoples R China
[2] ShanghaiTech Univ, Shanghai, Peoples R China
[3] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
[4] PengCheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, instruction-following Large Language Models (LLMs) , represented by ChatGPT, have exhibited exceptional performance in general Natural Language Processing (NLP) tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, possessing robust cross-dataset/task generalization capabilities, is a pressing necessity. To solve this issue, in this work, we proposed the first E-commerce instruction dataset EcomInstruct, with a total of 2.5 million instruction data. EcomInstruct scales up the data size and task diversity by constructing atomic tasks with E-commerce basic data types, such as product information, user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT with different parameter scales by training the backbone model BLOOMZ with the EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in term of cross-dataset/task generalization on E-commerce tasks. The EcomGPT will be public at https://github.com/Alibaba-NLP/EcomGPT.
引用
收藏
页码:18582 / 18590
页数:9
相关论文
共 48 条
  • [21] Advancing entity recognition in biomedicine via instruction tuning of large language models
    Keloth, Vipina K.
    Hu, Yan
    Xie, Qianqian
    Peng, Xueqing
    Wang, Yan
    Zheng, Andrew
    Selek, Melih
    Raja, Kalpana
    Wei, Chih Hsuan
    Jin, Qiao
    Lu, Zhiyong
    Chen, Qingyu
    Xu, Hua
    BIOINFORMATICS, 2024, 40 (04)
  • [22] Instruction Tuning Large Language Models for Multimodal Relation Extraction Using LoRA
    Li, Zou
    Pang, Ning
    Zhao, Xiang
    WEB INFORMATION SYSTEMS AND APPLICATIONS, WISA 2024, 2024, 14883 : 364 - 376
  • [23] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
    Zhang, Jinrui
    Wang, Teng
    Zhang, Haigang
    Lu, Ping
    Zheng, Feng
    COMPUTER VISION - ECCV 2024, PT XXXVII, 2025, 15095 : 196 - 213
  • [24] WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning
    Yu, Zhaojian
    Zhang, Xin
    Shang, Ning
    Huang, Yangyu
    Xu, Can
    Zhao, Yishujie
    Hu, Wenxiang
    Yin, Qiufeng
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 5140 - 5153
  • [25] Intent-based Product Collections for E-commerce using Pretrained Language Models
    Kim, Hiun
    Jeong, Jisu
    Kim, Kyung-Min
    Lee, Dongjun
    Lee, Hyun Dong
    Seo, Dongpil
    Han, Jeeseung
    Park, Dong Wook
    Heo, Ji Ae
    Kim, Rak Yeong
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 228 - 237
  • [26] Examining the Supply Chain Management Models for Agricultural Products Under the Context of E-Commerce
    Zhang, Weiqing
    Li, Jiaxin
    He, Yajie
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (04): : 1193 - 1200
  • [27] e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce
    Shin, Wonyoung
    Park, Jonghun
    Woo, Taekang
    Cho, Yongwoo
    Oh, Kwangjin
    Song, Hwanjun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3484 - 3494
  • [28] A Large Language Model and Qualitative Comparative Analysis-Based Study of Trust in E-Commerce
    Davoodi, Laleh
    Mezei, Jozsef
    APPLIED SCIENCES-BASEL, 2024, 14 (21):
  • [29] Analysis on Logistics Distribution Models of the Chain-store Management Under the E-commerce Environment
    Xi Ying
    PROCEEDINGS OF 2014 INTERNATIONAL SYMPOSIUM - DEVELOPMENT OF MODERN SERVICE INDUSTRY, 2014, : 20 - 23
  • [30] Evaluating large language models on geospatial tasks: a multiple geospatial task benchmarking study
    Xu, Liuchang
    Zhao, Shuo
    Lin, Qingming
    Chen, Luyao
    Luo, Qianqian
    Wu, Sensen
    Ye, Xinyue
    Feng, Hailin
    Du, Zhenhong
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2025, 18 (01)