EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

被引:0
|
作者
Li, Yangning [1 ,4 ]
Ma, Shirong [1 ]
Wang, Xiaobin [3 ]
Huang, Shen [3 ]
Jiang, Chengyue [2 ]
Zheng, Hai-Tao [1 ,4 ]
Xie, Pengjun [3 ]
Huang, Fei [3 ]
Jiang, Yong [3 ]
机构
[1] Tsinghua Univ, SIGS, Beijing, Peoples R China
[2] ShanghaiTech Univ, Shanghai, Peoples R China
[3] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
[4] PengCheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, instruction-following Large Language Models (LLMs) , represented by ChatGPT, have exhibited exceptional performance in general Natural Language Processing (NLP) tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, possessing robust cross-dataset/task generalization capabilities, is a pressing necessity. To solve this issue, in this work, we proposed the first E-commerce instruction dataset EcomInstruct, with a total of 2.5 million instruction data. EcomInstruct scales up the data size and task diversity by constructing atomic tasks with E-commerce basic data types, such as product information, user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT with different parameter scales by training the backbone model BLOOMZ with the EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in term of cross-dataset/task generalization on E-commerce tasks. The EcomGPT will be public at https://github.com/Alibaba-NLP/EcomGPT.
引用
收藏
页码:18582 / 18590
页数:9
相关论文
共 48 条
  • [31] Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
    Zhang, Kai
    Gutierrez, Bernal Jimenez
    Su, Yu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 794 - 812
  • [32] A Comparative Analysis of Instruction Fine-Tuning Large Language Models for Financial Text Classification
    Fatemi, Sorouralsadat
    Hu, Yuheng
    Mousavi, Maryam
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [33] CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions
    Rao, Jun
    Liu, Xuebo
    Lian, Lian
    Cheng, Shengjun
    Liao, Yunjie
    Zhang, Min
    EMNLP 2024 - 2024 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference, 2024, : 10064 - 10083
  • [34] Action Contextualization: Adaptive Task Planning and Action Tuning Using Large Language Models
    Gupta, Sthithpragya
    Yao, Kunpeng
    Niederhauser, Loic
    Billard, Aude
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 9407 - 9414
  • [35] Enhancing Visual Information Extraction with Large Language Models Through Layout-Aware Instruction Tuning
    Li, Teng
    Wang, Jiapeng
    Jin, Lianwen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VII, 2025, 15037 : 276 - 289
  • [36] InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
    Wang, Jianing
    Wu, Junda
    Hon, Yupeng
    Liu, Yao
    Gao, Ming
    McAuley, Julian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13492 - 13510
  • [37] ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
    Shenoy, Ashish
    Bodapati, Sravan
    Kirchhoff, Katrin
    ECNLP 4: THE FOURTH WORKSHOP ON E-COMMERCE AND NLP, 2021, : 18 - 25
  • [38] DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning
    Wang, Yejie
    He, Keqing
    Dong, Guanting
    Wang, Pei
    Zeng, Weihao
    Diao, Muxi
    Zhang, Mengdi
    Wang, Jingang
    Cai, Xunliang
    Xu, Weiran
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 4706 - 4721
  • [39] Style in the Long Tail: Discovering Unique Interests with Latent Variable Models in Large Scale Social E-commerce
    Hu, Diane
    Hall, Rob
    Attenberg, Josh
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1640 - 1649
  • [40] LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction
    Fang, Chenhao
    Li, Xiaohan
    Fan, Zezhong
    Xu, Jianpeng
    Nag, Kaushiki
    Korpeoglu, Evren
    Kumar, Sushant
    Achan, Kannan
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2910 - 2914