EcomGPT: Instruction-Tuning Large Language Models with Chain-of-Task Tasks for E-commerce

Cited by: 0

Authors:

Li, Yangning ^{[1,4]}

Ma, Shirong ^{[1]}

Wang, Xiaobin ^{[3]}

Huang, Shen ^{[3]}

Jiang, Chengyue ^{[2]}

Zheng, Hai-Tao ^{[1,4]}

Xie, Pengjun ^{[3]}

Huang, Fei ^{[3]}

Jiang, Yong ^{[3]}

Affiliations:

[1] Tsinghua Univ, SIGS, Beijing, Peoples R China

[2] ShanghaiTech Univ, Shanghai, Peoples R China

[3] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China

[4] PengCheng Lab, Shenzhen, Peoples R China

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17 | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recently, instruction-following Large Language Models (LLMs), represented by ChatGPT, have exhibited exceptional performance on general Natural Language Processing (NLP) tasks. However, the unique characteristics of E-commerce data pose significant challenges to general LLMs. An LLM tailored specifically for E-commerce scenarios, with robust cross-dataset/task generalization capabilities, is a pressing necessity. To address this, we propose the first E-commerce instruction dataset, EcomInstruct, with a total of 2.5 million instruction examples. EcomInstruct scales up data size and task diversity by constructing atomic tasks from basic E-commerce data types, such as product information and user reviews. Atomic tasks are defined as intermediate tasks implicitly involved in solving a final task, which we also call Chain-of-Task tasks. We developed EcomGPT at different parameter scales by training the backbone model BLOOMZ on EcomInstruct. Benefiting from the fundamental semantic understanding capabilities acquired from the Chain-of-Task tasks, EcomGPT exhibits excellent zero-shot generalization capabilities. Extensive experiments and human evaluations demonstrate that EcomGPT outperforms ChatGPT in terms of cross-dataset/task generalization on E-commerce tasks. EcomGPT will be made public at https://github.com/Alibaba-NLP/EcomGPT.


Pages: 18582-18590

Page count: 9
