e-CLIP: Large-Scale Vision-Language Representation Learning in E-commerce

被引:8
|
作者
Shin, Wonyoung [1 ]
Park, Jonghun [1 ]
Woo, Taekang [1 ]
Cho, Yongwoo [1 ]
Oh, Kwangjin [1 ]
Song, Hwanjun [2 ]
机构
[1] NAVER Shopping, Seongnam, South Korea
[2] NAVER AI Res, Seongnam, South Korea
关键词
Multimodal pre-training; Large-scale pre-training;
D O I
10.1145/3511808.3557067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Understanding vision and language representations of product content is vital for search and recommendation applications in e-commerce. As a backbone for online shopping platforms and inspired by the recent success in representation learning research, we propose a contrastive learning framework that aligns language and visual models using unlabeled raw product text and images. We present techniques we used to train large-scale representation learning models and share solutions that address domain-specific challenges. We study the performance using our pre-trained model as backbones for diverse downstream tasks, including category classification, attribute extraction, product matching, product clustering, and adult product recognition. Experimental results show that our proposed method outperforms the baseline in each downstream task regarding both single modality and multiple modalities.
引用
收藏
页码:3484 / 3494
页数:11
相关论文
共 50 条
  • [21] Vision-Language Tracking With CLIP and Interactive Prompt Learning
    Zhu, Hong
    Lu, Qingyang
    Xue, Lei
    Zhang, Pingping
    Yuan, Guanglin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (03) : 3659 - 3670
  • [22] Securing the Deep Fraud Detector in Large-Scale E-Commerce Platform via Adversarial Machine Learning Approach
    Guo, Qingyu
    Li, Zhao
    An, Bo
    Hui, Pengrui
    Huang, Jiaming
    Zhang, Long
    Zhao, Mengchen
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 616 - 626
  • [23] Multi-Channel Sellers Traffic Allocation in Large-scale E-commerce Promotion
    Shen Xin
    Ye, Yizhou
    Ester, Martin
    Long, Cheng
    Zhang, Jie
    Li, Zhao
    Yuan, Kaiying
    Li, Yanghua
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2845 - 2852
  • [24] A linguistic solution for double large-scale group decision-making in E-commerce
    Wu, Tong
    Liu, Xinwang
    Qin, Jindong
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 116 : 97 - 112
  • [25] Study of Large-scale Enterprise Strategic Decision Support System in E-commerce Environment
    Jiang, Yuantao
    2011 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL, AND SYSTEMS SCIENCES, AND ENGINEERING (CESSE 2011), 2011, : 528 - 531
  • [26] Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks
    Ha, Jung-Woo
    Pyo, Hyuna
    Kim, Jeonghee
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 107 - 115
  • [27] A large-scale last-mile consolidation model for e-commerce home delivery
    Munoz-Villamizar, Andres
    Velazquez-Martinez, Josue C.
    Caballero-Caballero, Sergio
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [28] Hierarchical Bipartite Graph Neural Networks: Towards Large-Scale E-commerce Applications
    Li, Zhao
    Shen, Xin
    Jiao, Yuhang
    Pan, Xuming
    Zou, Pengcheng
    Meng, Xianling
    Yao, Chengwei
    Bu, Jiajun
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1677 - 1688
  • [29] Learning from e-commerce for e-learning
    Liu, Zhengdan
    PROCEEDINGS OF THE 2007 1ST INTERNATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGIES AND APPLICATIONS IN EDUCATION (ISITAE 2007), 2007, : 193 - 197
  • [30] LiLiuM: eBay's Large Language Models for e-commerce
    Herold, Christian
    Kozielski, Michael
    Ekimov, Leonid
    Petrushkov, Pavel
    Vandenbussche, Pierre-Yves
    Khadivi, Shahram
    arXiv,