Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition

被引:18
|
作者
Gao, Jixiang [1 ]
Chen, Jingjing [1 ]
Fu, Huazhu [2 ]
Jiang, Yu-Gang [1 ]
机构
[1] Fudan Univ, Shanghai 200437, Peoples R China
[2] ASTAR, Inst High Performance Comp IHPC, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Ingredient recognition; imbalanced multi-label classification; long-tailed problem;
D O I
10.1109/TMM.2022.3181789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing the ingredients composition for given food images facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. Nevertheless, ingredient recognition is a multi-label long-tailed classification problem, where each image may contain multiple labels and the class distributions are highly imbalanced. Most existing approaches leverage off-the-shelf Convolutional Neural Networks (CNN) for multi-label ingredient recognition, overlooking the long-tailed issue, which results in low accuracy for tail ingredient categories. To address this problem, this paper proposes a dynamic Mixup (D-Mixup) approach, aiming to dynamically augment minority ingredients, in order to boost the recognition performance for tail ingredient categories. Specifically, our D-Mixup approach dynamically selects two training images based on the predictions of the previous training epoch, and generates a new synthetic image to train the recognition network. In this way, the training samples of tailed classes can be dynamically enlarged and better discriminative representations can be learnt for rare classes. Extensive experiments on both VIREO Food-172 dataset and UEC Food-100 dataset demonstrate the effectiveness of the proposed D-Mixup method.
引用
收藏
页码:4764 / 4773
页数:10
相关论文
共 50 条
  • [41] Long-Tailed Recognition via Weight Balancing
    Alshammari, Shaden
    Wang, Yu-Xiong
    Ramanan, Deva
    Kong, Shu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6887 - 6897
  • [42] Equalization Loss for Long-Tailed Object Recognition
    Tan, Jingru
    Wang, Changbao
    Li, Buyu
    Li, Quanquan
    Ouyang, Wanli
    Yin, Changqing
    Yan, Junjie
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 11659 - 11668
  • [43] Decoupled Optimisation for Long-Tailed Visual Recognition
    Cong, Cong
    Xuan, Shiyu
    Liu, Sidong
    Zhang, Shiliang
    Pagnucco, Maurice
    Song, Yang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1380 - 1388
  • [44] Decoupled Contrastive Learning for Long-Tailed Recognition
    Xuan, Shiyu
    Zhang, Shiliang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 6396 - 6403
  • [45] Distilling Virtual Examples for Long-tailed Recognition
    He, Yin-Yin
    Wu, Jianxin
    Wei, Xiu-Shen
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 235 - 244
  • [46] Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge
    Holste, Gregory
    Zhou, Yiliang
    Wang, Song
    Jaiswal, Ajay
    Lin, Mingquan
    Zhuge, Sherry
    Yang, Yuzhe
    Kim, Dongkyun
    Nguyen-Mau, Trong-Hieu
    Tran, Minh-Triet
    Jeong, Jaehyup
    Park, Wongi
    Ryu, Jongbin
    Hong, Feng
    Verma, Arsh
    Yamagishi, Yosuke
    Kim, Changhyun
    Seo, Hyeryeong
    Kang, Myungjoo
    Celi, Leo Anthony
    Lu, Zhiyong
    Summers, Ronald M.
    Shih, George
    Wang, Zhangyang
    Peng, Yifan
    MEDICAL IMAGE ANALYSIS, 2024, 97
  • [47] DBN-Mix: Training dual branch network using bilateral mixup augmentation for long-tailed visual recognition
    Baik, Jae Soon
    Yoon, In Young
    Choi, Jun Won
    PATTERN RECOGNITION, 2024, 147
  • [48] Dynamic collaborative learning with heterogeneous knowledge transfer for long-tailed visual recognition
    Zhou, Hao
    Luo, Tingjin
    He, Yongming
    INFORMATION FUSION, 2025, 115
  • [49] Normalizing Batch Normalization for Long-Tailed Recognition
    Bao, Yuxiang
    Kang, Guoliang
    Yang, Linlin
    Duan, Xiaoyue
    Zhao, Bo
    Zhang, Baochang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 209 - 220
  • [50] On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond
    Yang, Yuzhe
    Wang, Hao
    Katabi, Dina
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 57 - 75