Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition

被引:18
|
作者
Gao, Jixiang [1 ]
Chen, Jingjing [1 ]
Fu, Huazhu [2 ]
Jiang, Yu-Gang [1 ]
机构
[1] Fudan Univ, Shanghai 200437, Peoples R China
[2] ASTAR, Inst High Performance Comp IHPC, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Ingredient recognition; imbalanced multi-label classification; long-tailed problem;
D O I
10.1109/TMM.2022.3181789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing the ingredients composition for given food images facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. Nevertheless, ingredient recognition is a multi-label long-tailed classification problem, where each image may contain multiple labels and the class distributions are highly imbalanced. Most existing approaches leverage off-the-shelf Convolutional Neural Networks (CNN) for multi-label ingredient recognition, overlooking the long-tailed issue, which results in low accuracy for tail ingredient categories. To address this problem, this paper proposes a dynamic Mixup (D-Mixup) approach, aiming to dynamically augment minority ingredients, in order to boost the recognition performance for tail ingredient categories. Specifically, our D-Mixup approach dynamically selects two training images based on the predictions of the previous training epoch, and generates a new synthetic image to train the recognition network. In this way, the training samples of tailed classes can be dynamically enlarged and better discriminative representations can be learnt for rare classes. Extensive experiments on both VIREO Food-172 dataset and UEC Food-100 dataset demonstrate the effectiveness of the proposed D-Mixup method.
引用
收藏
页码:4764 / 4773
页数:10
相关论文
共 50 条
  • [1] LABEL-OCCURRENCE-BALANCED MIXUP FOR LONG-TAILED RECOGNITION
    Zhang, Shaoyu
    Chen, Chen
    Zhang, Xiujuan
    Peng, Silong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3224 - 3228
  • [2] Multi-Label Meta Weighting for Long-Tailed Dynamic Scene Graph Generation
    Chen, Shuo
    Du, Yingjun
    Mettes, Pascal
    Snoek, Cees G. M.
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 39 - 47
  • [3] Robust Asymmetric Loss for Multi-Label Long-Tailed Learning
    Park, Wongi
    Park, Inhyuk
    Kim, Sungeun
    Ryu, Jongbin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2703 - 2712
  • [4] Does Head Label Help for Long-Tailed Multi-Label Text Classification
    Xiao, Lin
    Zhang, Xiangliang
    Jing, Liping
    Huang, Chi
    Song, Mingyang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14103 - 14111
  • [5] Long-Tailed Multi-label Retinal Diseases Recognition via Relational Learning and Knowledge Distillation
    Zhou, Qian
    Zou, Hua
    Wang, Zhongyuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 709 - 718
  • [6] Residual diverse ensemble for long-tailed multi-label text classification
    Jiangxin SHI
    Tong WEI
    Yufeng LI
    Science China(Information Sciences), 2024, 67 (11) : 92 - 105
  • [7] Residual diverse ensemble for long-tailed multi-label text classification
    Shi, Jiangxin
    Wei, Tong
    Li, Yufeng
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (11)
  • [8] Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification
    Audibert, Alexandre
    Gauffre, Aurelien
    Amini, Massih-Reza
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 245 - 261
  • [9] Distributionally Robust Loss for Long-Tailed Multi-label Image Classification
    Lin, Dekun
    Peng, Tailai
    Chen, Rui
    Xie, Xinran
    Qin, Xiaolin
    Cui, Zhe
    COMPUTER VISION - ECCV 2024, PT XXXIII, 2025, 15091 : 417 - 433
  • [10] Probability Guided Loss for Long-Tailed Multi-Label Image Classification
    Lin, Dekun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1577 - 1585