Dynamic Mixup for Multi-Label Long-Tailed Food Ingredient Recognition

被引:18
|
作者
Gao, Jixiang [1 ]
Chen, Jingjing [1 ]
Fu, Huazhu [2 ]
Jiang, Yu-Gang [1 ]
机构
[1] Fudan Univ, Shanghai 200437, Peoples R China
[2] ASTAR, Inst High Performance Comp IHPC, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Ingredient recognition; imbalanced multi-label classification; long-tailed problem;
D O I
10.1109/TMM.2022.3181789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing the ingredients composition for given food images facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. Nevertheless, ingredient recognition is a multi-label long-tailed classification problem, where each image may contain multiple labels and the class distributions are highly imbalanced. Most existing approaches leverage off-the-shelf Convolutional Neural Networks (CNN) for multi-label ingredient recognition, overlooking the long-tailed issue, which results in low accuracy for tail ingredient categories. To address this problem, this paper proposes a dynamic Mixup (D-Mixup) approach, aiming to dynamically augment minority ingredients, in order to boost the recognition performance for tail ingredient categories. Specifically, our D-Mixup approach dynamically selects two training images based on the predictions of the previous training epoch, and generates a new synthetic image to train the recognition network. In this way, the training samples of tailed classes can be dynamically enlarged and better discriminative representations can be learnt for rare classes. Extensive experiments on both VIREO Food-172 dataset and UEC Food-100 dataset demonstrate the effectiveness of the proposed D-Mixup method.
引用
收藏
页码:4764 / 4773
页数:10
相关论文
共 50 条
  • [21] Beyond the Label Distribution Prior for Long-Tailed Recognition
    Li, Ming
    Cao, Liujuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 792 - 803
  • [22] Long-tail Mixup for Extreme Multi-label Classification
    Han, Sangwoo
    Choi, Eunseong
    Lim, Chan
    Shim, Hyunjung
    Lee, Jongwuk
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3998 - 4002
  • [23] Open Long-Tailed Recognition in a Dynamic World
    Liu, Ziwei
    Miao, Zhongqi
    Zhan, Xiaohang
    Wang, Jiayun
    Gong, Boqing
    Yu, Stella X.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1836 - 1851
  • [24] An Optimized Ensemble Framework for Multi-Label Classification on Long-Tailed Chest X-ray Data
    Jeong, Jaehyup
    Jeoun, Bosoung
    Park, Yeonju
    Han, Bohyung
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2731 - 2738
  • [25] Criticality-aware Deconfounded Classification of Long-tailed Multi-label 12-lead Electrocardiogram
    Deb, Trisrota
    Sahu, Ishan
    Ukil, Arijit
    Pal, Arpan
    Khandelwal, Sundeep
    Garain, Utpal
    2024 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS AND OTHER AFFILIATED EVENTS, PERCOM WORKSHOPS, 2024, : 239 - 244
  • [26] Advanced Augmentation and Ensemble Approaches for Classifying Long-Tailed Multi-Label Chest X-Rays
    Trong-Hieu Nguyen-Mau
    Tuan-Luc Huynh
    Thanh-Danh Le
    Hai-Dang Nguyen
    Minh-Triet Tran
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2721 - 2730
  • [27] Long-Tailed Food Classification
    He, Jiangpeng
    Lin, Luotao
    Eicher-Miller, Heather A.
    Zhu, Fengqing
    NUTRIENTS, 2023, 15 (12)
  • [28] Dynamic Learnable Logit Adjustment for Long-Tailed Visual Recognition
    Zhang, Enhao
    Geng, Chuanxing
    Li, Chaohua
    Chen, Songcan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 7986 - 7997
  • [29] Dynamic prior probability network for long-tailed visual recognition
    Zhou, Xuesong
    Sun, Jiaqi
    Zhai, Junhai
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 268
  • [30] Food Ingredients Recognition Through Multi-label Learning
    Bolanos, Marc
    Ferra, Aina
    Radeva, Petia
    NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2017, 2017, 10590 : 394 - 402