Zero-Shot Food Image Detection Based on Transformer

Authors
Song, Jingru [1]
Min, Weiqing [2, 3]
Zhou, Pengfei [2, 3]
Rao, Quanrui [1]
Sheng, Guorui [1]
Yang, Yancun [1]
Wang, Lili [1]
Jiang, Shuqiang [2, 3]
Affiliations
[1] School of Information and Electrical Engineering, Ludong University, Yantai, 264025, China
[2] Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
[3] Key Lab of Intelligent Information Processing, Chinese Academy of Sciences, Beijing, 100190, China
Keywords
Food chemistry; Food ingredients
DOI
10.13386/j.issn1002-0306.2024030027
Abstract
As a fundamental task in food computing, food detection locates and identifies food items in input images and plays a crucial role in applications such as intelligent canteen settlement and dietary health management. In practical scenarios, however, food categories are constantly updated, so food detectors trained on a fixed set of categories struggle to detect previously unseen food categories accurately. To address this issue, this paper proposes a zero-shot food image detection method. First, a Transformer-based food primitive generator is constructed, in which each primitive contains fine-grained attributes relevant to food categories. These primitives can be selectively assembled according to food characteristics to synthesize new food features. Second, a visual feature disentanglement enhancement component is proposed to impose more constraints on the visual features of unseen food categories. The visual features of food images are decomposed into semantically related and semantically unrelated features, so that semantic knowledge of food categories transfers better to their visual features. The proposed method is evaluated on the ZSFooD and UEC-FOOD256 datasets through extensive experiments and ablation studies. Under the zero-shot detection (ZSD) setting, the best average precision on unseen classes reaches 4.9% and 24.1%, respectively. Under the generalized zero-shot detection (GZSD) setting, the harmonic mean over seen and unseen classes reaches 5.8% and 22.0%, respectively. Both results support the effectiveness of the proposed method. © The Author(s) 2024.
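The GZSD harmonic mean mentioned in the abstract is conventionally computed from the scores on seen and unseen classes as 2·S·U / (S + U). The abstract gives neither the formula nor the per-split scores, so the following is only a minimal sketch under that assumption; the function name and the example inputs are hypothetical and are not the paper's results.

```python
def harmonic_mean(seen_score: float, unseen_score: float) -> float:
    """Harmonic mean of a seen-class score and an unseen-class score.

    Assumes the conventional GZSD definition 2*S*U / (S + U).
    Returns 0.0 when both scores are zero to avoid division by zero.
    """
    total = seen_score + unseen_score
    if total == 0:
        return 0.0
    return 2.0 * seen_score * unseen_score / total


if __name__ == "__main__":
    # Hypothetical scores (fractions in [0, 1]), not taken from the paper.
    print(harmonic_mean(0.6, 0.3))
```

Because the harmonic mean is pulled toward the smaller of the two values, a detector cannot score well on it by performing strongly on seen classes alone.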
Pages: 18-26