Dietary Assessment With Multimodal ChatGPT: A Systematic Analysis

被引:2
|
作者
Lo, Frank P. -W. [1 ]
Qiu, Jianing [2 ]
Wang, Zeyu [1 ]
Chen, Junhong [1 ]
Xiao, Bo [1 ]
Yuan, Wu [2 ]
Giannarou, Stamatia [1 ]
Frost, Gary [3 ]
Lo, Benny [3 ]
机构
[1] Imperial Coll London, Hamlyn Ctr, London SW7 2AZ, England
[2] Chinese Univ Hong Kong, Dept Biomed Engn, Hong Kong, Peoples R China
[3] Imperial Coll London, Fac Med, Dept Metab Digest & Reprod, London SW7 2AZ, England
基金
比尔及梅琳达.盖茨基金会; 英国医学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
Artificial intelligence; Estimation; Task analysis; Chatbots; Monitoring; Accuracy; Visualization; ChatGPT; deep learning; dietary assessment; food recognition; foundation model; GPT-4V; passive monitoring; COUNTING BITES; FOOD;
D O I
10.1109/JBHI.2024.3417280
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional approaches to dietary assessment are primarily grounded in self-reporting methods or structured interviews conducted under the supervision of dietitians. These methods, however, are often subjective, inaccurate, and time-intensive. Although artificial intelligence (AI)-based solutions have been devised to automate the dietary assessment process, prior AI methodologies tackle dietary assessment in a fragmented landscape (e.g., merely recognizing food types or estimating portion size) and encounter challenges in their ability to generalize across a diverse range of food categories, dietary behaviors, and cultural contexts. Recently, the emergence of multimodal foundation models, such as GPT-4V, has exhibited transformative potential across a wide range of tasks in various research domains. These models have demonstrated remarkable generalist intelligence and accuracy, owing to their large-scale pre-training on broad datasets and substantially scaled model size. In this study, we explore the application of GPT-4V powering multimodal ChatGPT for dietary assessment, along with prompt engineering and passive monitoring techniques. We evaluated the proposed pipeline using a self-collected, semi free-living dietary intake dataset, captured through wearable cameras. Our findings reveal that GPT-4V excels in food detection under challenging conditions without any fine-tuning or adaptation using food-specific datasets. By guiding the model with specific language prompts (e.g., African cuisine), it shifts from recognizing common staples like rice and bread to accurately identifying regional dishes like banku and ugali. Another standout feature of GPT-4V is its contextual awareness. GPT-4V can leverage surrounding objects as scale references to deduce the portion sizes of food items, further facilitating the process of dietary assessment.
引用
收藏
页码:7577 / 7587
页数:11
相关论文
共 50 条
  • [1] Performance of ChatGPT in French language analysis of multimodal retinal cases
    Mikhail, D.
    Mihalache, A.
    Huang, R. S.
    Khairy, T.
    Popovic, M. M.
    Milad, D.
    Shor, R.
    Pereira, A.
    Kwok, J.
    Yan, P.
    Wong, D. T.
    Kertes, P. J.
    Duval, R.
    Muni, R. H.
    JOURNAL FRANCAIS D OPHTALMOLOGIE, 2025, 48 (03):
  • [2] Automated Assessment of Encouragement and Warmth in Classrooms Leveraging Multimodal Emotional Features and ChatGPT
    Hou, Ruikun
    Fuetterer, Tim
    Buehler, Babette
    Bozkir, Efe
    Gerjets, Peter
    Trautwein, Ulrich
    Kasneci, Enkelejda
    ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024, 2024, 14829 : 60 - 74
  • [3] A systematic review and meta-analysis of unimodal and multimodal predation risk assessment in birds
    Mathot, Kimberley J.
    Arteaga-Torres, Josue David
    Besson, Anne
    Hawkshaw, Deborah M.
    Klappstein, Natasha
    McKinnon, Rebekah A.
    Sridharan, Sheeraja
    Nakagawa, Shinichi
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [4] A Systematic Review of Systematic Reviews of Validated Dietary Assessment Tools
    Hooson, J.
    Hancock, N.
    Greenwood, D. C.
    Robinson, S.
    Burley, V. J.
    Roe, M.
    Steer, T.
    Wark, P. A.
    Cade, J. E.
    PROCEEDINGS OF THE NUTRITION SOCIETY, 2016, 75 (OCE3) : E239 - E239
  • [5] Assessment of multimodal treatment options in recurrent and persistent acromegaly: a systematic review and meta-analysis
    Maroufi, Seyed Farzad
    Assar, Manijeh
    Khorasanizadeh, Mirhojjat
    Sabet, Fatemeh Mahdavi
    Sabahi, Mohammadmahdi
    Dabecco, Rocco
    Adada, Badih
    Zada, Gabriel
    Borghei-Razavi, Hamid
    JOURNAL OF NEURO-ONCOLOGY, 2024, 168 (01) : 13 - 25
  • [6] Assessment of multimodal treatment options in recurrent and persistent acromegaly: a systematic review and meta-analysis
    Seyed Farzad Maroufi
    Manijeh Assar
    MirHojjat Khorasanizadeh
    Fatemeh Mahdavi Sabet
    Mohammadmahdi Sabahi
    Rocco Dabecco
    Badih Adada
    Gabriel Zada
    Hamid Borghei-Razavi
    Journal of Neuro-Oncology, 2024, 168 : 13 - 25
  • [7] Validity of Dietary Assessment in Athletes: A Systematic Review
    Capling, Louise
    Beck, Kathryn L.
    Gifford, Janelle A.
    Slater, Gary
    Flood, Victoria M.
    O'Connor, Helen
    NUTRIENTS, 2017, 9 (12)
  • [8] Systematic exploration and in-depth analysis of ChatGPT architectures progression
    Banik, Debajyoty
    Pati, Natasha
    Sharma, Atul
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [9] ChatGPT for assessment writing
    Zuckerman, Matthew
    Flood, Ryan
    Tan, Rachael J. B.
    Kelp, Nicole
    Ecker, David J.
    Menke, Jonathan
    Lockspeiser, Tai
    MEDICAL TEACHER, 2023, 45 (11) : 1224 - 1227
  • [10] The use of ChatGPT in assessment
    Kanik, Mehmet
    INTERNATIONAL JOURNAL OF ASSESSMENT TOOLS IN EDUCATION, 2024, 11 (03): : 608 - 621