Dietary Assessment With Multimodal ChatGPT: A Systematic Analysis

被引:2
|
作者
Lo, Frank P. -W. [1 ]
Qiu, Jianing [2 ]
Wang, Zeyu [1 ]
Chen, Junhong [1 ]
Xiao, Bo [1 ]
Yuan, Wu [2 ]
Giannarou, Stamatia [1 ]
Frost, Gary [3 ]
Lo, Benny [3 ]
机构
[1] Imperial Coll London, Hamlyn Ctr, London SW7 2AZ, England
[2] Chinese Univ Hong Kong, Dept Biomed Engn, Hong Kong, Peoples R China
[3] Imperial Coll London, Fac Med, Dept Metab Digest & Reprod, London SW7 2AZ, England
基金
比尔及梅琳达.盖茨基金会; 英国医学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
Artificial intelligence; Estimation; Task analysis; Chatbots; Monitoring; Accuracy; Visualization; ChatGPT; deep learning; dietary assessment; food recognition; foundation model; GPT-4V; passive monitoring; COUNTING BITES; FOOD;
D O I
10.1109/JBHI.2024.3417280
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional approaches to dietary assessment are primarily grounded in self-reporting methods or structured interviews conducted under the supervision of dietitians. These methods, however, are often subjective, inaccurate, and time-intensive. Although artificial intelligence (AI)-based solutions have been devised to automate the dietary assessment process, prior AI methodologies tackle dietary assessment in a fragmented landscape (e.g., merely recognizing food types or estimating portion size) and encounter challenges in their ability to generalize across a diverse range of food categories, dietary behaviors, and cultural contexts. Recently, the emergence of multimodal foundation models, such as GPT-4V, has exhibited transformative potential across a wide range of tasks in various research domains. These models have demonstrated remarkable generalist intelligence and accuracy, owing to their large-scale pre-training on broad datasets and substantially scaled model size. In this study, we explore the application of GPT-4V powering multimodal ChatGPT for dietary assessment, along with prompt engineering and passive monitoring techniques. We evaluated the proposed pipeline using a self-collected, semi free-living dietary intake dataset, captured through wearable cameras. Our findings reveal that GPT-4V excels in food detection under challenging conditions without any fine-tuning or adaptation using food-specific datasets. By guiding the model with specific language prompts (e.g., African cuisine), it shifts from recognizing common staples like rice and bread to accurately identifying regional dishes like banku and ugali. Another standout feature of GPT-4V is its contextual awareness. GPT-4V can leverage surrounding objects as scale references to deduce the portion sizes of food items, further facilitating the process of dietary assessment.
引用
收藏
页码:7577 / 7587
页数:11
相关论文
共 50 条
  • [21] Response to: "ChatGPT for assessment writing"
    Doggett, Thomas
    Warr, Harriet
    Johnson, Jo-Anne
    Cork, Simon
    MEDICAL TEACHER, 2024, 46 (06) : 857 - 858
  • [22] ChatGPT and the future of impact assessment
    Khan, Mehreen
    Chaudhry, Muhammad Nawaz
    Ahsan, Muhammad
    Ahmad, Rameesha
    ENVIRONMENTAL SCIENCE & POLICY, 2024, 157
  • [23] Dietary assessment methods for micronutrient intake: a systematic review on vitamins
    Henriquez-Sanchez, Patricia
    Sanchez-Villegas, Almudena
    Doreste-Alonso, Jorge
    Ortiz-Andrellucchi, Adriana
    Pfrimer, Karina
    Serra-Majem, Lluis
    BRITISH JOURNAL OF NUTRITION, 2009, 102 : S10 - S37
  • [24] Image-Assisted Dietary Assessment: A Systematic Review of the Evidence
    Gemming, Luke
    Utter, Jennifer
    Mhurchu, Cliona Ni
    JOURNAL OF THE ACADEMY OF NUTRITION AND DIETETICS, 2015, 115 (01) : 64 - 77
  • [25] Systematic review and website presentation of validated dietary assessment tools
    Warthon-Medina, M.
    Hooson, J.
    Hancock, N.
    Gibson, L. E.
    Bush, L. A.
    Hutchinson, J.
    Greenwood, D. C.
    Robinson, S.
    Burley, V. J.
    Roe, M.
    Steers, T.
    Wark, P. A.
    Cade, J. E.
    PROCEEDINGS OF THE NUTRITION SOCIETY, 2018, 77 (OCE4) : E202 - E202
  • [26] Methodological aspects of dietary quality assessment in Brazil: systematic review
    Morais, Dayane de Castro
    de Sousa Moraes, Luis Fernando
    Guimaraes da Silva, Danielle Cristina
    Pinto, Carina Aparecida
    de Novaes, Juliana Farias
    CIENCIA & SAUDE COLETIVA, 2017, 22 (08): : 2671 - 2680
  • [27] Development of search strategies for systematic reviews in health using ChatGPT: a critical analysis
    Guimaraes, Nathalia Sernizon
    Joviano-Santos, Julliane Vasconcelos
    Reis, Marcela Gomes
    Chaves, Roberta Rayra Martins
    JOURNAL OF TRANSLATIONAL MEDICINE, 2024, 22 (01)
  • [28] Multimodal Analysis and Assessment of Therapist Empathy in Motivational Interviews
    Trang Tran
    Yin, Yufeng
    Tavabi, Leili
    Delacruz, Joannalyn
    Borsari, Brian
    Woolley, Joshua
    Scherer, Stefan
    Soleymani, Mohammad
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 406 - 415
  • [29] ChatGPT in healthcare: A taxonomy and systematic review
    Li, Jianning
    Dada, Amin
    Puladi, Behrus
    Kleesiek, Jens
    Egger, Jan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 245
  • [30] Development of search strategies for systematic reviews in health using ChatGPT: a critical analysis
    Nathalia Sernizon Guimarães
    Julliane Vasconcelos Joviano-Santos
    Marcela Gomes Reis
    Roberta Rayra Martins Chaves
    Journal of Translational Medicine, 22