LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AI

Cited by: 0
Authors
Medisetti, Gowtham [1]
Compson, Zacchaeus [1]
Fan, Heng [1]
Yang, Huaxiao [1]
Feng, Yunhe [1]
Affiliation
[1] Univ North Texas, Denton, TX 76205 USA
Keywords
Literature Mining; OCR; Generative AI; Prompt Engineering; ChatGPT; GPT-4;
DOI
10.1109/MIPR62202.2024.00080
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Information processing and retrieval in literature are critical for advancing scientific research and knowledge discovery. The inherent multimodality and diverse literature formats, including text, tables, and figures, present significant challenges in literature information retrieval. This paper introduces LitAI, a novel approach that employs readily available generative AI tools to enhance multimodal information retrieval from literature documents. By integrating tools such as optical character recognition (OCR) with generative AI services, LitAI facilitates the retrieval of text, tables, and figures from PDF documents. We have developed specific prompts that leverage in-context learning and prompt engineering within Generative AI to achieve precise information extraction. Our empirical evaluations, conducted on datasets from the ecological and biological sciences, demonstrate the superiority of our approach over several established baselines including Tesseract-OCR and GPT-4. The implementation of LitAI is accessible at https://github.com/ResponsibleAILab/LitAI.
Pages: 471-476
Page count: 6