InteraRec: Interactive Recommendations Using Multimodal Large Language Models

Cited by: 2
Authors
Karra, Saketh Reddy [1]
Tulabandhula, Theja [1]
Affiliation
[1] Univ Illinois, Chicago, IL 60607 USA
Keywords
Large language models; Screenshots; User preferences; Recommendations;
DOI
10.1007/978-981-97-2650-9_3
CLC Classification
TP18 [Theory of Artificial Intelligence];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Numerous recommendation algorithms leverage weblogs, employing strategies such as collaborative filtering, content-based filtering, and hybrid methods to provide personalized recommendations to users. Weblogs, composed of records detailing user activities on a website, offer valuable insights into user preferences, behavior, and interests. Despite the wealth of information weblogs provide, extracting relevant features requires extensive feature engineering. The intricate nature of the data also poses a challenge for interpretation, especially for non-experts. Additionally, weblogs often fall short of capturing the visual details and contextual nuances that influence user choices. In the present study, we introduce an interactive recommendation framework, InteraRec, which diverges from conventional approaches that depend exclusively on weblogs for recommendation generation. The framework provides recommendations by capturing high-frequency screenshots of web pages as users navigate a website. Leveraging multimodal large language models (MLLMs), we extract insights into user preferences from these screenshots by generating a user profile summary; InteraRec then extracts the relevant information from this summary to generate recommendations. Through extensive experiments, we demonstrate the effectiveness of our recommendation system in providing users with valuable and personalized offerings.
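The abstract outlines a three-stage pipeline: capture screenshots during browsing, have an MLLM condense them into a user profile summary, then generate recommendations from that summary. A minimal structural sketch of that flow is below; all names, the `Screenshot` stand-in for pixel data, and the toy word-overlap ranking are hypothetical illustrations, not the paper's actual MLLM prompting or optimization procedure.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Screenshot:
    page_url: str
    viewed_items: List[str]  # items visible in the capture (stand-in for raw pixels)

def summarize_profile(shots: List[Screenshot]) -> str:
    """Stand-in for the MLLM step: condense screenshots into a profile summary."""
    seen: List[str] = []
    for shot in shots:
        for item in shot.viewed_items:
            if item not in seen:  # keep first-seen order, drop repeats
                seen.append(item)
    return "User browsed: " + ", ".join(seen)

def recommend(summary: str, catalog: List[str], k: int = 2) -> List[str]:
    """Toy ranking: prefer catalog items sharing words with the profile summary."""
    words = set(summary.lower().replace(",", "").split())
    scored = sorted(catalog, key=lambda c: -len(set(c.lower().split()) & words))
    return scored[:k]

shots = [Screenshot("shop/shoes", ["running shoes"]),
         Screenshot("shop/watches", ["sports watch"])]
summary = summarize_profile(shots)
recs = recommend(summary, ["running socks", "sports headband", "office chair"])
print(recs)  # -> ['running socks', 'sports headband']
```

In the paper's setting, `summarize_profile` would be a call to a multimodal model over actual page screenshots, and `recommend` would use the extracted preferences to drive an optimization-based recommender rather than simple word overlap.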
Pages: 32 - 43
Page count: 12
Related Papers
50 in total
  • [1] Chat with the Environment: Interactive Multimodal Perception Using Large Language Models
    Zhao, Xufeng
    Li, Mengdi
    Weber, Cornelius
    Hafez, Muhammad Burhan
    Wermter, Stefan
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3590 - 3596
  • [2] Leveraging Large Language Models for Goal-aware Interactive Recommendations
    Said, Alan
    Willemsen, Martijn
    Marinho, Leandro Balby
    Silva, Itallo
    PROCEEDINGS OF THE 11TH CONFERENCE ON HUMAN-AGENT INTERACTION, HAI 2023, 2023, : 464 - 466
  • [3] Explaining Social Recommendations Using Large Language Models
    Ashaduzzaman, Md.
    Thi Nguyen
    Tsai, Chun-Hua
    NEW TRENDS IN DISRUPTIVE TECHNOLOGIES, TECH ETHICS, AND ARTIFICIAL INTELLIGENCE, DITTET 2024, 2024, 1459 : 73 - 84
  • [4] Using Augmented Small Multimodal Models to Guide Large Language Models for Multimodal Relation Extraction
    He, Wentao
    Ma, Hanjie
    Li, Shaohua
    Dong, Hui
    Zhang, Haixiang
    Feng, Jie
APPLIED SCIENCES-BASEL, 2023, 13 (22)
  • [5] A survey on multimodal large language models
    Yin, Shukang
    Fu, Chaoyou
    Zhao, Sirui
    Li, Ke
    Sun, Xing
    Xu, Tong
    Chen, Enhong
    NATIONAL SCIENCE REVIEW, 2024, 11 (12) : 277 - 296
  • [6] Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging
    Harrison, Rachel M.
    Dereventsov, Anton
    Bibin, Anton
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1535 - 1542
  • [7] From Large Language Models to Large Multimodal Models: A Literature Review
    Huang, Dawei
    Yan, Chuan
    Li, Qing
    Peng, Xiaojiang
    APPLIED SCIENCES-BASEL, 2024, 14 (12)
  • [8] Emotion Recognition from Videos Using Multimodal Large Language Models
    Vaiani, Lorenzo
    Cagliero, Luca
    Garza, Paolo
    FUTURE INTERNET, 2024, 16 (07)
  • [9] A comprehensive survey of large language models and multimodal large models in medicine
    Xiao, Hanguang
    Zhou, Feizhong
    Liu, Xingyue
    Liu, Tianqi
    Li, Zhipeng
    Liu, Xin
    Huang, Xiaoxuan
    INFORMATION FUSION, 2025, 117