Personalized Fashion Recommendation with Visual Explanations based on Multimodal Attention Network

被引：132

作者：

Chen, Xu ^{[1
]}

Chen, Hanxiong ^{[2
]}

Xu, Hongteng ^{[3
,4
]}

Zhang, Yongfeng ^{[2
]}

Cao, Yixin ^{[5
]}

Qin, Zheng ^{[1
]}

Zha, Hongyuan ^{[6
]}

机构：

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Rutgers State Univ, New Brunswick, NJ USA

[3] Duke Univ, Durham, NC 27706 USA

[4] InfiniaML Inc, Durham, NC USA

[5] Natl Univ Singapore, Singapore, Singapore

[6] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1145/3331184.3331254

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Fashion recommendation has attracted increasing attention from both industry and academic communities. This paper proposes a novel neural architecture for fashion recommendation based on both image region-level features and user review information. Our basic intuition is that: for a fashion image, not all the regions are equally important for the users, i.e., people usually care about a few parts of the fashion image. To model such human sense, we learn an attention model over many pre-segmented image regions, based on which we can understand where a user is really interested in on the image, and correspondingly, represent the image in a more accurate manner. In addition, by discovering such fine-grained visual preference, we can visually explain a recommendation by highlighting some regions of its image. For better learning the attention model, we also introduce user review information as a weak supervision signal to collect more comprehensive user preference. In our final framework, the visual and textual features are seamlessly coupled by a multimodal attention network. Based on this architecture, we can not only provide accurate recommendation, but also can accompany each recommended item with novel visual explanations. We conduct extensive experiments to demonstrate the superiority of our proposed model in terms of Top-N recommendation, and also we build a collectively labeled dataset for evaluating our provided visual explanations in a quantitative manner.

引用

页码：765 / 774

页数：10

共 50 条

[31] Media Personalized Recommendation System Based on Network Algorithm
Jiang, Wendan
[J]. 2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 2061 - 2066
[32] RESEARCH ON PERSONALIZED RECOMMENDATION ALGORITHM BASED ON SOCIAL NETWORK
Zhu, Linke
Ge, Wei
[J]. FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 111 - 115
[33] Bibliographic Network Representation Based Personalized Citation Recommendation
Cai, Xiaoyan
Zheng, Yu
Yang, Libin
Dai, Tao
Guo, Lantian
[J]. IEEE ACCESS, 2019, 7 : 457 - 467
[34] Research on Personalized Recommendation Algorithm Based on Dynamic Network
Ling, Kun
Jiang, Jiulei
Li, Shengqing
[J]. 2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 940 - 945
[35] Vman: visual-modified attention network for multimodal paradigms
Song, Xiaoyu
Han, Dezhi
Chen, Chongqing
Shen, Xiang
Wu, Huafeng
[J]. VISUAL COMPUTER, 2024,
[36] VistaNet: Visual Aspect Attention Network for Multimodal Sentiment Analysis
Quoc-Tuan Truong
Lauw, Hady W.
[J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 305 - 312
[37] VisdaNet: Visual Distillation and Attention Network for Multimodal Sentiment Classification
Hou, Shangwu
Tuerhong, Gulanbaier
Wushouer, Mairidan
[J]. SENSORS, 2023, 23 (02)
[38] Co-Attention Memory Network for Multimodal Microblog's Hashtag Recommendation
Ma, Renfeng
Qiu, Xipeng
Zhang, Qi
Hu, Xiangkun
Jiang, Yu-Gang
Huang, Xuanjing
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 388 - 400
[39] Fashion Coordinates Recommendation based on User Behavior and Visual Clothing Style
Gu, Sida
Liu, Xiaoqiang
Cai, Lizhi
Shen, Jie
[J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2017), 2017, : 185 - 189
[40] Personalized Session-Based Recommendation Using Graph Attention Networks
Xie, Yongquan
Li, Zhengru
Qin, Tian
Tseng, Finn
Johannes, Kristinsson
Qiu, Shiqi
Murphey, Yi Lu
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,

← 1 2 3 4 5 →