A Novel Heterogeneous Network for Modeling Driver Attention With Multi-Level Visual Content

Cited by: 11
Authors
Hu, Zhongxu [1]
Zhang, Yiran [1]
Li, Qinghua [2]
Lv, Chen [1]
Affiliations
[1] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 637460, Singapore
[2] Alibaba DAMO Acad, Autonomous Driving Lab, Hangzhou 311121, Peoples R China
Keywords
Feature extraction; Semantics; Visualization; Estimation; Task analysis; Optical imaging; Object detection; Driver attention modeling; multi-level visual content; heterogeneous network; semantic attention map; graph neural network; INTELLIGENT VEHICLES; AUTOMATED VEHICLES; SALIENCY DETECTION; DECISION-MAKING; PREDICTION; GAZE; INFERENCE; FOCUS;
DOI
10.1109/TITS.2022.3208004
CLC number
TU [Architectural Science];
Discipline classification code
0813;
Abstract
Driver attention modeling is a crucial technique for building human-centric intelligent driving systems. Motivated by the human visual mechanism, this study uses multi-level visual content as the model input: low-level texture features, middle-level optical flow, and high-level semantic information. A heterogeneous model that integrates graph and convolutional neural networks is proposed to handle this multi-level input. Unlike existing studies that rely on semantic segmentation, this study directly uses object detection information in an interpretable manner. To handle the detected objects, a graph attention network explicitly constructs the semantic information, rather than building latent-space features from convolutional modules as existing studies do. Further, a semantic attention module is proposed to integrate the non-Euclidean output of the graph network with the Euclidean feature maps of the convolutional neural network. Finally, the integrated features are decoded to generate a driver attention map. The proposed method is validated on three typical datasets. Comprehensive comparison and analysis demonstrate its feasibility and validity, as well as its ability to achieve state-of-the-art performance.
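The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of fusing graph output with convolutional feature maps. It is not the authors' architecture. Every name here (`graph_attention`, `rasterize`, `modulate`), the fully connected object graph, the fixed weights, and the `f * (1 + a)` fusion rule are assumptions made for illustration. Detected objects are treated as graph nodes, one simplified attention pass mixes their features, each node receives a scalar score, and the scores are painted into the objects' bounding boxes to form a grid-shaped map that re-weights a feature map.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def graph_attention(features, a_src, a_dst):
    """One simplified single-head attention pass over a fully connected graph.

    Assumption: features are already projected; a_src and a_dst are
    hypothetical attention vectors. Each output row is an attention-weighted
    average of all node features.
    """
    n, dim = len(features), len(features[0])
    src = [sum(a * f for a, f in zip(a_src, h)) for h in features]
    dst = [sum(a * f for a, f in zip(a_dst, h)) for h in features]
    updated = []
    for i in range(n):
        alpha = softmax([leaky_relu(src[i] + dst[j]) for j in range(n)])
        updated.append(
            [sum(alpha[j] * features[j][k] for j in range(n)) for k in range(dim)]
        )
    return updated


def rasterize(boxes, scores, height, width):
    """Paint each object's score into its box (x0, y0, x1, y1), end-exclusive.

    Overlapping boxes keep the larger score.
    """
    grid = [[0.0] * width for _ in range(height)]
    for (x0, y0, x1, y1), score in zip(boxes, scores):
        for y in range(max(0, y0), min(height, y1)):
            for x in range(max(0, x0), min(width, x1)):
                grid[y][x] = max(grid[y][x], score)
    return grid


def modulate(feature_map, attention_map):
    """Residual re-weighting of a single-channel feature map: f * (1 + a)."""
    return [
        [f * (1.0 + a) for f, a in zip(f_row, a_row)]
        for f_row, a_row in zip(feature_map, attention_map)
    ]


# Toy data: two detected objects on a 3 x 4 grid (all values are made up).
boxes = [(0, 0, 2, 2), (1, 1, 4, 3)]
features = [[1.0, 0.0], [0.0, 1.0]]
updated = graph_attention(features, a_src=[0.5, -0.5], a_dst=[1.0, 0.0])

# Hypothetical linear readout that turns each node into a score in (0, 1).
readout = [2.0, -1.0]
scores = [sigmoid(sum(w * v for w, v in zip(readout, h))) for h in updated]

semantic_map = rasterize(boxes, scores, height=3, width=4)
cnn_features = [[1.0] * 4 for _ in range(3)]
fused = modulate(cnn_features, semantic_map)
```

In this sketch, cells outside every box keep their original feature value, while cells inside a box are scaled up according to the score of the object covering them. A real model would learn the attention and readout weights and operate on multi-channel tensors.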
Pages: 24343-24354
Page count: 12