A Revised Video Vision Transformer for Traffic Estimation With Fleet Trajectories

被引:2
|
作者
Li, Duo [1 ,2 ]
Lasenby, Joan [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Nottingham Trent Univ, Dept Engn, Nottingham NG1 4FQ, England
基金
英国工程与自然科学研究理事会;
关键词
Transformers; Trajectory; Estimation; Sensors; Computer architecture; Roads; Monitoring; Traffic estimation; vehicle trajectory; deep learning; STATE ESTIMATION; MISSING DATA; MODEL; FLOW; HIGHWAY; WAVES;
D O I
10.1109/JSEN.2022.3193663
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time traffic monitoring represents a key component for transportation management. The increasing penetration rate of connected vehicles with positioning devices encourages the utilization of trajectory data for real-time traffic monitoring. The use of commercial fleet trajectory data could be seen as the first step towards mobile sensing networks. The main objective of this research is to estimate space occupancy of a single road segment with partially observed trajectories (commercial fleet trajectories in our case). We first formulate the trajectory-based traffic estimation as a video computing problem. Then, we reconstruct trajectory series into video-like data by performing spatial discretization. Following this, video input is embedded using a tubelet embedding strategy. Finally, a Revised Video Vision Transformer (RViViT) is proposed to estimate traffic state from video embeddings. The proposed RViViT is tested on a public dataset of naturalistic vehicle trajectories collected from German highways around Cologne during 2017 and 2018. The results witness the effectiveness of the proposed method in traffic estimation with partially observed trajectories.
引用
收藏
页码:17103 / 17112
页数:10
相关论文
共 50 条
  • [1] ViViT: A Video Vision Transformer
    Arnab, Anurag
    Dehghani, Mostafa
    Heigold, Georg
    Sun, Chen
    Lucic, Mario
    Schmid, Cordelia
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6816 - 6826
  • [2] Video Summarization With Spatiotemporal Vision Transformer
    Hsu, Tzu-Chun
    Liao, Yi-Sheng
    Huang, Chun-Rong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3013 - 3026
  • [3] TransAnomaly: Video Anomaly Detection Using Video Vision Transformer
    Yuan, Hongchun
    Cai, Zhenyu
    Zhou, Hui
    Wang, Yue
    Chen, Xiangzhi
    [J]. IEEE ACCESS, 2021, 9 : 123977 - 123986
  • [4] Video Summarization With Frame Index Vision Transformer
    Hsu, Tzu-Chun
    Liao, Yi-Sheng
    Huang, Chun-Rong
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [5] Video captioning based on vision transformer and reinforcement learning
    Zhao, Hong
    Chen, Zhiwen
    Guo, Lan
    Han, Zeyu
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [6] Temporally Efficient Vision Transformer for Video Instance Segmentation
    Yang, Shusheng
    Wang, Xinggang
    Li, Yu
    Fang, Yuxin
    Fang, Jiemin
    Liu, Wenyu
    Zhao, Xun
    Shan, Ying
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2875 - 2885
  • [7] Video captioning based on vision transformer and reinforcement learning
    Zhao H.
    Chen Z.
    Guo L.
    Han Z.
    [J]. PeerJ Computer Science, 2022, 8
  • [8] Utilization of Vision Transformer for Classification and Ranking of Video Distortions
    AlDahoul, Nouar
    Karim, Hezerul Abdul
    Tan, Myles Joshua Toledo
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2022, 2023, 13739 : 195 - 204
  • [9] Self-Supervised RGB-NIR Fusion Video Vision Transformer Framework for rPPG Estimation
    Park, Soyeon
    Kim, Bo-Kyeong
    Dong, Suh-Yeon
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [10] ViTA: Video Transformer Adaptor for Robust Video Depth Estimation
    Xian, Ke
    Peng, Juewen
    Cao, Zhiguo
    Zhang, Jianming
    Lin, Guosheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3302 - 3316