BjTT: A Large-Scale Multimodal Dataset for Traffic Prediction

Authors
Zhang, Chengyang [1]
Zhang, Yong [1]
Shao, Qitan [1]
Feng, Jiangtao [1]
Li, Bo [1]
Lv, Yisheng [2]
Piao, Xinglin [1]
Yin, Baocai [1]
Affiliations
[1] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Sch Informat Sci & Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Roads; Social networking (online); Transportation; Data collection; Task analysis; Blogs; Meteorology; Traffic prediction; large-scale; new dataset; FLOW; NETWORKS; MODELS;
DOI
10.1109/TITS.2024.3440650
Chinese Library Classification
TU [Building Science]
Discipline classification code
0813
Abstract
Traffic prediction plays a significant role in Intelligent Transportation Systems (ITS). Although many datasets have been introduced to support the study of traffic prediction, most of them provide only time-series traffic data. However, urban transportation systems are susceptible to various factors, including unusual weather and traffic accidents. Relying solely on historical data therefore greatly limits prediction accuracy. In this paper, we introduce Beijing Text-Traffic (BjTT), a large-scale multimodal dataset for traffic prediction. BjTT comprises over 32,000 time-series traffic records, capturing velocity and congestion levels on more than 1,200 roads within the 5th ring area of Beijing. Each piece of traffic data is coupled with a text describing the traffic system (including time, location, and events). We detail the data collection and processing procedures and present a statistical analysis of the BjTT dataset. Furthermore, we conduct comprehensive experiments on the dataset with state-of-the-art traffic prediction methods and text-guided generative models, which reveal the unique characteristics of BjTT. The dataset is available at https://github.com/ChyaZhang/BjTT.
Pages: 18992-19003
Number of pages: 12