Improved OpenCL-Based Implementation of Social Field Pedestrian Model

被引:10
|
作者
Yu, Bin [1 ]
Zhu, Ke [2 ]
Wu, Kaiteng [2 ]
Zhang, Michael [3 ]
机构
[1] Tongji Univ, Key Lab Rd & Traff Engn, Minist Educ, Shanghai 201804, Peoples R China
[2] Tongji Univ, Coll Transportat Engn, Shanghai 201804, Peoples R China
[3] Univ Calif Davis, Dept Civil & Environm Engn, Davis, CA 95616 USA
关键词
Computational modeling; Graphics processing units; Force; Numerical models; Legged locomotion; Computer architecture; Intelligent transportation systems; Algorithm; crowd dynamics; heterogeneous parallel computing; OpenCL; pedestrian flow; CELLULAR-AUTOMATON MODEL; JAMMING TRANSITION; ROUTE CHOICE; SIMULATION; DYNAMICS; EVACUATION; ALGORITHM;
D O I
10.1109/TITS.2019.2920868
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Two aspects of improvements are proposed for the OpenCL-based implementation of the social field pedestrian model. In the aspect of algorithm, a method based on the idea of divide-and-conquer is devised in order to overcome the problem of global memory depletion when fields are of a larger size. This is of importance for the study of finer pedestrian walking behavior, which usually requires larger fields. In the aspect of computation, the OpenCL heterogeneous framework is thoroughly studied. Factors that may affect the numerical efficiency are evaluated, with regarding to the social field model previously proposed. This includes usage of local memory, deliberate patch of data structures for avoidance of bank conflicts, and so on. Experiments disclose that the numerical efficiency is brought to an even higher level. Compared with the CPU model and the previous GPU model, the present GPU model can be at most 71.56 and 13.3 times faster, respectively.
引用
收藏
页码:2828 / 2839
页数:12
相关论文
共 50 条
  • [1] Discretization and OpenCL-based implementation of social field pedestrian model
    Yu, Bin
    Wu, Kaiteng
    Zhu, Ke
    Ji, Yuhan
    Zhang, Michael
    Yang, Xinwen
    NEUROCOMPUTING, 2018, 315 : 299 - 309
  • [2] OpenCL-Based Performance Enhancement of Model Transformations
    Fekete, Tamas
    Mezei, Gergely
    IWOCL'18: PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL, 2018, : 89 - 90
  • [3] Introduction of an OpenCL-Based Model Transformation Engine
    Fekete, Tamas
    Mezei, Gergely
    SOFTWARE TECHNOLOGIES: APPLICATIONS AND FOUNDATIONS, STAF 2017, 2018, 10748 : 14 - 19
  • [4] Opencl-pytorch: an OpenCL-based extension of PyTorch
    Sui, Yicheng
    Sun, Yufei
    Shi, Changqing
    Wang, Haotian
    Zhang, Zhiqiang
    Wang, Jiahao
    Zhang, Yuzhi
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2024, 6 (03) : 287 - 300
  • [5] OpenCL-Darknet: implementation and optimization of OpenCL-based deep learning object detection framework
    Koo, Yongbon
    Kim, Sunghoon
    Ha, Young-guk
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (04): : 1299 - 1319
  • [6] OpenCL-Darknet: implementation and optimization of OpenCL-based deep learning object detection framework
    Yongbon Koo
    Sunghoon Kim
    Young-guk Ha
    World Wide Web, 2021, 24 : 1299 - 1319
  • [7] An OpenCL-based feature matcher
    Condello, Giovanni
    Pasteris, Paolo
    Pau, Danilo
    Sami, Mariagiovanna
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (04) : 345 - 350
  • [8] Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone
    Wang, Guohui
    Rister, Blaine
    Cavallaro, Joseph R.
    2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 759 - 762
  • [9] Melia: A MapReduce Framework on OpenCL-Based FPGAs
    Wang, Zeke
    Zhang, Shuhao
    He, Bingsheng
    Zhang, Wei
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (12) : 3547 - 3560
  • [10] OpenCL-Based Erasure Coding on Heterogeneous Architectures
    Chen, Guoyang
    Zhou, Huiyang
    Shen, Xipeng
    Gahm, Josh
    Venkat, Narayan
    Booth, Skip
    Marshall, John
    2016 IEEE 27TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2016, : 33 - 40