High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs

被引:10
|
作者
Zhang, Jianting [1 ]
You, Simin [2 ]
Gruenwald, Le [3 ]
机构
[1] CUNY, Dept Comp Sci, New York, NY 10021 USA
[2] CUNY, Grad Ctr, Dept Comp Sci, New York, NY USA
[3] Univ Oklahoma, Sch Comp Sci, Norman, OK 73019 USA
关键词
High Performance; Spatial Query; Big Data; Taxi Trip; GPGPU;
D O I
10.1109/BigData.Congress.2014.20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
City-wide GPS recorded taxi trip data contains rich information for traffic and travel analysis to facilitate transportation planning and urban studies. However, traditional data management techniques are largely incapable of processing big taxi trip data at the scale of hundreds of millions. In this study, we aim at utilizing the General Purpose computing on Graphics Processing Units (GPGPUs) technologies to speed up processing complex spatial queries on big taxi data on inexpensive commodity GPUs. By using the land use types of tax lot polygons as a proxy for trip purposes at the pickup and drop-off locations, we formulate a taxi trip data analysis problem as a large-scale nearest neighbor spatial query problem based on point-to-polygon distance. Experiments on nearly 170 million taxi trips in the New York City (NYC) in 2009 and 735,488 tax lot polygons with 4,698,986 vertices have demonstrated the efficiency of the proposed techniques: the GPU implementations is about 10-20X faster than the host system and completes the spatial query in about a minute by using a low-end workstation equipped with an Nvidia GTX Titan GPU device with a total equipment cost of below $2,000. We further discuss several interesting patterns discovered from the query results which warrant further study. The proposed approach can be an interesting alternative to traditional MapReduce/Hadoop based approaches to processing big data with respect to performance and cost.
引用
收藏
页码:72 / 79
页数:8
相关论文
共 50 条
  • [21] High Performance Analysis of Big Spatial Data
    Haynes, David
    Ray, Suprio
    Manson, Steven M.
    Soni, Ankit
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1953 - 1957
  • [22] Construct Trip Graphs by Using Taxi Trajectory Data
    Hao Yu
    Xi Guo
    Xiao Luo
    Weihao Bian
    Taohong Zhang
    Data Science and Engineering, 2023, 8 : 1 - 22
  • [23] Incremental Query Processing on Big Data Streams
    Fegaras, Leonidas
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (11) : 2998 - 3012
  • [24] Contributions to High-Performance Big Data Computing
    Fox, Geoffrey
    Qiu, Judy
    Crandall, David
    Von Laszewski, Gregor
    Beckstein, Oliver
    Paden, John
    Paraskevakos, Ioannis
    Jha, Shantenu
    Wang, Fusheng
    Marathe, Madhav
    Vullikanti, Anil
    Cheatham, Thomas
    FUTURE TRENDS OF HPC IN A DISRUPTIVE SCENARIO, 2019, 34 : 34 - 81
  • [25] Benchmarking Elastic Query Processing on Big Data
    Vorona, Dimitri
    Funke, Florian
    Kemper, Alfons
    Neumann, Thomas
    BIG DATA BENCHMARKING, WBDB 2014, 2015, 8991 : 37 - 44
  • [26] Construct Trip Graphs by Using Taxi Trajectory Data
    Yu, Hao
    Guo, Xi
    Luo, Xiao
    Bian, Weihao
    Zhang, Taohong
    DATA SCIENCE AND ENGINEERING, 2023, 8 (01) : 1 - 22
  • [27] Multimedia processing using deep learning technologies, high-performance computing cloud resources, and Big Data volumes
    Mahmoudi, Sidi Ahmed
    Belarbi, Mohammed Amin
    Mahmoudi, Said
    Belalem, Ghalem
    Manneback, Pierre
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (17):
  • [28] Design and Implementation of a High-performance Stream-oriented Big Data Processing System
    Wang, Meng
    Liu, Jun
    Zhou, Wenli
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 1, 2016, : 363 - 368
  • [29] Spatial query processing for high resolutions
    Kriegel, HP
    Pfeifle, M
    Pötke, M
    Seidl, T
    EIGHTH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2003, : 17 - 26
  • [30] On Performance Prediction of Big Data Transfer in High-performance Networks
    Liu, Wuji
    Yun, Daqing
    Wu, Chase Q.
    Rao, Nageswara S., V
    Hou, Aiqin
    Shen, Wei
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,