High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs

被引:10
|
作者
Zhang, Jianting [1 ]
You, Simin [2 ]
Gruenwald, Le [3 ]
机构
[1] CUNY, Dept Comp Sci, New York, NY 10021 USA
[2] CUNY, Grad Ctr, Dept Comp Sci, New York, NY USA
[3] Univ Oklahoma, Sch Comp Sci, Norman, OK 73019 USA
关键词
High Performance; Spatial Query; Big Data; Taxi Trip; GPGPU;
D O I
10.1109/BigData.Congress.2014.20
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
City-wide GPS recorded taxi trip data contains rich information for traffic and travel analysis to facilitate transportation planning and urban studies. However, traditional data management techniques are largely incapable of processing big taxi trip data at the scale of hundreds of millions. In this study, we aim at utilizing the General Purpose computing on Graphics Processing Units (GPGPUs) technologies to speed up processing complex spatial queries on big taxi data on inexpensive commodity GPUs. By using the land use types of tax lot polygons as a proxy for trip purposes at the pickup and drop-off locations, we formulate a taxi trip data analysis problem as a large-scale nearest neighbor spatial query problem based on point-to-polygon distance. Experiments on nearly 170 million taxi trips in the New York City (NYC) in 2009 and 735,488 tax lot polygons with 4,698,986 vertices have demonstrated the efficiency of the proposed techniques: the GPU implementations is about 10-20X faster than the host system and completes the spatial query in about a minute by using a low-end workstation equipped with an Nvidia GTX Titan GPU device with a total equipment cost of below $2,000. We further discuss several interesting patterns discovered from the query results which warrant further study. The proposed approach can be an interesting alternative to traditional MapReduce/Hadoop based approaches to processing big data with respect to performance and cost.
引用
收藏
页码:72 / 79
页数:8
相关论文
共 50 条
  • [1] DISPAQ: Distributed Profitable-Area Query from Big Taxi Trip Data
    Putri, Fadhilah Kurnia
    Song, Giltae
    Kwon, Joonho
    Rao, Praveen
    SENSORS, 2017, 17 (10)
  • [2] High-Performance Computing for Big Data Processing
    Wu, Yulei
    Xiang, Yang
    Ge, Jingguo
    Muller, Peter
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 88 : 693 - 695
  • [3] Characterizing the urban spatial structure using taxi trip big data and implications for urban planning
    Haibo LI
    Xiaocong XU
    Xia LI
    Shifa MA
    Honghui ZHANG
    Frontiers of Earth Science, 2021, (01) : 70 - 80
  • [4] Characterizing the urban spatial structure using taxi trip big data and implications for urban planning
    Haibo Li
    Xiaocong Xu
    Xia Li
    Shifa Ma
    Honghui Zhang
    Frontiers of Earth Science, 2021, 15 : 70 - 80
  • [5] Characterizing the urban spatial structure using taxi trip big data and implications for urban planning
    Li, Haibo
    Xu, Xiaocong
    Li, Xia
    Ma, Shifa
    Zhang, Honghui
    FRONTIERS OF EARTH SCIENCE, 2021, 15 (01) : 70 - 80
  • [6] Towards Parallel Spatial Query Processing for Big Spatial Data
    Zhong, Yunqin
    Han, Jizhong
    Zhang, Tieying
    Li, Zhenhua
    Fang, Jinyun
    Chen, Guihai
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2085 - 2094
  • [7] Multilevel Data Processing Using Parallel Algorithms for Analyzing Big Data in High-Performance Computing
    Ahmad, Awais
    Paul, Anand
    Din, Sadia
    Rathore, M. Mazhar
    Choi, Gyu Sang
    Jeon, Gwanggil
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (03) : 508 - 527
  • [8] Query Processing Techniques for Big Spatial-Keyword Data
    Mahmood, Ahmed
    Aref, Walid G.
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1777 - 1782
  • [9] Multilevel Data Processing Using Parallel Algorithms for Analyzing Big Data in High-Performance Computing
    Awais Ahmad
    Anand Paul
    Sadia Din
    M. Mazhar Rathore
    Gyu Sang Choi
    Gwanggil Jeon
    International Journal of Parallel Programming, 2018, 46 : 508 - 527
  • [10] PARALLELISM FOR HIGH-PERFORMANCE QUERY-PROCESSING
    WINTERS, VG
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 580 : 344 - 356