Big Data Trip Classification on the New York City Taxi and Uber Sensor Network

被引:6
|
作者
Sun, Huiyu [1 ]
Hu, Siyuan [1 ]
McIntosh, Suzanne [1 ]
Cao, Yi [2 ,3 ]
机构
[1] NYU, Dept Comp Sci, New York, NY 10003 USA
[2] Nanjing Univ Informat Sci & Technol, Jiangsu Engn Ctr Network Monitoring, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing, Jiangsu, Peoples R China
来源
JOURNAL OF INTERNET TECHNOLOGY | 2018年 / 19卷 / 02期
关键词
Big data; Classification; Mobile sensor network; NYC taxi; Uber;
D O I
10.3966/160792642018031902027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Millions of trips are made every day by taxis and Uber in New York City. We first employ big data technologies to analyze this vast dataset: Apache Spark is used for data processing and classification, Apache Hive is used for data storage, and MapReduce is used for data profiling. Since taxis and Uber are equipped with GPS sensors, we then visualize a mobile sensor network over New York City separated into fine-sized regions each acting as a mobile sensing node. Each location on the network falls into a region and is classified into one of three categories based on which service dominates the particular region: Yellow taxi, Green taxi, or Uber. We utilize logistic regression to classify a region into one of the three categories. Our classification algorithm is then used to analyze the interaction between taxi and Uber, for example to quantify the expansion of Uber. Experiments run on the Spark cluster show our classifier achieves an accuracy of over 85% scored on the 2014 taxi and Uber dataset. Finally, we propose a trip recommendation system for users using classification results together with a web service application.
引用
收藏
页码:591 / 598
页数:8
相关论文
共 50 条
  • [1] Big Data Computation of Taxi Movement in New York City
    Deri, Joya A.
    Franchetti, Franz
    Moura, Jose M. F.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2616 - 2625
  • [2] TAXI DATA IN NEW YORK CITY: A NETWORK PERSPECTIVE
    Deri, Joya A.
    Moura, Jose M. F.
    [J]. 2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 1829 - 1833
  • [3] Big Data Mobile Services for New York City Taxi Riders and Drivers
    Sun, Huiyu
    McIntosh, Suzanne
    [J]. 2016 5TH IEEE INTERNATIONAL CONFERENCE ON MOBILE SERVICES (MS 2016), 2016, : 57 - 64
  • [4] Modeling Taxi Trip Demand by Time of Day in New York City
    Yang, Ci
    Gonzales, Eric J.
    [J]. TRANSPORTATION RESEARCH RECORD, 2014, (2429) : 110 - 120
  • [5] Using 'Big Data' to understand the impacts of Uber on taxis in New York City
    Willis, George
    Tranos, Emmanouil
    [J]. TRAVEL BEHAVIOUR AND SOCIETY, 2021, 22 : 94 - 107
  • [6] New York City taxi trip duration prediction using MLP and XGBoost
    Poongodi, M.
    Malviya, Mohit
    Kumar, Chahat
    Hamdi, Mounir
    Vijayakumar, V.
    Nebhen, Jamel
    Alyamani, Hasan
    [J]. INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022, 13 (SUPPL 1) : 16 - 27
  • [7] New York City taxi trip duration prediction using MLP and XGBoost
    M Poongodi
    Mohit Malviya
    Chahat Kumar
    Mounir Hamdi
    V Vijayakumar
    Jamel Nebhen
    Hasan Alyamani
    [J]. International Journal of System Assurance Engineering and Management, 2022, 13 : 16 - 27
  • [8] The Trip to New York City
    星彩
    [J]. 英语大王, 2009, (06) : 10 - 15
  • [9] Revealing spatiotemporal travel demand and community structure characteristics with taxi trip data: A case study of New York City
    Xie, Chen
    Yu, Dexin
    Zheng, Xiaoyu
    Wang, Zhuorui
    Jiang, Zhongtai
    [J]. PLOS ONE, 2021, 16 (11):
  • [10] Big Data Driven Model for New York Taxi Trips Analysis
    Yu, Xuqiao
    [J]. 2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 1 - 4