Labeling Big Spatial Data: A Case Study of New York Taxi Limousine Dataset

被引:0
|
作者
AlBatati, Fawaz [1 ]
Alarabi, Louai [1 ]
机构
[1] Umm Al Qura Univ, Coll Comp & Informat Syst, Dept Comp Sci, Mecca, Saudi Arabia
关键词
Unsupervised Learning; K-means Clustering Algorithm; Unlabeled data; Spatial-data; Trajectory;
D O I
10.22937/IJCSNS.2021.21.6.27
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering Unlabeled Spatial-datasets to convert them to Labeled Spatial-datasets is a challenging task specially for geographical information systems. In this research study we investigated the NYC Taxi Limousine Commission dataset and discover that all of the spatial-temporal trajectory are unlabeled Spatial-datasets, which is in this case it is not suitable for any data mining tasks, such as classification and regression. Therefore, it is necessary to convert unlabeled Spatial-datasets into labeled Spatial-datasets. In this research study we are going to use the Clustering Technique to do this task for all the Trajectory datasets. A key difficulty for applying machine learning classification algorithms for many applications is that they require a lot of labeled datasets. Labeling a Big-data in many cases is a costly process. In this paper, we show the effectiveness of utilizing a Clustering Technique for labeling spatial data that leads to a high-accuracy classifier.
引用
收藏
页码:207 / 212
页数:6
相关论文
共 50 条
  • [1] Big Data Computation of Taxi Movement in New York City
    Deri, Joya A.
    Franchetti, Franz
    Moura, Jose M. F.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2616 - 2625
  • [2] Analysis of Spatial Equity in Taxi Services: A Case Study of New York City
    Pan, Renbin
    Zhang, Shuangyan
    Yang, Hongtai
    Xie, Kun
    Wen, Yi
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2659 - 2664
  • [3] Big Data Driven Model for New York Taxi Trips Analysis
    Yu, Xuqiao
    [J]. 2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 1 - 4
  • [4] Big Data Mobile Services for New York City Taxi Riders and Drivers
    Sun, Huiyu
    McIntosh, Suzanne
    [J]. 2016 5TH IEEE INTERNATIONAL CONFERENCE ON MOBILE SERVICES (MS 2016), 2016, : 57 - 64
  • [5] Big Data Trip Classification on the New York City Taxi and Uber Sensor Network
    Sun, Huiyu
    Hu, Siyuan
    McIntosh, Suzanne
    Cao, Yi
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2018, 19 (02): : 591 - 598
  • [6] Visual Exploration of Big Spatio-Temporal Urban Data: A Study of New York City Taxi Trips
    Ferreira, Nivan
    Poco, Jorge
    Vo, Huy T.
    Freire, Juliana
    Silva, Claudio T.
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) : 2149 - 2158
  • [7] Improving Viability of Electric Taxis by Taxi Service Strategy Optimization: A Big Data Study of New York City
    Tseng, Chien-Ming
    Chau, Sid Chi-Kin
    Liu, Xue
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (03) : 817 - 829
  • [8] TAXI DATA IN NEW YORK CITY: A NETWORK PERSPECTIVE
    Deri, Joya A.
    Moura, Jose M. F.
    [J]. 2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 1829 - 1833
  • [9] Exploring the Spatio-Temporal and Behavioural Variations in Taxi Travel Based on Big Data during the COVID-19 Pandemic: A Case Study of New York City
    Li, Sen
    Bao, Shitai
    Yao, Ceyi
    Zhang, Lan
    [J]. SUSTAINABILITY, 2022, 14 (20)
  • [10] A Big Data Driven Model for Taxi Drivers' Airport Pick-up Decisions in New York City
    Yazici, M. Anil
    Kamga, Camille
    Singhal, Abhishek
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,