A Data Cleaning Method for Big Trace Data Using Movement Consistency

被引:10
|
作者
Yang, Xue [1 ]
Tang, Luliang [1 ]
Zhang, Xia [2 ]
Li, Qingquan [3 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Hubei, Peoples R China
[2] Wuhan Univ, Sch Urban Design, Wuhan 430070, Hubei, Peoples R China
[3] Shenzhen Univ, Coll Civil Engn, Shenzhen 518060, Peoples R China
来源
SENSORS | 2018年 / 18卷 / 03期
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
data cleaning; big data; vehicle trajectory; movement consistency modeling; GPS;
D O I
10.3390/s18030824
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Given the popularization of GPS technologies, the massive amount of spatiotemporal GPS traces collected by vehicles are becoming a new kind of big data source for urban geographic information extraction. The growing volume of the dataset, however, creates processing and management difficulties, while the low quality generates uncertainties when investigating human activities. Based on the conception of the error distribution law and position accuracy of the GPS data, we propose in this paper a data cleaning method for this kind of spatial big data using movement consistency. First, a trajectory is partitioned into a set of sub-trajectories using the movement characteristic points. In this process, GPS points indicate that the motion status of the vehicle has transformed from one state into another, and are regarded as the movement characteristic points. Then, GPS data are cleaned based on the similarities of GPS points and the movement consistency model of the sub-trajectory. The movement consistency model is built using the random sample consensus algorithm based on the high spatial consistency of high-quality GPS data. The proposed method is evaluated based on extensive experiments, using GPS trajectories generated by a sample of vehicles over a 7-day period in Wuhan city, China. The results show the effectiveness and efficiency of the proposed method.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] The optimization of the big data cleaning based on task merging
    Yang D.-H.
    Li N.-N.
    Wang H.-Z.
    Li J.-Z.
    Gao H.
    Wang, Hong-Zhi (wangzh@hit.edu.cn), 1600, Science Press (39): : 97 - 108
  • [42] A Method for Quantifying Consistency in Animal Distributions Using Survey Data
    Heath, Joel P.
    Montevecchi, William A.
    Esler, Daniel
    PLOS ONE, 2012, 7 (09):
  • [43] Decomposition tree: a spatio-temporal indexing method for movement big data
    Zhenwen He
    Chonglong Wu
    Gang Liu
    Zufang Zheng
    Yiping Tian
    Cluster Computing, 2015, 18 : 1481 - 1492
  • [44] Decomposition tree: a spatio-temporal indexing method for movement big data
    He, Zhenwen
    Wu, Chonglong
    Liu, Gang
    Zheng, Zufang
    Tian, Yiping
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04): : 1481 - 1492
  • [45] FTPG: A Fine-Grained Traffic Prediction Method With Graph Attention Network Using Big Trace Data
    Fang, Mengyuan
    Tang, Luliang
    Yang, Xue
    Chen, Yang
    Li, Chaokui
    Li, Qingquan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5163 - 5175
  • [46] An Efficient Transportation Architecture for Big Data Movement
    Hu, Weisheng
    Sun, Weiqiang
    Jin, Yaohui
    Guo, Wei
    Xiao, Shilin
    2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,
  • [47] The method of Big data processing
    Shakhovska, Natalya
    PROCEEDINGS OF THE 2017 12TH INTERNATIONAL SCIENTIFIC AND TECHNICAL CONFERENCE ON COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES (CSIT 2017), VOL. 1, 2017, : 122 - 126
  • [48] Spatiotemporal Analysis of Taxi-Driver Shifts Using Big Trace Data
    Cheng, Luling
    Yang, Xue
    Tang, Luliang
    Duan, Qian
    Kan, Zihan
    Zhang, Xia
    Ye, Xinyue
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (04)
  • [49] A Data Cleaning Method for CiteSeer Dataset
    Wang, Yan
    Zhang, Hao
    Li, Yaxin
    Wang, Deyun
    Ma, Yanlin
    Zhou, Tong
    Lu, Jianguo
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2016, PT I, 2016, 10041 : 35 - 49
  • [50] Data cleaning method for distribution transformer
    Liu Y.
    Luan W.
    Xu Y.
    Wang P.
    Guo S.
    1600, Power System Technology Press (41): : 1008 - 1014