Fast Large-Scale Trajectory Clustering

被引:54
|
作者
Wang, Sheng [1 ,2 ]
Bao, Zhifeng [2 ]
Culpepper, J. Shane [2 ]
Sellis, Timos [3 ]
Qin, Xiaolin [4 ]
机构
[1] NYU, New York, NY 10003 USA
[2] RMIT Univ, Melbourne, Vic, Australia
[3] Swinburne Univ Technol, Hawthorn, Vic, Australia
[4] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 13卷 / 01期
关键词
ALGORITHM; MANAGEMENT;
D O I
10.14778/3357377.3357380
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study the problem of large-scale trajectory data clustering, k-paths, which aims to efficiently identify k "representative" paths in a road network. Unlike traditional clustering approaches that require multiple data-dependent hyperparameters, k-paths can be used for visual exploration in applications such as traffic monitoring, public transit planning, and site selection. By combining map matching with an efficient intermediate representation of trajectories and a novel edge-based distance (EBD) measure, we present a scalable clustering method to solve k-paths. Experiments verify that we can cluster millions of taxi trajectories in less than one minute, achieving improvements of up to two orders of magnitude over state-of-the-art solutions that solve similar trajectory clustering problems.
引用
收藏
页码:29 / 42
页数:14
相关论文
共 50 条
  • [1] Fast, large-scale transformation-invariant clustering
    Frey, BJ
    Jojic, N
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 721 - 727
  • [2] Large-scale Subspace Clustering by Fast Regression Coding
    Li, Jun
    Zhao, Handong
    Tao, Zhiqiang
    Fu, Yun
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2138 - 2144
  • [3] Fast algorithm for large-scale subspace clustering by LRR
    Xie, Deyan
    Nie, Feiping
    Gao, Quanxue
    Xiao, Song
    IET IMAGE PROCESSING, 2020, 14 (08) : 1475 - 1480
  • [4] A fast fuzzy clustering algorithm for large-scale datasets
    Shi, LK
    He, PL
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 203 - 208
  • [5] A novel fast heuristic to handle large-scale shape clustering
    Ismkhan, Hassan
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (01) : 160 - 169
  • [6] Fast large-scale spectral clustering by sequential shrinkage optimization
    Liu, Tie-Yan
    Yang, Huai-Yuan
    Zheng, Xin
    Qin, Tao
    Ma, Wei-Ying
    ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 319 - +
  • [7] LARGE-SCALE HIGH-DIMENSIONAL CLUSTERING WITH FAST SKETCHING
    Chatalic, Antoine
    Gribonval, Remi
    Keriven, Nicolas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4714 - 4718
  • [8] Fast Compressive Spectral Clustering for Large-Scale Sparse Graph
    Li, Ting
    Zhang, Yiming
    Liu, Hao
    Xue, Guangtao
    Liu, Ling
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (01) : 193 - 202
  • [9] YADING: Fast Clustering of Large-Scale Time Series Data
    Ding, Rui
    Wang, Qiang
    Dang, Yingnong
    Fu, Qiang
    Zhang, Haidong
    Zhang, Dongmei
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (05): : 473 - 484
  • [10] A Fast Clustering Algorithm for Modularization of Large-Scale Software Systems
    Teymourian, Navid
    Izadkhah, Habib
    Isazadeh, Ayaz
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (04) : 1451 - 1462