Fast Large-Scale Trajectory Clustering

Cited by: 54
Authors
Wang, Sheng [1, 2]
Bao, Zhifeng [2 ]
Culpepper, J. Shane [2 ]
Sellis, Timos [3 ]
Qin, Xiaolin [4 ]
Affiliations
[1] NYU, New York, NY 10003 USA
[2] RMIT Univ, Melbourne, Vic, Australia
[3] Swinburne Univ Technol, Hawthorn, Vic, Australia
[4] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019 / Volume 13 / Issue 01
Keywords
ALGORITHM; MANAGEMENT;
DOI
10.14778/3357377.3357380
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In this paper, we study the problem of large-scale trajectory data clustering, k-paths, which aims to efficiently identify k "representative" paths in a road network. Unlike traditional clustering approaches that require multiple data-dependent hyperparameters, k-paths can be used for visual exploration in applications such as traffic monitoring, public transit planning, and site selection. By combining map matching with an efficient intermediate representation of trajectories and a novel edge-based distance (EBD) measure, we present a scalable clustering method to solve k-paths. Experiments verify that we can cluster millions of taxi trajectories in less than one minute, achieving improvements of up to two orders of magnitude over state-of-the-art solutions that solve similar trajectory clustering problems.
Pages: 29 - 42
Number of pages: 14
Related papers
50 in total
  • [21] A large-scale clustering and 3D trajectory optimization approach for UAV swarms
    Ting MA
    Haibo ZHOU
    Bo QIAN
    Aiyong FU
    Science China (Information Sciences), 2021, 64 (04) : 84 - 99
  • [22] A large-scale clustering and 3D trajectory optimization approach for UAV swarms
    Ting Ma
    Haibo Zhou
    Bo Qian
    Aiyong Fu
    Science China Information Sciences, 2021, 64
  • [23] HGC: fast hierarchical clustering for large-scale single-cell data
    Zou, Ziheng
    Hua, Kui
    Zhang, Xuegong
    BIOINFORMATICS, 2021, 37 (21) : 3964 - 3965
  • [24] A fast hierarchical clustering algorithm for large-scale protein sequence data sets
    Szilagyi, Sandor M.
    Szilagyi, Laszlo
    COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 48 : 94 - 101
  • [25] Fast spectral clustering learning with hierarchical bipartite graph for large-scale data
    Yang, Xiaojun
    Yu, Weizhong
    Wang, Rong
    Zhang, Guohao
    Nie, Feiping
    PATTERN RECOGNITION LETTERS, 2020, 130 : 345 - 352
  • [26] Large-scale clustering as a probe of the origin and the host environment of fast radio bursts
    Shirasaki, Masato
    Kashiyama, Kazumi
    Yoshida, Naoki
    PHYSICAL REVIEW D, 2017, 95 (08)
  • [27] Large-Scale K-Clustering
    Voevodski, Konstantin
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (09)
  • [28] A matheuristic for large-scale capacitated clustering
    Gnagi, Mario
    Baumann, Philipp
    COMPUTERS & OPERATIONS RESEARCH, 2021, 132
  • [29] The large-scale clustering of radio sources
    Negrello, M
    Magliocchetti, M
    De Zotti, G
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2006, 368 (02) : 935 - 942
  • [30] Large-scale parallel data clustering
    Judd, D
    McKinley, PK
    Jain, AK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (08) : 871 - 876