Spatiotemporal Analysis of Traffic Data: Correspondence Analysis with Fuzzified Variables vs. Principal Component Analysis Using Weather and Gas Price as Extra Data

被引:0
|
作者
Loslever, Pierre [1 ,2 ]
机构
[1] Univ Polytech Hauts de France, Lab Automat Mecan & Informat Ind & Humaines, UMR CNRS 8201, Campus Mt Houy, F-59313 Valenciennes 9, France
[2] Polytech Univ Hauts France, LAMIH, Valenciennes, France
来源
NETWORKS & SPATIAL ECONOMICS | 2024年 / 24卷 / 03期
关键词
Traffic analysis; Origin-destination matrix; Correspondence analysis; Principal component analysis; Economics; Weather; Fuzzy sets; TRAVEL; PATTERNS; MATRICES; PRIVACY; DEMAND; MODEL; CITY;
D O I
10.1007/s11067-024-09624-4
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Study of large rail traffic databases presents formidable challenges for transport system specialists, more particularly while keeping both space and time factors together with the possibility of showing influencing factors related to the users and the transport network environment. To perform such a study, a bibliographic analysis in both statistics and transport revealed that geometrical methods for feature extraction and dimension reduction can be seen as suitable. Since there are several methods/options with, in principle, required input data, this article aims at comparing Principal Component Analysis (PCA) and Correspondence Analysis (CA) for traffic frequency data, both methods being actually used with such data. The procedure stands as follows. First a grand matrix is built where the rows correspond to time windows and the columns to all the possible origin-destination links. Then this large frequency matrix is studied using PCA and CA. The next part of the procedure consists in studying the effects of influencing factors with the possibility of keeping the quantitative scales with PCA or using fuzzy segmentation with CA, the corresponding data being considered as supplementary column points. The procedure is applied on a rail transport network including 10 stations (one corresponding to the airport) and one-hour time windows for 4 months, the available influencing factors being the temperature, rain level and gas price. The comparative analysis shows that CA graphical outputs are more complicated than PCA ones, but reveal more specific results, e.g. the network user behavior related to the airport, while PCA mainly opposes link clusters with low vs. high frequencies. Fuzzy windowing performed using actual and simulated data reduces the loss of information when averaging, e.g. over time, and can show non-linear relational phenomena. The possibility of displaying new traffic data in real time is also considered.
引用
收藏
页码:531 / 563
页数:33
相关论文
共 50 条
  • [12] Weather data mining using independent component analysis
    Basak, J
    Sudarshan, A
    Trivedi, D
    Santhanam, MS
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 239 - 253
  • [13] Principal component analysis of interval data: a symbolic data analysis approach
    Carlo N. Lauro
    Francesco Palumbo
    Computational Statistics, 2000, 15 : 73 - 87
  • [14] Principal component analysis of interval data: a symbolic data analysis approach
    Lauro, CN
    Palumbo, F
    COMPUTATIONAL STATISTICS, 2000, 15 (01) : 73 - 87
  • [15] Application of Principal Component Analysis for Outlier Detection in Heterogeneous Traffic Data
    Saha, Pritam
    Roy, Nabanita
    Mukherjee, Deotima
    Sarkar, Ashoke Kumar
    7TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2016) / THE 6TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2016) / AFFILIATED WORKSHOPS, 2016, 83 : 107 - 114
  • [16] A Comparative Study on Application of Data Mining Technique in Human Shape Clustering: Principal Component Analysis vs. Factor Analysis
    Niu, Jianwei
    He, Yiling
    Li, Muyuan
    Zhang, Xin
    Ran, Linghua
    Chao, Chuzhi
    Zhang, Baoqin
    ICIEA 2010: PROCEEDINGS OF THE 5TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOL 4, 2010, : 267 - +
  • [17] Principal Component Analysis of Thermographic Data
    Winfree, William P.
    Cramer, K. Elliott
    Zalameda, Joseph N.
    Howell, Patricia A.
    Burke, Eric R.
    THERMOSENSE: THERMAL INFRARED APPLICATIONS XXXVII, 2015, 9485
  • [18] Principal component analysis of genetic data
    Reich, David
    Price, Alkes L.
    Patterson, Nick
    NATURE GENETICS, 2008, 40 (05) : 491 - 492
  • [19] Principal component analysis with autocorrelated data
    Zamprogno, Bartolomeu
    Reisen, Valderio A.
    Bondon, Pascal
    Aranda Cotta, Higor H.
    Reis Jr, Neyval C.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (12) : 2117 - 2135
  • [20] PRINCIPAL COMPONENT ANALYSIS OF EPIDEMIOLOGICAL DATA
    OSAKI, J
    ISHII, F
    IWAMOTO, S
    SHINBO, S
    BIOMETRICS, 1982, 38 (04) : 1101 - 1101