Statistical models for time sequences data mining

被引:3
|
作者
Ting, JK [1 ]
Ng, MK [1 ]
Rong, HQ [1 ]
Huang, JZ [1 ]
机构
[1] Univ Hong Kong, E Business Technol Inst, Hong Kong, Hong Kong, Peoples R China
关键词
autoregression models; prediction; clustering;
D O I
10.1109/CIFER.2003.1196281
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
In this paper, we present an adaptive modelling technique for studying past behaviors of objects and predicting the near future events. Our approach is to define a sliding window (of different window sizes) over a time sequence and build autoregression models from subsequences in different windows. The models are representations of past behaviors of the sequence objects. We can use the AR coefficients as features to index subsequences to facilitate the query of subsequences with similar behaviors. We can use a clustering algorithm to group time sequences on their similarity in the feature space. We can also use the AR models for prediction within different windows. Our experiments show that the adaptive model can give better prediction than non-adaptive models.
引用
收藏
页码:347 / 354
页数:8
相关论文
共 50 条
  • [1] Statistical models for data mining
    Giudici, P
    Heckerman, D
    Whittaker, J
    DATA MINING AND KNOWLEDGE DISCOVERY, 2001, 5 (03) : 163 - 165
  • [2] Statistical Models for Data Mining
    Paolo Giudici
    David Heckerman
    Joe Whittaker
    Data Mining and Knowledge Discovery, 2001, 5 : 163 - 165
  • [3] Peptide vaccine models using statistical data mining
    Joshi, Rajani R.
    PROTEIN AND PEPTIDE LETTERS, 2007, 14 (06): : 536 - 542
  • [4] Intelligent data recognition of DNA sequences using statistical models
    Keinduangjun, J
    Piamsa-nga, P
    Poovorawan, Y
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 630 - 635
  • [5] Statistical data mining
    Banks, David L.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01): : 9 - 25
  • [6] Analysis of the statistical characteristics in mining of frequent sequences
    Tumasonis, R
    Dzemyda, G
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2005, : 377 - 386
  • [7] Statistical methods and models in the analysis of time to event data
    Lee, Minjung
    Han, Junhee
    ANNALS OF TRANSLATIONAL MEDICINE, 2020, 8 (04)
  • [8] Statistical inference and data mining
    Glymour, C
    Madigan, D
    Pregibon, D
    Smyth, P
    COMMUNICATIONS OF THE ACM, 1996, 39 (11) : 35 - 41
  • [9] A statistical perspective on data mining
    Hosking, JRM
    Pednault, EPD
    Sudan, M
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 1997, 13 (2-3): : 117 - 134
  • [10] Statistical perspective on data mining
    IBM T.J. Watson Research Cent, Yorktown Heights, United States
    Future Gener Comput Syst, 2-3 (117-134):