Evolutionary model building under streaming data for classification tasks: opportunities and challenges

被引:21
|
作者
Heywood, Malcolm I. [1 ]
机构
[1] Dalhousie Univ, Fac Comp Sci, Halifax, NS, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Streaming data; Non-stationary processes; Dynamic environment; Imbalanced data; Task decomposition; Ensemble learning; Active learning; Evolvability; Diversity; Memory; PROBLEM DECOMPOSITION; NOVELTY DETECTION; NEURAL NETWORKS; ENSEMBLE; CLASSIFIERS; ENVIRONMENT; MECHANISMS; ALGORITHMS; ADAPTATION; PREDICTION;
D O I
10.1007/s10710-014-9236-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Streaming data analysis potentially represents a significant shift in emphasis from schemes historically pursued for offline (batch) approaches to the classification task. In particular, a streaming data application implies that: (1) the data itself has no formal 'start' or 'end'; (2) the properties of the process generating the data are non-stationary, thus models that function correctly for some part(s) of a stream may be ineffective elsewhere; (3) constraints on the time to produce a response, potentially implying an anytime operational requirement; and (4) given the prohibitive cost of employing an oracle to label a stream, a finite labelling budget is necessary. The scope of this article is to provide a survey of developments for model building under streaming environments from the perspective of both evolutionary and non-evolutionary frameworks. In doing so, we bring attention to the challenges and opportunities that developing solutions to streaming data classification tasks are likely to face using evolutionary approaches.
引用
收藏
页码:283 / 326
页数:44
相关论文
共 50 条
  • [31] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Sun, Bo
    Chen, Haiyan
    Wang, Jiandong
    Xie, Hua
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (02) : 331 - 350
  • [32] Evolutionary under-sampling based bagging ensemble method for imbalanced data classification
    Bo Sun
    Haiyan Chen
    Jiandong Wang
    Hua Xie
    Frontiers of Computer Science, 2018, 12 : 331 - 350
  • [33] The Mechanism and Challenges of Validating a Building Information Model regarding data exchange standards
    Lee, Yong-Cheol
    Solihin, Wawan
    Eastman, Charles M.
    AUTOMATION IN CONSTRUCTION, 2019, 100 : 118 - 128
  • [34] Opportunities and challenges for big data analytics in US higher education: A conceptual model for implementation
    Attaran, Mohsen
    Stark, John
    Stotler, Derek
    INDUSTRY AND HIGHER EDUCATION, 2018, 32 (03) : 169 - 182
  • [35] Applying a common data model to Asian databases for multinational pharmacoepidemiologic studies: opportunities and challenges
    Lai, Edward Chia-Cheng
    Ryan, Patrick
    Zhang, Yinghong
    Schuemie, Martijn
    Hardy, N. Chantelle
    Kamijima, Yukari
    Kimura, Shinya
    Kubota, Kiyoshi
    Man, Kenneth K. C.
    Cho, Soo Yeon
    Park, Rae Woong
    Stang, Paul
    Su, Chien-Chou
    Wong, Ian C. K.
    Kao, Yea-Huei Yang
    Setoguchi, Soko
    CLINICAL EPIDEMIOLOGY, 2018, 10 : 875 - 885
  • [36] Lightweight Conditional Model Extrapolation for Streaming Data under Class-Prior Shift
    Tomaszewska, Paulina
    Lampert, Christoph H.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2128 - 2134
  • [37] Data sharing and privacy issues in neuroimaging research: Opportunities, obstacles, challenges, and monsters under the bed
    White, Tonya
    Blok, Elisabet
    Calhoun, Vince D.
    HUMAN BRAIN MAPPING, 2022, 43 (01) : 278 - 291
  • [38] Enhancing Fairness in Classification Tasks with Multiple Variables: A Data- and Model-Agnostic Approach
    d'Aloisio, Giordano
    Stilo, Giovanni
    Di Marco, Antinisca
    D'Angelo, Andrea
    ADVANCES IN BIAS AND FAIRNESS IN INFORMATION RETRIEVAL, BIAS 2022, 2022, 1610 : 117 - 129
  • [39] Efficient Data Presentation Method for Building User Preference Model Using Interactive Evolutionary Computation
    Hara, Akira
    Kushida, Jun-ichi
    Yasuda, Ryohei
    Takahama, Tetsuyuki
    INTELLIGENT DECISION TECHNOLOGIES, KES-IDT 2021, 2021, 238 : 583 - 593
  • [40] A human tissue and data resource: an overview of opportunities, challenges, and development of a provider/researcher partnership model
    Kort, EJ
    Campbell, B
    Resau, JH
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2003, 70 (02) : 137 - 150