Noval Stream Data Mining Framework under the Background of Big Data

被引:5
|
作者
Yi, Wenquan [1 ]
Teng, Fei [2 ]
Xu, Jianfeng [2 ]
机构
[1] Jiang Xi Vocat Coll Finance & Econ, Jiujiang, Peoples R China
[2] Nanchang Univ, Software Coll, Nanchang, Jiangxi, Peoples R China
基金
美国国家科学基金会;
关键词
Stream data; data mining; clustering; classification; framework;
D O I
10.1515/cait-2016-0053
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stream data mining has been a hot topic for research in the data mining research area in recent years, as it has an extensive application prospect in big data ages. Research on stream data mining mainly focuses on frequent item sets mining, clustering and classification. However, traditional steam data mining methods are not effective enough for handling high dimensional data set because these methods are not fit for the characteristics of stream data. So, these traditional stream data mining methods need to be enhanced for big data applications. To resolve this issue, a hybrid framework is proposed for big steam data mining. In this framework, online and offline model are organized for different tasks, the interior of each model is rationally organized according to different mining tasks. This framework provides a new research idea and macro perspective for stream data mining under the background of big data.
引用
收藏
页码:69 / 77
页数:9
相关论文
共 50 条
  • [41] A Parallel DistributedWeka Framework for Big Data Mining using Spark
    Koliopoulos, Aris-Kyriakos
    Yiapanis, Paraskevas
    Tekiner, Firat
    Nenadic, Goran
    Keane, John
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 9 - 16
  • [42] FSBD: A Framework for Scheduling of Big Data Mining in Cloud Computing
    Ismail, Leila
    Masud, Mohammad M.
    Khan, Latifur
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 513 - 520
  • [43] Parallel and distributed clustering framework for big spatial data mining
    Bendechache, Malika
    Tari, A-Kamel
    Kechadi, M-Tahar
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2019, 34 (06) : 671 - 689
  • [44] A Fuzzy Mining Approach for Energy Efficiency in a Big Data Framework
    Fernandez-Basso, Carlos
    Dolores Ruiz, M.
    Martin-Bautista, Maria J.
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (11) : 2747 - 2758
  • [45] Data stream classification and big data analytics
    Krawczyk, Bartosz
    Wozniak, Michal
    Stefanowski, Jerzy
    NEUROCOMPUTING, 2015, 150 : 238 - 239
  • [46] An Efficient Framework of Data Mining and its Analytics on Massive Streams of Big Data Repositories
    Disha, D. N.
    Sowmya, B. J.
    Chetan
    Seema, S.
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER), 2016, : 195 - 200
  • [47] A dynamic ensemble learning based data mining framework for medical imbalanced big data
    Rithani, M.
    Kumar, R. Prasanna
    Ali, Altalbe
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [48] Clustering Data Stream Under a Belief Function Framework
    Bahri, Maroua
    Elouedi, Zied
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [49] Research on Application of High - efficiency Data Mining to Electric Power Data under Big Data
    Ge, Weichun
    Lei, Zhenjiang
    Li, Zhao
    Hu, Xiaolei
    Liu, Jinyuan
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (CAAI 2017), 2017, 134 : 400 - 405
  • [50] A Secure Big Data Stream Analytics Framework for Disaster Management on the Cloud
    Puthal, Deepak
    Nepal, Surya
    Ranjan, Rajiv
    Chen, Jinjun
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 1218 - 1225