Real-Time Machine Learning Competition on Data Streams at the IEEE Big Data 2019

被引:0
|
作者
Boulegane, Dihia [1 ,2 ]
Radulovic, Nedeljko [1 ]
Bifet, Albert [1 ,3 ]
Fievet, Ghislain [4 ]
Sohn, Jimin [5 ]
Nam, Yeonwoo [5 ]
Yu, Seojeong [5 ]
Choi, Dong-Wan [5 ]
机构
[1] Telecom Paris, IP Paris, LTCI, Paris, France
[2] Orange Labs, Grenoble, France
[3] Univ Waikato, Hamilton, New Zealand
[4] Craft Ai, Paris, France
[5] Inha Univ, Incheon, South Korea
关键词
Stream Data Mining; Machine Learning Competition; Information Flow Processing; Internet of Things;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present the competition "Real-time Machine Learning Competition on Data Streams", a BigData Cup Challenge of the IEEE Big Data 2019 conference. Data streams, such as data originated from sensors, have increasingly gained the interest of researchers and companies and are currently widely studied in data science. Companies in the telecommunication and energy industries are trying to exploit these data and get real-time insights on their services and equipment. In order to extract valuable knowledge from data streams, one must be able to analyze the data as they arrive and make meaningful predictions. For this purpose, we use fast incremental learners. There already exists a great community that is organizing various competitions on machine learning tasks for batch learners. Our goal was to introduce the same approach to engage the whole community in solving essential problems in data stream mining. We performed a new kind of data science competition based on a real-time prediction setting, using a novel competition platform on data streams. The examples to predict were released in real-time, and the predictions had also to be submitted in real-lime. To the best of our knowledge, this was the first data science competition conducted in real-lime. The task of the competition was to predict network activity, and the data has been provided by one of our partner companies.
引用
收藏
页码:3493 / 3497
页数:5
相关论文
共 50 条
  • [1] A distributed real-time recommender system for big data streams
    Hazem, Heidy
    Awad, Ahmed
    Yousef, Ahmed Hassan
    AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (08)
  • [2] Big Data Real Time Ingestion and Machine Learning
    Pal, Gautam
    Li, Gangmin
    Atkinson, Katie
    2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 25 - 31
  • [3] A Scalable Machine Learning Online Service for Big Data Real-Time Analysis
    Baldominos, Alejandro
    Albacete, Esperanza
    Saez, Yago
    Isasi, Pedro
    2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIG DATA (CIBD), 2014, : 112 - 119
  • [4] A New Emulation Platform for Real-time Machine Learning in Substance Use Data Streams
    Bruendl, Stefan A.
    Fang, Hua
    Ngo, Hieu
    Boyer, Edward W.
    Wang, Honggang
    2020 IEEE 21ST INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2020), 2020, : 325 - 332
  • [5] Digital Data Streams: CREATING VALUE FROM THE REAL-TIME FLOW OF BIG DATA
    Pigni, Federico
    Piccoli, Gabriele
    Watson, Richard
    CALIFORNIA MANAGEMENT REVIEW, 2016, 58 (03) : 5 - 25
  • [6] On the locality of Java 8 streams in real-time big data applications
    Chan, Yu
    Gray, Ian
    Wellings, Andy
    Audsley, Neil
    ACM International Conference Proceeding Series, 2014, 2014-October (October): : 20 - 28
  • [7] Survey on Real-time Anomaly Detection Technology for Big Data Streams
    Luo, Yuanvan
    Du, Xuehui
    Sun, Yi
    PROCEEDINGS OF 2018 12TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2018, : 26 - 30
  • [8] Distributed Real-Time Sentiment Analysis for Big Data Social Streams
    Rahnama, Amir Hossein Akhavan
    2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 789 - 794
  • [9] Research on real-time outlier detection over big data streams
    Chen, Liangchen
    Gao, Shu
    Cao, Xiufeng
    International Journal of Computers and Applications, 2020, 42 (01): : 93 - 101
  • [10] Real-time traffic congestion prediction using big data and machine learning techniques
    Chawla, Priyanka
    Hasurkar, Rutuja
    Bogadi, Chaithanya Reddy
    Korlapati, Naga Sindhu
    Rajendran, Rajasree
    Ravichandran, Sindu
    Tolem, Sai Chaitanya
    Gao, Jerry Zeyu
    WORLD JOURNAL OF ENGINEERING, 2024, 21 (01) : 140 - 155