Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy

被引:16
|
作者
Shi, Zhongwei [1 ]
Wen, Yimin [2 ]
Feng, Chao [1 ]
Zhao, Hai [3 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Engn, Guilin, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Key Lab Trusted Software, Guilin, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
关键词
concept drift; multi-label; label dependence; data stream; entropy;
D O I
10.1109/ICDMW.2014.92
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many real-world applications involve multi-label data streams, so effective concept drift detection methods should be able to consider the unique properties of multi-label stream data, such as label dependence. To deal with these challenges, we proposed an efficient and effective method to detect concept drift based on label grouping and entropy for multi-label data. Two methods are proposed to group the set of class labels into different subsets and a multi-label version of entropy was adjusted to measure the distribution of multi-label data. Concept drift was detected by comparing the entropies of the older and the most recent data. The experiments are run on three synthetic datasets and two real-world datasets and the experimental results illustrate the better classification performance of the proposed method for detecting drift.
引用
收藏
页码:724 / 731
页数:8
相关论文
共 50 条
  • [21] Oversampling multi-label data based on natural neighbor and label correlation
    Liu, Bin
    Zhou, Ao
    Wei, Bingkun
    Wang, Jin
    Tsoumakas, Grigorios
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [22] Synthetic Oversampling of Multi-label Data Based on Local Label Distribution
    Liu, Bin
    Tsoumakas, Grigorios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 180 - 193
  • [23] Concurrent Multi-Label Prediction in Event Streams
    Shou, Xiao
    Gao, Tian
    Subramanian, Dharmashankar
    Bhattacharjya, Debarun
    Bennett, Kristin P.
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9820 - 9828
  • [24] Multi-label classification via multi-target regression on data streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    MACHINE LEARNING, 2017, 106 (06) : 745 - 770
  • [25] Multi-label classification via multi-target regression on data streams
    Aljaž Osojnik
    Panče Panov
    Sašo Džeroski
    Machine Learning, 2017, 106 : 745 - 770
  • [26] Multi-label Classification via Multi-target Regression on Data Streams
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 170 - 185
  • [27] Efficient Multi-Label Feature Selection Using Entropy-Based Label Selection
    Lee, Jaesung
    Kim, Dae-Won
    ENTROPY, 2016, 18 (11)
  • [28] On the Stratification of Multi-label Data
    Sechidis, Konstantinos
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 145 - 158
  • [29] RETRACTION: Incremental deep forest for multi-label data streams learning
    Liang, Shunpan
    Pan, Weiwei
    You, Dianlong
    Liu, Ze
    Yin, Ling
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [30] Multi-label feature selection algorithm based on information entropy
    Zhang, Zhenhai
    Li, Shining
    Li, Zhigang
    Chen, Hao
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (06): : 1177 - 1184