Consistency measure based simultaneous feature selection and instance purification for multimedia traffic classification

被引:8
|
作者
Wu, Zheng [1 ]
Dong, Yu-ning [1 ]
Wei, Hua-Liang [2 ]
Tian, Wei [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, S Yorkshire, England
基金
中国国家自然科学基金;
关键词
Traffic classification; Feature selection; Instance purification; Flow fragment; EFFICIENT; PREDICTION; ALGORITHM;
D O I
10.1016/j.comnet.2020.107190
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the increase of multimedia traffic, the implementation of fast and accurate classification has become an important issue. Besides, a manually captured dataset contains certain noise and mislabeled instances, which influences the accuracy of classifier to some extent. Motivated by these observations, a novel feature selection and instance purification (FS&IP) method based on consistency measure is proposed. It utilizes a linear consistency-constrained algorithm for feature selection. In each round of iteration, it removes the instance with the minor labels in every pattern subset. Our method has three desirable properties: 1) It can simultaneously achieve feature selection and data purification. 2) when purifying instance, it doesn't need to annotate the noisy instance with learned labels; that is because it is an unsupervised method in terms of data purification. 3) through data purification, it is able to obtain a minimal feature subset on condition of maintaining accuracy. In addition, the proposed method can be used to discover a new discriminative feature based on linking behaviors called the flow fragment (F - Frag), which can reflect important information among the complex and multitudinous packet communication behaviors. The experimental results over six different datasets demonstrate the advantages of the proposed technique compared to six existing methods, and the discriminative power of the new flow fragment feature.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Feature selection using consistency measure
    Dash, M
    Liu, H
    Motoda, H
    DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 319 - 320
  • [2] A Weighted Feature Selection Method for Instance-Based Classification
    Agre, Gennady
    Dzhondzhorov, Anton
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 : 14 - 25
  • [3] Topic-Based Instance and Feature Selection in Multilabel Classification
    Ma, Jianghong
    Chow, Tommy W. S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 315 - 329
  • [4] FISA: Feature-based instance selection for imbalanced text classification
    Sun, Aixin
    Lim, Ee-Peng
    Benatallah, Boualem
    Hassan, Mahbub
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 250 - 254
  • [5] Evolutionary feature and instance selection for traffic sign recognition
    Chen, Zong-Yao
    Lin, Wei-Chao
    Ke, Shih-Wen
    Tsai, Chih-Fong
    COMPUTERS IN INDUSTRY, 2015, 74 : 201 - 211
  • [6] A Fast and Accurate Feature Selection Algorithm Based on Binary Consistency Measure
    Shin, Kilho
    Miyazaki, Seiya
    COMPUTATIONAL INTELLIGENCE, 2016, 32 (04) : 646 - 667
  • [7] A scalable approach to simultaneous evolutionary instance and feature selection
    Garcia-Pedrajas, Nicolas
    de Haro-Garcia, Aida
    Perez-Rodriguez, Javier
    INFORMATION SCIENCES, 2013, 228 : 150 - 174
  • [8] A Scalable Memetic Algorithm for Simultaneous Instance and Feature Selection
    Garcia-Pedrajas, Nicolas
    de Haro-Garcia, Aida
    Perez-Rodriguez, Javier
    EVOLUTIONARY COMPUTATION, 2014, 22 (01) : 1 - 45
  • [9] A Novel Genetic Algorithm Approach to Simultaneous Feature Selection and Instance Selection
    Albuquerque, Inti Mateus Resende
    Bach Hoai Nguyen
    Xue, Bing
    Zhang, Mengjie
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 616 - 623
  • [10] Feature selection for optimizing traffic classification
    Zhang, Hongli
    Lu, Gang
    Qassrawi, Mahmoud T.
    Zhang, Yu
    Yu, Xiangzhan
    COMPUTER COMMUNICATIONS, 2012, 35 (12) : 1457 - 1471