Consistency measure based simultaneous feature selection and instance purification for multimedia traffic classification

被引：8

作者：

Wu, Zheng ^{[1
]}

Dong, Yu-ning ^{[1
]}

Wei, Hua-Liang ^{[2
]}

Tian, Wei ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Coll Telecommun & Informat Engn, Nanjing 210003, Peoples R China

[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, S Yorkshire, England

来源：

COMPUTER NETWORKS | 2020年 / 173卷 / 173期

基金：

中国国家自然科学基金;

关键词：

Traffic classification; Feature selection; Instance purification; Flow fragment; EFFICIENT; PREDICTION; ALGORITHM;

D O I：

10.1016/j.comnet.2020.107190

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the increase of multimedia traffic, the implementation of fast and accurate classification has become an important issue. Besides, a manually captured dataset contains certain noise and mislabeled instances, which influences the accuracy of classifier to some extent. Motivated by these observations, a novel feature selection and instance purification (FS&IP) method based on consistency measure is proposed. It utilizes a linear consistency-constrained algorithm for feature selection. In each round of iteration, it removes the instance with the minor labels in every pattern subset. Our method has three desirable properties: 1) It can simultaneously achieve feature selection and data purification. 2) when purifying instance, it doesn't need to annotate the noisy instance with learned labels; that is because it is an unsupervised method in terms of data purification. 3) through data purification, it is able to obtain a minimal feature subset on condition of maintaining accuracy. In addition, the proposed method can be used to discover a new discriminative feature based on linking behaviors called the flow fragment (F - Frag), which can reflect important information among the complex and multitudinous packet communication behaviors. The experimental results over six different datasets demonstrate the advantages of the proposed technique compared to six existing methods, and the discriminative power of the new flow fragment feature.

引用

页数：13

共 50 条

[1] Feature selection using consistency measure
Dash, M
Liu, H
Motoda, H
DISCOVERY SCIENCE, PROCEEDINGS, 1999, 1721 : 319 - 320
[2] A Weighted Feature Selection Method for Instance-Based Classification
Agre, Gennady
Dzhondzhorov, Anton
ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 : 14 - 25
[3] Topic-Based Instance and Feature Selection in Multilabel Classification
Ma, Jianghong
Chow, Tommy W. S.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 315 - 329
[4] FISA: Feature-based instance selection for imbalanced text classification
Sun, Aixin
Lim, Ee-Peng
Benatallah, Boualem
Hassan, Mahbub
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 250 - 254
[5] Evolutionary feature and instance selection for traffic sign recognition
Chen, Zong-Yao
Lin, Wei-Chao
Ke, Shih-Wen
Tsai, Chih-Fong
COMPUTERS IN INDUSTRY, 2015, 74 : 201 - 211
[6] A Fast and Accurate Feature Selection Algorithm Based on Binary Consistency Measure
Shin, Kilho
Miyazaki, Seiya
COMPUTATIONAL INTELLIGENCE, 2016, 32 (04) : 646 - 667
[7] A scalable approach to simultaneous evolutionary instance and feature selection
Garcia-Pedrajas, Nicolas
de Haro-Garcia, Aida
Perez-Rodriguez, Javier
INFORMATION SCIENCES, 2013, 228 : 150 - 174
[8] A Scalable Memetic Algorithm for Simultaneous Instance and Feature Selection
Garcia-Pedrajas, Nicolas
de Haro-Garcia, Aida
Perez-Rodriguez, Javier
EVOLUTIONARY COMPUTATION, 2014, 22 (01) : 1 - 45
[9] A Novel Genetic Algorithm Approach to Simultaneous Feature Selection and Instance Selection
Albuquerque, Inti Mateus Resende
Bach Hoai Nguyen
Xue, Bing
Zhang, Mengjie
2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 616 - 623
[10] Feature selection for optimizing traffic classification
Zhang, Hongli
Lu, Gang
Qassrawi, Mahmoud T.
Zhang, Yu
Yu, Xiangzhan
COMPUTER COMMUNICATIONS, 2012, 35 (12) : 1457 - 1471

← 1 2 3 4 5 →