A High Accurate Multiple Classifier System for Entity Resolution Using Resampling and Ensemble Selection

被引:2
|
作者
Zhou Xing [1 ]
Diao Xingchun [1 ]
Cao Jianjun [1 ]
机构
[1] PLA Univ Sci & Technol, Nanjing 210007, Jiangsu, Peoples R China
关键词
D O I
10.1155/2015/630176
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Classifiers are often used in entity resolution to classify record pairs into matches, nonmatches, and possible matches, the performance of classifiers is directly related to the performance of entity resolution. In this paper, we develop a multiple classifier system using resampling and ensemble selection. We make full use of the characteristics of entity resolution to distinguish ambiguous instances before classification, so that the algorithm can focus on the ambiguous instances in parallel. Instead of developing an empirical optimal resampling ratio, we vary the ratio in a range to generate multiple resampled data. Further, we use the resampled data to train multiple classifiers and then use ensemble selection to select the best classifiers subset, which is also the best resampling ratio combination. Empirical study shows our method has a relatively high accuracy compared to other state-of-the-art multiple classifiers systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Efficient Twitter Sentiment Analysis System with Feature Selection and Classifier Ensemble
    Fouad, Mohammed M.
    Gharib, Tarek F.
    Mashat, Abdulfattah S.
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 516 - 527
  • [22] Using multiple classifier behavior to develop a dynamic outlier ensemble
    Ping Yuan
    Biao Wang
    Zhizhong Mao
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 501 - 513
  • [23] Using multiple classifier behavior to develop a dynamic outlier ensemble
    Yuan, Ping
    Wang, Biao
    Mao, Zhizhong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 501 - 513
  • [24] Software Cost Estimation using Stacked Ensemble Classifier and Feature Selection
    Al-Karak, Mustafa Hammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 183 - 189
  • [25] Using Meta-learning in the Selection of the Combination Method of a Classifier Ensemble
    da Silva, Robercy Alves
    de Paula Canuto, Anne Magaly
    Xavier Junior, Joao Carlos
    Ludermir, Teresa Bernarda
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [26] An Ensemble Classifier Based on Feature Selection Using Ant Colony Optimization
    Cao, Jianjun
    Lv, Guojun
    Shang, Yuling
    Weng, Nianfeng
    Chang, Chen
    Liu, Yi
    2018 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2018,
  • [27] OPTIMIZATION of ENSEMBLE Classifier SYSTEM BASED ON MULTIPLE OBJECTIVES GENETIC ALGORITHM
    Tien Thanh Nguyen
    Liew, Alan Wee-Chung
    Xuan Cuong Pham
    Mal Phuong Nguyen
    PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 46 - 51
  • [28] A CASCADED ENSEMBLE CLASSIFIER FOR OBJECT SEGMENTATION IN HIGH RESOLUTION POLARIMETRIC SAR DATA
    Jaeger, Marc
    Reigber, Andreas
    Hellwich, Olaf
    2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014, : 1029 - 1032
  • [29] Building an efficient intrusion detection system based on feature selection and ensemble classifier
    Zhou, Yuyang
    Cheng, Guang
    Jiang, Shanqing
    Dai, Mian
    COMPUTER NETWORKS, 2020, 174
  • [30] Attribute Selection and Ensemble Classifier based Novel Approach to Intrusion Detection System
    Kunal
    Dua, Mohit
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 2191 - 2199