A High Accurate Multiple Classifier System for Entity Resolution Using Resampling and Ensemble Selection

被引:2
|
作者
Zhou Xing [1 ]
Diao Xingchun [1 ]
Cao Jianjun [1 ]
机构
[1] PLA Univ Sci & Technol, Nanjing 210007, Jiangsu, Peoples R China
关键词
D O I
10.1155/2015/630176
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Classifiers are often used in entity resolution to classify record pairs into matches, nonmatches, and possible matches, the performance of classifiers is directly related to the performance of entity resolution. In this paper, we develop a multiple classifier system using resampling and ensemble selection. We make full use of the characteristics of entity resolution to distinguish ambiguous instances before classification, so that the algorithm can focus on the ambiguous instances in parallel. Instead of developing an empirical optimal resampling ratio, we vary the ratio in a range to generate multiple resampled data. Further, we use the resampled data to train multiple classifiers and then use ensemble selection to select the best classifiers subset, which is also the best resampling ratio combination. Empirical study shows our method has a relatively high accuracy compared to other state-of-the-art multiple classifiers systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] High-resolution digital resampling using vector rational filters
    Khriji, L
    Cheikh, FA
    Gabbouj, M
    OPTICAL ENGINEERING, 1999, 38 (05) : 893 - 901
  • [42] Entity identification for heterogeneous database integration - a multiple classifier system approach and empirical evaluation
    Zhao, HM
    Ram, S
    INFORMATION SYSTEMS, 2005, 30 (02) : 119 - 132
  • [43] Multiple Classifier System for Urban Area's Extraction from High Resolution Remote Sensing Imagery
    Bedawi, Safaa M.
    Kamel, Mohamed S.
    IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT II: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, 2011, 6754 : 307 - 316
  • [44] McPAD: A multiple classifier system for accurate payload-based anomaly detection
    Perdisci, Roberto
    Ariu, Davide
    Fogla, Prahlad
    Giacinto, Giorgio
    Lee, Wenke
    COMPUTER NETWORKS, 2009, 53 (06) : 864 - 881
  • [45] BIO-INSPIRED ENSEMBLE FEATURE SELECTION (BIEFS) AND ENSEMBLE MULTIPLE DEEP LEARNING (EMDL) CLASSIFIER FOR BREAST CANCER DIAGNOSIS
    Priya, R. S. Padma
    Vadivu, P. Senthil
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 483 - 499
  • [46] Automatic Change Detection in High-Resolution Remote Sensing Images by Using a Multiple Classifier System and Spectral-Spatial Features
    Tan, Kun
    Jin, Xiao
    Plaza, Antonio
    Wang, Xuesong
    Xiao, Liang
    Du, Peijun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (08) : 3439 - 3451
  • [47] Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods
    Akkasi, Abbas
    Varoglu, Ekrem
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (06) : 1327 - 1338
  • [48] A novel cascade ensemble classifier system with a high recognition performance on handwritten digits
    Zhang, Ping
    Bui, Tien D.
    Suen, Ching Y.
    PATTERN RECOGNITION, 2007, 40 (12) : 3415 - 3429
  • [49] Recognizing Arabic handwritten words using multiple features and classifier selection
    Aiadi, Oussama
    Korichi, Aicha
    Kherfi, Mohammed Lamine
    2019 4TH INTERNATIONAL CONFERENCE ON NETWORKING AND ADVANCED SYSTEMS (ICNAS 2019), 2019, : 106 - 110
  • [50] A fast intrusion detection system based on swift wrapper feature selection and speedy ensemble classifier
    Zorarpaci, Ezgi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133