A High Accurate Multiple Classifier System for Entity Resolution Using Resampling and Ensemble Selection

被引:2
|
作者
Zhou Xing [1 ]
Diao Xingchun [1 ]
Cao Jianjun [1 ]
机构
[1] PLA Univ Sci & Technol, Nanjing 210007, Jiangsu, Peoples R China
关键词
D O I
10.1155/2015/630176
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Classifiers are often used in entity resolution to classify record pairs into matches, nonmatches, and possible matches, the performance of classifiers is directly related to the performance of entity resolution. In this paper, we develop a multiple classifier system using resampling and ensemble selection. We make full use of the characteristics of entity resolution to distinguish ambiguous instances before classification, so that the algorithm can focus on the ambiguous instances in parallel. Instead of developing an empirical optimal resampling ratio, we vary the ratio in a range to generate multiple resampled data. Further, we use the resampled data to train multiple classifiers and then use ensemble selection to select the best classifiers subset, which is also the best resampling ratio combination. Empirical study shows our method has a relatively high accuracy compared to other state-of-the-art multiple classifiers systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Weighted Vote Based Classifier Ensemble Selection Using Genetic Algorithm for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 256 - +
  • [2] Optimal resampling and classifier prototype selection in classifier ensembles using genetic algorithms
    Hakan Altinçay
    Pattern Analysis and Applications, 2004, 7 : 285 - 295
  • [3] Optimal resampling and classifier prototype selection in classifier ensembles using genetic algorithms
    Altinçay, H
    PATTERN ANALYSIS AND APPLICATIONS, 2004, 7 (03) : 285 - 295
  • [4] Optimal resampling and classifier prototype selection in classifier ensembles using genetic algorithms
    Altinçay H.
    Pattern Analysis and Applications, 2004, 7 (3) : 285 - 295
  • [5] A Method for Entity Resolution in High Dimensional Data Using Ensemble Classifiers
    Liu Yi
    Diao Xing-chun
    Cao Jian-jun
    Zhou Xing
    Shang Yu-ling
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [6] Classifier ensemble selection for language verification system
    Liu, ChangE
    Xia, Shanghong
    Liu Jia
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 505 - +
  • [7] Combining multiple classifiers using vote based classifier ensemble technique for named entity recognition
    Saha, Sriparna
    Ekbal, Asif
    DATA & KNOWLEDGE ENGINEERING, 2013, 85 : 15 - 39
  • [8] ACCURATE INFERENCE OF UNSEEN COMBINATIONS OF MULTIPLE ROOTCAUSES WITH CLASSIFIER ENSEMBLE
    Zhang, Xuan
    Xiong, Longxiang
    Sun, Ningyuan
    Wang, Mingxia
    Tang, Hao
    Zhao, Yanxing
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9306 - 9310
  • [9] Classifier Ensemble using Multiobjective Optimization for Named Entity Recognition
    Ekbal, Asif
    Saha, Sriparna
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 783 - 788
  • [10] Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition
    Ekbal, Asif
    Saha, Sriparna
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (02) : 143 - 166