- [1] Hernandez M., Stolfo S., Real-world data is dirty: data cleansing and the merge/purge problem, Data Mining and Knowledge Discovery, 2, 1, pp. 9-37, (1998)
- [2] Lee M.L., Lu H.-J., Wang L.T., et al., Cleansing data for mining and warehousing, DEXA'99, pp. 751-760, (1999)
- [3] Zhu X., Wu X., Chen S., Eliminating class noise in large datasets, Proceedings of the 20th International Conference on Machine Learning (ICML), pp. 920-927, (2003)
- [4] Monge A., Elkan C., The field matching problem: algorithms and applications, Proc 2nd ACM SIGKDD Int'l Conf Knowledge Discovery and Data Mining, pp. 267-270, (1996)
- [5] Newcombe H.B., Kennedy J.M., Record linkage: making maximum use of the discriminating power of identifying information, Commun ACM (CACM), 5, 11, pp. 563-566, (1962)
- [6] Fellegi I.P., Sunter A.B., A theory for record linkage, Journal of the American Statistical Association, 64, 328, pp. 1183-1210, (1969)
- [7] Jaro M.A., Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida, Journal of the American Statistical Association, 84, 406, pp. 414-420, (1989)
- [8] Hernandez M.A., Stolfo S.J., The merge/purge problem for large databases, SIGMOD'95, pp. 127-138, (1995)
- [9] Yan S., Lee D., Kan M.-Y., et al., Adaptive sorted neighborhood methods for efficient record linkage, JCDL '07: Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, pp. 185-194, (2007)
- [10] Elmagarmid A.K., Ipeirotis P.G., Verykios V.S., Duplicate record detection: a survey, IEEE Transactions on Knowledge and Data Engineering, 19, 1, pp. 1-16, (2007)