Using unlabelled data to update classification rules with applications in food authenticity studies

被引:64
|
作者
Dean, N
Murphy, TB [1 ]
Downey, G
机构
[1] Univ Dublin Trinity Coll, Dept Stat, Dublin 2, Ireland
[2] Univ Washington, Seattle, WA 98195 USA
[3] TEAGASC, Natl Food Ctr, Dublin 15, Ireland
关键词
classification; discriminant analysis; food authenticity studies; model-based clustering; near infra-red spectroscopy;
D O I
10.1111/j.1467-9876.2005.00526.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
An authentic food is one that is what it purports to be. Food processors and consumers need to be assured that, when they pay for a specific product or ingredient, they are receiving exactly what they pay for. Classification methods are an important tool in food authenticity studies where they are used to assign food samples of unknown type to known types. A classification method is developed where the classification rule is estimated by using both the labelled and the unlabelled data, in contrast with many classical methods which use only the labelled data for estimation. This methodology models the data as arising from a Gaussian mixture model with parsimonious covariance structure, as is done in model-based clustering. A missing data formulation of the mixture model is used and the models are fitted by using the EM and classification EM algorithms. The methods are applied to the analysis of spectra of food-stuffs recorded over the visible and near infra-red wavelength range in food authenticity studies. A comparison of the performance of model-based discriminant analysis and the method of classification proposed is given. The classification method proposed is shown to yield very good misclassification rates. The correct classification rate was observed to be as much as 15% higher than the correct classification rate for model-based discriminant analysis.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Using labelled and unlabelled data to train a multilayer perceptron for colour classification in graphic arts
    Verikas, A
    Gelzinis, A
    Malmqvist, K
    [J]. MULTIPLE APPROACHES TO INTELLIGENT SYSTEMS, PROCEEDINGS, 1999, 1611 : 550 - 559
  • [2] Finding fuzzy classification rules using data mining techniques
    Hu, YC
    Chen, RS
    Tzeng, GH
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (1-3) : 509 - 519
  • [3] Creating fuzzy rules for image classification using biased data clustering
    Santos, R
    [J]. NONLINEAR IMAGE PROCESSING X, 1999, 3646 : 151 - 159
  • [4] Data Mining Using Neural Networks in the form of Classification Rules: A Review
    Chakraborty, Manomita
    Biswas, Saroj Kumar
    Purkayastha, Biswajit
    [J]. 2020 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NETWORKS (CINE 2020), 2020,
  • [5] Logical rules for classification of medical data using ontogenic neural algorithm
    Duch, W
    Adamczak, R
    Grabczewski, K
    [J]. PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 831 - 834
  • [6] Joint Feature Selection and Classification for Positive Unlabelled Multi-Label Data Using Weighted Penalized Empirical Risk Minimization
    TEISSEYRE, P. A. W. E. L.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2022, 32 (02) : 311 - 322
  • [7] VARIABLE SELECTION AND UPDATING IN MODEL-BASED DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA WITH FOOD AUTHENTICITY APPLICATIONS
    Murphy, Thomas Brendan
    Dean, Nema
    Raftery, Adrian E.
    [J]. ANNALS OF APPLIED STATISTICS, 2010, 4 (01): : 396 - 421
  • [8] Feature based classification using hierarchical fuzzy rules for the analysis of hyperspectral data
    Yoon, CR
    Kim, HG
    Kim, KO
    [J]. IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 3856 - 3859
  • [9] Evolving Mamdani Fuzzy Rules Using Swarm Algorithms for Accurate Data Classification
    Abdulgader, Musbah
    Kaur, Devinder
    [J]. IEEE ACCESS, 2019, 7 : 175907 - 175916
  • [10] Fusion of Elevation Data into Satellite Image Classification Using Refined Production Rules
    Al Momani, Bilal
    Morrow, Philip
    McClean, Sally
    [J]. IMAGE ANALYSIS AND RECOGNITION: 8TH INTERNATIONAL CONFERENCE, ICIAR 2011, PT I, 2011, 6753 : 211 - 220