Fused adjacency matrices to enhance information extraction: The beer benchmark

被引:8
|
作者
Cavallini, Nicola [1 ,2 ]
Savorani, Francesco [3 ]
Bro, Rasmus [2 ]
Cocchi, Marina [1 ]
机构
[1] Univ Modena & Reggio Emilia, Dipartimento Sci Chim & Geol, Via Campi 103, I-41125 Modena, MO, Italy
[2] Univ Copenhagen, Fac Sci, Dept Food Sci, Chemometr & Analyt Technol, Rolighedsvej 26, DK-1958 Frederiksberg C, Denmark
[3] Politecn Torino, Dept Appl Sci & Technol, Corso Duca Abruzzi 24, I-10129 Turin, TO, Italy
关键词
Data fusion; Adjacency matrix; Clustering; Data visualization; Spectroscopy; Beer; NUCLEAR-MAGNETIC-RESONANCE; ARTIFICIAL NEURAL-NETWORKS; PROJECTION PURSUIT; QUALITY-CONTROL; DATA-FUSION; MULTIVARIATE-ANALYSIS; PROCRUSTES ROTATION; SPECTROSCOPY; NMR; PROFILES;
D O I
10.1016/j.aca.2019.02.023
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Multivariate exploratory data analysis allows revealing patterns and extracting information from complex multivariate data sets. However, highly complex data may not show evident groupings or trends in the principal component space, e.g. because the variation of the variables are not grouped but rather continuous. In these cases, classical exploratory methods may not provide satisfactory results when the aim is to find distinct groupings in the data. To enhance information extraction in such situations, we propose a novel approach inspired by the concept of combining weak classifiers, but in the unsupervised context. The approach is based on the fusion of several adjacency matrices obtained by different distance measures on data from different analytical platforms. This paper is intended to present and discuss the potential of the approach through a benchmark data set of beer samples. The beer data were acquired using three spectroscopic techniques: Visible, near-Infrared and Nuclear Magnetic Resonance. The results of fusing the three data sets via the proposed approach are compared with those from the single data blocks (Visible, NIR and NMR) and from a standard mid-level data fusion methodology. It is shown that, with the suggested approach, groupings related to beer style and other features are efficiently recovered, and generally more evident. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:70 / 83
页数:14
相关论文
共 24 条
  • [1] GISB: A Benchmark for Geographic Map Information Extraction
    Martins, Pedro
    Cecilio, Jose
    Abbasi, Maryam
    Furtado, Pedro
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2016, 2016, 613 : 600 - 609
  • [2] Computer Analysis of Benign and Malignant Breast Specimen Images By Graph Extraction and Adjacency Matrices
    Boroujeni, Amir Momeni
    Gupta, Raavi
    Yousefi, Elham
    LABORATORY INVESTIGATION, 2015, 95 : 400A - 400A
  • [3] Computer Analysis of Benign and Malignant Breast Specimen Images By Graph Extraction and Adjacency Matrices
    Boroujeni, Amir Momeni
    Gupta, Raavi
    Yousefi, Elham
    MODERN PATHOLOGY, 2015, 28 : 400A - 400A
  • [4] The information content of the eigenvalues from modified adjacency matrices: Large scale and small scale correlations
    Benigni, R
    Passerini, L
    Pino, A
    Giuliani, A
    QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1999, 18 (05): : 449 - 455
  • [5] Inference Approach to Enhance a Portuguese Open Information Extraction
    Lima Sena, Cleiton Fernando
    Glauber, Rafael
    Claro, Daniela Barreiro
    ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2017, : 442 - 451
  • [6] AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark
    Friedrich, Niklas
    Gashteovski, Kiril
    Yu, Mingying
    Kotnis, Bhushan
    Lawrence, Carolin
    Niepert, Mathias
    Glavas, Goran
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2022, : 44 - 60
  • [7] Application of Text Rank Algorithm Fused With LDA in Information Extraction Model
    Wei, Yunbo
    Ding, Yongsheng
    IEEE ACCESS, 2023, 11 : 84301 - 84312
  • [8] TABLEX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables
    Desai, Harsh
    Kayal, Pratik
    Singh, Mayank
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 554 - 569
  • [9] WiRe57: A Fine-Grained Benchmark for Open Information Extraction
    Lechelle, William
    Gotti, Fabrizio
    Langlais, Philippe
    13TH LINGUISTIC ANNOTATION WORKSHOP (LAW XIII), 2019, : 6 - 15
  • [10] Aggregating Inter-Sentence Information to Enhance Relation Extraction
    Zheng, Hao
    Li, Zhoujun
    Wang, Senzhang
    Yan, Zhao
    Zhou, Jianshe
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3108 - 3114