Fused adjacency matrices to enhance information extraction: The beer benchmark

被引：8

作者：

Cavallini, Nicola ^{[1
,2
]}

Savorani, Francesco ^{[3
]}

Bro, Rasmus ^{[2
]}

Cocchi, Marina ^{[1
]}

机构：

[1] Univ Modena & Reggio Emilia, Dipartimento Sci Chim & Geol, Via Campi 103, I-41125 Modena, MO, Italy

[2] Univ Copenhagen, Fac Sci, Dept Food Sci, Chemometr & Analyt Technol, Rolighedsvej 26, DK-1958 Frederiksberg C, Denmark

[3] Politecn Torino, Dept Appl Sci & Technol, Corso Duca Abruzzi 24, I-10129 Turin, TO, Italy

来源：

ANALYTICA CHIMICA ACTA | 2019年 / 1061卷

关键词：

Data fusion; Adjacency matrix; Clustering; Data visualization; Spectroscopy; Beer; NUCLEAR-MAGNETIC-RESONANCE; ARTIFICIAL NEURAL-NETWORKS; PROJECTION PURSUIT; QUALITY-CONTROL; DATA-FUSION; MULTIVARIATE-ANALYSIS; PROCRUSTES ROTATION; SPECTROSCOPY; NMR; PROFILES;

D O I：

10.1016/j.aca.2019.02.023

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Multivariate exploratory data analysis allows revealing patterns and extracting information from complex multivariate data sets. However, highly complex data may not show evident groupings or trends in the principal component space, e.g. because the variation of the variables are not grouped but rather continuous. In these cases, classical exploratory methods may not provide satisfactory results when the aim is to find distinct groupings in the data. To enhance information extraction in such situations, we propose a novel approach inspired by the concept of combining weak classifiers, but in the unsupervised context. The approach is based on the fusion of several adjacency matrices obtained by different distance measures on data from different analytical platforms. This paper is intended to present and discuss the potential of the approach through a benchmark data set of beer samples. The beer data were acquired using three spectroscopic techniques: Visible, near-Infrared and Nuclear Magnetic Resonance. The results of fusing the three data sets via the proposed approach are compared with those from the single data blocks (Visible, NIR and NMR) and from a standard mid-level data fusion methodology. It is shown that, with the suggested approach, groupings related to beer style and other features are efficiently recovered, and generally more evident. (C) 2019 Elsevier B.V. All rights reserved.

引用

页码：70 / 83

页数：14

共 24 条

[1] GISB: A Benchmark for Geographic Map Information Extraction
Martins, Pedro
Cecilio, Jose
Abbasi, Maryam
Furtado, Pedro
BEYOND DATABASES, ARCHITECTURES AND STRUCTURES, BDAS 2016, 2016, 613 : 600 - 609
[2] Computer Analysis of Benign and Malignant Breast Specimen Images By Graph Extraction and Adjacency Matrices
Boroujeni, Amir Momeni
Gupta, Raavi
Yousefi, Elham
LABORATORY INVESTIGATION, 2015, 95 : 400A - 400A
[3] Computer Analysis of Benign and Malignant Breast Specimen Images By Graph Extraction and Adjacency Matrices
Boroujeni, Amir Momeni
Gupta, Raavi
Yousefi, Elham
MODERN PATHOLOGY, 2015, 28 : 400A - 400A
[4] The information content of the eigenvalues from modified adjacency matrices: Large scale and small scale correlations
Benigni, R
Passerini, L
Pino, A
Giuliani, A
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1999, 18 (05): : 449 - 455
[5] Inference Approach to Enhance a Portuguese Open Information Extraction
Lima Sena, Cleiton Fernando
Glauber, Rafael
Claro, Daniela Barreiro
ICEIS: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2017, : 442 - 451
[6] AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark
Friedrich, Niklas
Gashteovski, Kiril
Yu, Mingying
Kotnis, Bhushan
Lawrence, Carolin
Niepert, Mathias
Glavas, Goran
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2022, : 44 - 60
[7] Application of Text Rank Algorithm Fused With LDA in Information Extraction Model
Wei, Yunbo
Ding, Yongsheng
IEEE ACCESS, 2023, 11 : 84301 - 84312
[8] TABLEX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables
Desai, Harsh
Kayal, Pratik
Singh, Mayank
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 554 - 569
[9] WiRe57: A Fine-Grained Benchmark for Open Information Extraction
Lechelle, William
Gotti, Fabrizio
Langlais, Philippe
13TH LINGUISTIC ANNOTATION WORKSHOP (LAW XIII), 2019, : 6 - 15
[10] Aggregating Inter-Sentence Information to Enhance Relation Extraction
Zheng, Hao
Li, Zhoujun
Wang, Senzhang
Yan, Zhao
Zhou, Jianshe
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3108 - 3114

← 1 2 3 →