A Metadata-Assisted Cascading Ensemble Classification Framework for Automatic Annotation of Open IoT Data

被引:5
|
作者
Montori, Federico [1 ]
Liao, Kewen [2 ]
De Giosa, Matteo [3 ]
Jayaraman, Prem Prakash [4 ]
Bononi, Luciano [1 ]
Sellis, Timos [5 ]
Georgakopoulos, Dimitrios [4 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn, I-40127 Bologna, Italy
[2] Australian Catholic Univ, Peter Faber Business Sch, Discipline Informat Technol, Sydney, NSW 2060, Australia
[3] Univ Milano Bicocca, Dept Informat Syst & Commun, I-20126 Milan, Italy
[4] Swinburne Univ Technol, Dept Comp Technol, Hawthorn, Vic 3122, Australia
[5] Athena Res & Innovat Ctr, Archimedes Res Unit AI Data Sci & Algorithms, Maroussi 15125, Greece
基金
澳大利亚研究理事会;
关键词
Annotation; classification; Internet of Things (IoT); IoT metadata; open IoT data; sensors; COLLABORATIVE INTERNET; TIME;
D O I
10.1109/JIOT.2023.3263213
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Public Internet of Things (IoT) platforms, such as Thingspeak, significantly increased the availability of open IoT data and enabled faster and cheaper development of novel IoT applications by reducing or even eliminating the need for deploying their own IoT sensors and platforms. However, open IoT data is often heterogeneous, sparse, fuzzy, and lacks accurate description (which we refer to as IoT metadata). These limitations make open IoT data challenging to integrate and use, and prevent the efficient development of IoT applications. In fact, while several sensor data description models have been proposed and standardized, open IoT data currently lack or include only partial metadata description. Therefore, novel techniques for automatically annotating open IoT data are needed to fully unleash the power of open IoT. This article proposes a novel metadata-assisted cascading ensemble classification framework (MACE) for the automatic annotation of IoT data. MACE is capable of sequentially combining standalone classifiers, enabling it to cope with heterogeneous IoT data and different domains of information (e.g., numerical and textual), which have not been considered previously. MACE incorporates a novel ensemble approach for automatically selecting, sorting, filtering, and assembling classifiers in a way that improves annotation performance. This article presents extensive experimental evaluations of MACE using public IoT data sets. Results demonstrate that the MACE framework significantly outperforms existing solutions for open IoT data by as much as 10% in classification accuracy.
引用
收藏
页码:13401 / 13413
页数:13
相关论文
共 50 条
  • [1] Web Service Classification Based on Automatic Semantic Annotation and Ensemble Learning
    Li Yuan-jie
    Cao Jian
    [J]. 2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2274 - 2279
  • [2] Automatic Clustering and Semantic Annotation for Dynamic IoT Sensor Data
    Yu, Ching-Tzu
    Zou, Yu-Hui
    Li, Hao-Yu
    Lin, Szu-Yin
    [J]. 2018 FIRST INTERNATIONAL COGNITIVE CITIES CONFERENCE (IC3 2018), 2018, : 188 - 189
  • [3] A Metadata Classification Assisted Scientific Data Extraction Architecture
    Chang, Yue-Shan
    Cheng, Hsiang-Tai
    [J]. ADVANCES IN GRID AND PERVASIVE COMPUTING, PROCEEDINGS, 2010, 6104 : 679 - 688
  • [4] Open Metadata: User-centred competitive Data Exploitation with open Framework Data
    Muehlhaeuser, Max
    [J]. ZUKUNFT DER DATENOKONOMIE: ZWISCHEN GESCHAFTSMODELL, KOLLEKTIVGUT UND VERBRAUCHERSCHUTZ, 2019, : 71 - 102
  • [5] Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework
    Li, Li-Jia
    Socher, Richard
    Li Fei-Fei
    [J]. CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 2036 - 2043
  • [6] Utilizing Linked Open Data Sources for Automatic Generation of Semantic Metadata
    Nummiaho, Antti
    Vainikainen, Sari
    Melin, Magnus
    [J]. METADATA AND SEMANTIC RESEARCH, 2010, 108 : 78 - 83
  • [7] An Ensemble of Statistical Metadata and CNN Classification of Class Imbalanced Skin Lesion Data
    Nayak, Sachin
    Vincent, Shweta
    Sumathi, K.
    Kumar, Om Prakash
    Pathan, Sameena
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2021, 68 (02) : 251 - 257
  • [8] An Ensemble of Statistical Metadata and CNN Classification of Class Imbalanced Skin Lesion Data
    Nayak, Sachin
    Vincent, Shweta
    Sumathi, K.
    Kumar, Om Prakash
    Pathan, Sameena
    [J]. International Journal of Electronics and Telecommunications, 2022, 68 (02): : 251 - 257
  • [9] A Classifier Ensemble Framework for Multimedia Big Data Classification
    Yan, Yilin
    Zhu, Qiusha
    Shyu, Mei-Ling
    Chen, Shu-Ching
    [J]. PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 615 - 622
  • [10] Semantic Annotation Automatic of Curriculum Lattes Using Linked Open Data
    da Silva, Walison Dias
    Parreiras, Fernando Silva
    Gomes Maia, Luiz Claudio
    Brandao, Wladmir Cardoso
    [J]. PERSPECTIVAS EM CIENCIA DA INFORMACAO, 2018, 23 (04): : 53 - 72