Extraction, Identification and Ranking of Network Structures from Data Sets

被引:10
|
作者
Trovati, Marcello [1 ]
Bessis, Nik [1 ]
Huber, Anna [1 ]
Zelenkauskaite, Asta [2 ]
Asimakopoulou, Eleana [1 ]
机构
[1] Univ Derby, Sch Comp & Math, Derby DE22 1GB, England
[2] Drexel Univ, Dept Culture & Commun, Philadelphia, PA USA
关键词
Knowledge discovery; Networks; Information extraction; Social graphs; Data analytics;
D O I
10.1109/CISIS.2014.46
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Networks are widely used to model a variety of complex, often multi-disciplinary, systems in which the relationships between their sub-parts play a significant role. In particular, there is extensive research on the topological properties associated with their structure as this allows the analysis of the overall behaviour of such networks. However, extracting networks from structured and unstructured data sets raises several challenges, including addressing any inconsistency present in the data, as well as the difficulty in investigating their properties especially when the topological structure is not fully determined or not explicitly defined. In this paper, we propose a novel method to address the automated identification, assessment and ranking of the most likely structure associated with networks extracted from a variety of data sets. More specifically, our approach allows to mine data to assess whether their associated networks exhibit properties comparable to well-known structures, namely scale-free, small world and random networks. The main motivation is to provide a toolbox to classify and analyse real-world networks otherwise difficult to fully assess due to their potential lack of structure. This can be used to investigate their dynamical and statistical behaviour which would potentially lead to a better understanding and prediction of the properties of the system(s) they model. Our initial validation shows the potential of our method providing relevant and accurate results.
引用
收藏
页码:331 / 337
页数:7
相关论文
共 50 条
  • [1] Data structures and computational tools for the extraction of SAR information from large compound sets
    Wawer, Mathias
    Lounkine, Eugen
    Wassermann, Anne M.
    Bajorath, Juergen
    DRUG DISCOVERY TODAY, 2010, 15 (15-16) : 630 - 639
  • [2] Neural Network Based Inverse System Identification from Small Data Sets
    Wanigasekara, Chathura
    Swain, Akshya
    Nguang, Sing Kiong
    Prusty, B. Gangadhara
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [3] ANALYSIS OF LARGE SETS OF RANKING DATA
    COHEN, A
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1982, 11 (03): : 235 - 256
  • [4] Data management and extraction of biological information from large data sets
    Mount, David
    IN VITRO CELLULAR & DEVELOPMENTAL BIOLOGY-ANIMAL, 2008, 44 : S3 - S4
  • [5] On the role of maximal independent sets in cleaning data sets for supervised ranking
    Rademaker, Michael
    De Baets, Bernard
    De Meyer, Hans
    2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 1619 - +
  • [6] Mesoscopic structures reveal the network between the layers of multiplex data sets
    Iacovacci, Jacopo
    Wu, Zhihao
    Bianconi, Ginestra
    PHYSICAL REVIEW E, 2015, 92 (04)
  • [7] POWDER STRUCTURES FROM LIMITED DATA SETS.
    Poojary, Damodara M.
    Clearfield, Abraham
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 1996, 52 : C59 - C59
  • [8] (Automatic) Cluster Count Extraction from Unlabeled Data Sets
    Sledge, Isaac J.
    Huband, Jacalyn M.
    Bezdek, James C.
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 3 - +
  • [9] Digital elevation extraction from multiple MTI data sets
    Mercier, JA
    Schowengerdt, RA
    Storey, JC
    Smith, JL
    IMAGING SPECTROMETRY IX, 2003, 5159 : 158 - 166
  • [10] A fast algorithm for learning a ranking function from large-scale data sets
    Raykar, Vikas C.
    Duraiswami, Ramani
    Krishnapuram, Balaji
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (07) : 1158 - 1170