Cornac: Tackling Huge Graph Visualization with Big Data Infrastructure

被引:7
|
作者
Perrot, Alexandre [1 ,2 ]
Auber, David [1 ,2 ]
机构
[1] Univ Bordeaux, LaBRI, UMR 5800, F-33400 Talence, France
[2] CNRS, LaBRI, UMR 5800, F-33400 Talence, France
关键词
Information visualization; big data; graphs; hadoop; spark; ALGORITHM;
D O I
10.1109/TBDATA.2018.2869165
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The size of available graphs has drastically increased in recent years. The real-time visualization of graphs with millions of edges is a challenge but is necessary to grasp information hidden in huge datasets. This article presents an end-to-end technique to visualize huge graphs using an established Big Data ecosystem and a lightweight client running in a Web browser. For that purpose, levels of abstraction and graph tiles are generated by a batch layer and the interactive visualization is provided using a serving layer and client-side real-time computation of edge bundling and graph splatting. A major challenge is to create techniques that work without moving data to an ad hoc system and that take advantage of the horizontal scalability of these infrastructures. We introduce two novel scalable algorithms that enable to generate a canopy clustering and to aggregate graph edges. These two algorithms are both used to produce levels of abstraction and graph tiles. We prove that our technique guarantee a quality of visualization by controlling both the necessary bandwidth required for data transfer and the quality of the produced visualization. Furthermore, we demonstrate the usability of our technique by providing a complete prototype. We present benchmarks on graphs with millions of elements and we compare our results to those obtained by state of the art techniques. Our results show that new Big Data technologies can be incorporated into visualization pipeline to push out the size limits of graphs one can visually analyze.
引用
收藏
页码:80 / 92
页数:13
相关论文
共 50 条
  • [1] HUGE GRAPH VISUALIZATION AND ANALYSIS
    Kolomeychenko, Maxim
    Chepovskiy, Andrey
    [J]. BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2014, 30 (04): : 7 - 16
  • [2] A SURVEY ON BIG DATA: INFRASTRUCTURE, ANALYTICS, VISUALIZATION AND APPLICATIONS
    Saraswathi, S.
    Deepa, G.
    Vennila, G.
    Parthasarathy, S.
    Ramadoss, B.
    [J]. INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2022, 29 (05): : 618 - 648
  • [3] Tulip - A huge graph visualization framework
    Auber, D
    [J]. GRAPH DRAWING SOFTWARE, 2004, : 105 - 126
  • [4] Large Interactive Visualization of Density Functions on Big Data Infrastructure
    Perrot, Alexandre
    Bourqui, Romain
    Hanusse, Nicolas
    Lalanne, Frederic
    Auber, David
    [J]. 2015 IEEE 5TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2015, : 99 - 106
  • [5] Visualization of Big Data
    Kung, Sun-Yuan
    [J]. PROCEEDINGS OF 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2015, : 447 - 448
  • [6] Big Graph-based Data Visualization Experiences The WordNet Case Study
    Caldarola, Enrico G.
    Picariello, Antonio
    Rinaldi, Antonio M.
    [J]. 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 104 - 115
  • [7] The Big Picture for Big Data: Visualization
    Shneiderman, Ben
    [J]. SCIENCE, 2014, 343 (6172) : 730 - 730
  • [8] Big Data, Big Picture - Data Visualization of Health
    Bourke, Alison
    Ryan, Patrick B.
    Elhadad, Noemie
    Perer, Adam
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2016, 25 : 48 - 48
  • [9] Big Data Infrastructure: A Survey
    Salvador, Jaime
    Ruiz, Zoila
    Garcia-Rodriguez, Jose
    [J]. BIOMEDICAL APPLICATIONS BASED ON NATURAL AND ARTIFICIAL COMPUTING, PT II, 2017, 10338 : 249 - 258
  • [10] Using big data to accomplish a huge job
    Rivers, Louie, III
    [J]. NATURE SUSTAINABILITY, 2018, 1 (10): : 537 - 537