Exploring Multi-dimensional Data via Subset Embedding

被引:8
|
作者
Xie, Peng [1 ]
Tao, Wenyuan [1 ]
Li, Jie [1 ]
Huang, Wentao [1 ]
Chen, Siming [2 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Fudan Univ, Sch Data Sci, Shanghai, Peoples R China
基金
国家重点研发计划;
关键词
NONLINEAR DIMENSIONALITY REDUCTION; VISUAL EXPLORATION; VISUALIZATION; REPRESENTATION; SETS;
D O I
10.1111/cgf.14290
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-dimensional data exploration is a classic research topic in visualization. Most existing approaches are designed for identifying record patterns in dimensional space or subspace. In this paper, we propose a visual analytics approach to exploring subset patterns. The core of the approach is a subset embedding network (SEN) that represents a group of subsets as uniformly-formatted embeddings. We implement the SEN as multiple subnets with separate loss functions. The design enables to handle arbitrary subsets and capture the similarity of subsets on single features, thus achieving accurate pattern exploration, which in most cases is searching for subsets having similar values on few features. Moreover, each subnet is a fully-connected neural network with one hidden layer. The simple structure brings high training efficiency. We integrate the SEN into a visualization system that achieves a 3-step workflow. Specifically, analysts (1) partition the given dataset into subsets, (2) select portions in a projected latent space created using the SEN, and (3) determine the existence of patterns within selected subsets. Generally, the system combines visualizations, interactions, automatic methods, and quantitative measures to balance the exploration flexibility and operation efficiency, and improve the interpretability and faithfulness of the identified patterns. Case studies and quantitative experiments on multiple open datasets demonstrate the general applicability and effectiveness of our approach.
引用
收藏
页码:75 / 86
页数:12
相关论文
共 50 条
  • [1] Visual exploration of multi-dimensional data via rule-based sample embedding
    Zhang, Tong
    Li, Jie
    Xu, Chao
    [J]. VISUAL INFORMATICS, 2024, 8 (03): : 53 - 56
  • [2] Comparison Analysis of Feature Subset Selection Algorithms for Multi-dimensional Data
    Ruan, Jing
    Zhang, Chang-Sheng
    Guo, Jun-Fang
    [J]. 2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 681 - 686
  • [3] Network Embedding via Coupled Kernelized Multi-Dimensional Array Factorization
    Xu, Linchuan
    Cao, Jiannong
    Wei, Xiaokai
    Yu, Philip S.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (12) : 2414 - 2425
  • [4] Filter co-ordinations for exploring multi-dimensional data
    Sifer, M
    [J]. JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2006, 17 (02): : 107 - 125
  • [5] Multi-Dimensional Network Embedding with Hierarchical Structure
    Ma, Yao
    Ren, Zhaochun
    Jiang, Ziheng
    Tang, Jiliang
    Yin, Dawei
    [J]. WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 387 - 395
  • [6] Embedding multi-dimensional meshes into twisted cubes
    Dong, Qiang
    Yang, Xiaofan
    Wang, Dajin
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2010, 36 (06) : 1021 - 1026
  • [7] Compressing Embedding Table via Multi-dimensional Quantization Encoding for Sequential Recommender Model
    Wang, Feng
    Dai, Miaomiao
    Li, Xudong
    Pan, Liquan
    [J]. 2021 THE 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2021, 2021, : 234 - 239
  • [8] Visualizing multi-dimensional data
    Eick, SG
    [J]. COMPUTER GRAPHICS-US, 2000, 34 (01): : 61 - 67
  • [9] Visualizing multi-dimensional data
    Eick, Stephen G.
    [J]. Computer Graphics (ACM), 2000, 34 (01): : 61 - 67
  • [10] Multi-dimensional Data Correlation Analysis Method Based on Neighborhood Preserving Embedding Mechanism
    Ge, Zhongdi
    Zhao, Longjun
    Wang, Zhen
    Cui, Dandan
    Yang, Yang
    Gao, Zhipeng
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2021,