Data Sampling in Multi-view and Multi-class Scatterplots via Set Cover Optimization

被引:18
|
作者
Hu, Ruizhen [1 ]
Sha, Tingkai [1 ]
van Kaick, Oliver [2 ]
Deussen, Oliver [3 ,4 ]
Huang, Hui [1 ]
机构
[1] Shenzhen Univ, Visual Comp Res Ctr, Shenzhen, Guangdong, Peoples R China
[2] Carleton Univ, Sch Comp Sci, Ottawa, ON, Canada
[3] Konstanz Univ, Constance, Germany
[4] SIAT, Shenzhen VisuCA Key Lab, Shenzhen, Guangdong, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Sampling; Scatterplot; SPLOM; Exact Cover Problem; QUALITY METRICS; VISUALIZATION; REDUCTION;
D O I
10.1109/TVCG.2019.2934799
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a method for data sampling in scatterplots by jointly optimizing point selection for different views or classes. Our method uses space-filling curves (Z-order curves) that partition a point set into subsets that, when covered each by one sample, provide a sampling or coreset with good approximation guarantees in relation to the original point set. For scatterplot matrices with multiple views, different views provide different space-filling curves, leading to different partitions of the given point set. For multi-class scatterplots, the focus on either per-class distribution or global distribution provides two different partitions of the given point set that need to be considered in the selection of the coreset. For both cases, we convert the coreset selection problem into an Exact Cover Problem (ECP), and demonstrate with quantitative and qualitative evaluations that an approximate solution that solves the ECP efficiently is able to provide high-quality samplings.
引用
收藏
页码:739 / 748
页数:10
相关论文
共 50 条
  • [41] Multi-View Classification via a Fast and Effective Multi-View Nearest-Subspace Classifier
    Shu, Ting
    Zhang, Bob
    Tang, Yuan Yan
    IEEE ACCESS, 2019, 7 : 49669 - 49679
  • [42] Multi-view classification via Multi-view Partially Common Feature Latent Factor Learning
    Liu, Jian-Wei
    Xie, Hao-Jie
    Lu, Run-Kun
    Luo, Xiong-Lin
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3323 - 3330
  • [43] Multi-Manifold Optimization for Multi-View Subspace Clustering
    Khan, Aparajita
    Maji, Pradipta
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3895 - 3907
  • [44] A multi-view genomic data simulator
    Michele Fratello
    Angela Serra
    Vittorio Fortino
    Giancarlo Raiconi
    Roberto Tagliaferri
    Dario Greco
    BMC Bioinformatics, 16
  • [45] Multi-View Missing Data Completion
    Zhang, Lei
    Zhao, Yao
    Zhu, Zhenfeng
    Shen, Dinggang
    Ji, Shuiwang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (07) : 1296 - 1309
  • [46] Visual inspection of multivariate volume data based on multi-class noise sampling
    Ding, Zhiyu
    Ding, Ziang
    Chen, Weifeng
    Chen, Haidong
    Tao, Yubo
    Li, Xin
    Chen, Wei
    VISUAL COMPUTER, 2016, 32 (04): : 465 - 478
  • [47] Fundamental sampling patterns for low-rank multi-view data completion
    Ashraphijuo, Morteza
    Wang, Xiaodong
    Aggarwal, Vaneet
    PATTERN RECOGNITION, 2020, 103
  • [48] GDHS: An efficient hybrid sampling method for multi-class imbalanced data classification
    Yan, Yuanting
    Lv, Yan
    Han, Shuangyue
    Yu, Chengjin
    Zhou, Peng
    Neurocomputing, 2025, 637
  • [49] Fundamental sampling patterns for low-rank multi-view data completion
    Ashraphijuo, Morteza
    Wang, Xiaodong
    Aggarwal, Vaneet
    Pattern Recognition, 2020, 103
  • [50] Comparing and Analysis of Different Optimization Techniques on Sparse Multi-Class Data
    Panda, Digbijay
    Singh, Sanika
    Mukherjee, Saurabh
    Chakraborty, Sudeshna
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 528 - 531