Online Sketching of Big Categorical Data with Absent Features

被引:0
|
作者
Shen, Yanning [1 ]
Mardani, Morteza
Giannakis, Georgios B.
机构
[1] Univ Minnesota, ECE Dept, Minneapolis, MN 55455 USA
关键词
Rank regularization; categorical data; online sketching;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the scale of data growing every day, reducing the dimensionality (a.k.a. sketching) of high-dimensional vectors has emerged as a task of increasing importance. Relevant issues to address in this context include the sheer volume of data vectors that may consist of categorical (meaning finite-alphabet) features, the typically streaming format of data acquisition, and the possibly absent features. To cope with these challenges, the present paper brings forth a novel rank-regularized maximum likelihood approach that models categorical data as quantized values of analog-amplitude features with low intrinsic dimensionality. This model along with recent online rank regularization advances are leveraged to sketch high-dimensional categorical data 'on the fly.' Simulated tests with synthetic as well as real-world datasets-corroborate the merits of the novel scheme relative to state-of-the-art alternatives.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Online Categorical Subspace Learning for Sketching Big Data with Misses
    Shen, Yanning
    Mardani, Morteza
    Giannakis, Georgios B.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (15) : 4004 - 4018
  • [2] ONLINE SKETCHING FOR BIG DATA SUBSPACE LEARNING
    Mardani, Morteza
    Giannakis, Georgios B.
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2511 - 2515
  • [3] Big Data Sketching with Model Mismatch
    Chepuri, Sundeep Prabhakar
    Zhang, Yu
    Leus, Geert
    Giannakis, G. B.
    [J]. 2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 97 - 101
  • [4] Categorical Big Data Processing
    Salvador-Meneses, Jaime
    Ruiz-Chavez, Zoila
    Garcia-Rodriguez, Jose
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 245 - 252
  • [5] Experimental Evaluation of Sketching Techniques for Big Spatial Data
    Siddique, A. B.
    Eldawy, Ahmed
    [J]. PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18), 2018, : 522 - 522
  • [6] Big Data Clustering via Random Sketching and Validation
    Traganitis, Panagiotis A.
    Slavakis, Konstantinos
    Giannakis, Georgios B.
    [J]. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1046 - 1050
  • [7] LESS: Big Data Sketching and Encryption on Low Power Platform
    Kulkarni, Amey
    Shea, Colin
    Homayoun, Houman
    Mohsenin, Tinoosh
    [J]. PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1631 - 1634
  • [8] Absent Qualia and Categorical Properties
    O'Sullivan, Brendan
    [J]. ERKENNTNIS, 2012, 76 (03) : 353 - 371
  • [9] Absent Qualia and Categorical Properties
    Brendan O’Sullivan
    [J]. Erkenntnis, 2012, 76 : 353 - 371
  • [10] Sketching for Latent Dirichlet-Categorical Models
    Tassarotti, Joseph
    Tristan, Jean-Baptiste
    Wick, Michael
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 : 256 - 265