Unsupervised Tensor Mining for Big Data Practitioners

Cited by: 3
Authors
Papalexakis, Evangelos E. [1, 2]
Faloutsos, Christos [1]
Affiliations
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Calif Riverside, Dept Comp Sci & Engn, 355 Winston Chung Hall, Riverside, CA 92521 USA
Keywords
big data analytics; data mining; machine learning; DECOMPOSITIONS; FACTORIZATION; UNIQUENESS; SPARSE; RANK
DOI
10.1089/big.2016.0026
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Multiaspect data are ubiquitous in modern Big Data applications. For instance, different aspects of a social network are the different types of communication between people, the time stamp of each interaction, and the location associated with each individual. How can we jointly model all those aspects and leverage the additional information that they introduce to our analysis? Tensors, which are multidimensional extensions of matrices, are a principled and mathematically sound way of modeling such multiaspect data. In this article, our goal is to popularize tensors and tensor decompositions among Big Data practitioners by demonstrating their effectiveness, outlining challenges that pertain to their application in Big Data scenarios, and presenting our recent work that tackles those challenges. We view this work as a step toward a fully automated, unsupervised tensor mining tool that can be easily and broadly adopted by practitioners in academia and industry.
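To make the idea of modeling multiaspect data as a tensor concrete, the following is a minimal sketch and not the authors' method. It assumes interaction records of the form (sender, receiver, time step), counts them into a dense three-mode tensor, and fits one common decomposition, CP (also called PARAFAC), by plain alternating least squares. The abstract does not name a specific decomposition or algorithm; the function names `build_tensor`, `reconstruct`, and `cp_als`, the toy records, and the chosen rank are all hypothetical.

```python
import numpy as np

def build_tensor(records, shape):
    """Count (sender, receiver, time) records into a dense 3-mode tensor."""
    X = np.zeros(shape)
    for i, j, k in records:
        X[i, j, k] += 1.0
    return X

def reconstruct(A, B, C):
    """Sum of rank-one terms: X_hat[i, j, k] = sum_r A[i, r] * B[j, r] * C[k, r]."""
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def cp_als(X, rank, n_iter=50, seed=0):
    """Plain CP alternating least squares (illustrative, dense, unoptimized)."""
    rng = np.random.default_rng(seed)
    I, J, K = X.shape
    A = rng.random((I, rank))
    B = rng.random((J, rank))
    C = rng.random((K, rank))
    for _ in range(n_iter):
        # Each update is the least-squares solution for one factor matrix
        # with the other two held fixed.
        A = np.einsum("ijk,jr,kr->ir", X, B, C) @ np.linalg.pinv((B.T @ B) * (C.T @ C))
        B = np.einsum("ijk,ir,kr->jr", X, A, C) @ np.linalg.pinv((A.T @ A) * (C.T @ C))
        C = np.einsum("ijk,ir,jr->kr", X, A, B) @ np.linalg.pinv((A.T @ A) * (B.T @ B))
    return A, B, C

# Toy data (made up for illustration): 3 people, 2 time steps.
records = [(0, 1, 0), (0, 1, 0), (1, 2, 1), (2, 0, 1)]
X = build_tensor(records, shape=(3, 3, 2))

A, B, C = cp_als(X, rank=2)
rel_err = np.linalg.norm(X - reconstruct(A, B, C)) / np.linalg.norm(X)
print("relative reconstruction error:", rel_err)
```

Each column of `A`, `B`, and `C` gives per-sender, per-receiver, and per-time-step weights for one latent component. A dense tensor like this only works at toy scale; real interaction data would need a sparse representation.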
Pages: 179-191
Number of pages: 13