Unsupervised Tensor Mining for Big Data Practitioners

被引:3
|
作者
Papalexakis, Evangelos E. [1 ,2 ]
Faloutsos, Christos [1 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Calif Riverside, Dept Comp Sci & Engn, 355 Winston Chung Hall, Riverside, CA 92521 USA
关键词
big data analytics; data mining; machine learning; DECOMPOSITIONS; FACTORIZATION; UNIQUENESS; SPARSE; RANK;
D O I
10.1089/big.2016.0026
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Multiaspect data are ubiquitous in modern Big Data applications. For instance, different aspects of a social network are the different types of communication between people, the time stamp of each interaction, and the location associated to each individual. How can we jointly model all those aspects and leverage the additional information that they introduce to our analysis? Tensors, which are multidimensional extensions of matrices, are a principled and mathematically sound way of modeling such multiaspect data. In this article, our goal is to popularize tensors and tensor decompositions to Big Data practitioners by demonstrating their effectiveness, outlining challenges that pertain to their application in Big Data scenarios, and presenting our recent work that tackles those challenges. We view this work as a step toward a fully automated, unsupervised tensor mining tool that can be easily and broadly adopted by practitioners in academia and industry.
引用
收藏
页码:179 / 191
页数:13
相关论文
共 50 条
  • [1] Unsupervised data mining suitable for evolutionary and genomic studies in the era of big data
    Iwasaki, Hiroki
    Abe, Takashi
    Wada, Yoshiko
    Wada, Kennosuke
    Ikemura, Toshimichi
    [J]. GENES & GENETIC SYSTEMS, 2014, 89 (06) : 285 - 285
  • [2] Unsupervised data analytics in mining big building operational data for energy efficiency enhancement: A review
    Fan, Cheng
    Xiao, Fu
    Li, Zhengdao
    Wang, Jiayuan
    [J]. ENERGY AND BUILDINGS, 2018, 159 : 296 - 308
  • [3] Balancing Interpretability and Predictive Accuracy for Unsupervised Tensor Mining
    Zabir, Ishmam
    Papalexakis, Evangelos E.
    [J]. 2017 FIFTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2017, : 708 - 710
  • [4] Data Mining with Big Data
    Sowmya, R.
    Suneetha, K. R.
    [J]. PROCEEDINGS OF 2017 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2017), 2017, : 246 - 250
  • [5] Data Mining with Big Data
    Wu, Xindong
    Zhu, Xingquan
    Wu, Gong-Qing
    Ding, Wei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 97 - 107
  • [6] Three-way unsupervised data mining for power system applications based on tensor decomposition
    Sandoval, Betsy
    Barocio, Emilio
    Korba, Petr
    Sevilla, Felix Rafael Segundo
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2020, 187
  • [7] Mining "Big Data" using Big Data Services
    Reips, Ulf-Dietrich
    Matzat, Uwe
    [J]. INTERNATIONAL JOURNAL OF INTERNET SCIENCE, 2014, 9 (01) : 1 - 8
  • [8] Big Data Mining: Managing the Costs of Data Mining
    Ganasan, Jaya R.
    [J]. 2019 17TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING (ICT&KE), 2019, : 62 - 65
  • [9] Data field for mining big data
    Wang, Shuliang
    Li, Ying
    Wang, Dakui
    [J]. GEO-SPATIAL INFORMATION SCIENCE, 2016, 19 (02) : 106 - 118
  • [10] Unsupervised pattern clustering for data mining
    Wilamowska, K
    Manic, M
    [J]. IECON'01: 27TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, 2001, : 1862 - 1867