Microblog Dimensionality Reduction-A Deep Learning Approach

被引:13
|
作者
Xu, Lei [1 ]
Jiang, Chunxiao [2 ]
Ren, Yong [2 ]
Chen, Hsiao-Hwa [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 70101, Taiwan
基金
中国博士后科学基金;
关键词
Microblog mining; dimension reduction; text representation; semantic relatedness; deep autoencoder;
D O I
10.1109/TKDE.2016.2540639
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploring potentially useful information from huge amount of textual data produced by microblogging services has attracted much attention in recent years. An important preprocessing step of microblog text mining is to convert natural language texts into proper numerical representations. Due to the short-length characteristics of microblog texts, using term frequency vectors to represent microblog texts will cause "sparse data" problem. Finding proper representations of microblog texts is a challenging issue. In this paper, we apply deep networks to map the high-dimensional representations of microblog texts to low-dimensional representations. To improve the result of dimensionality reduction, we take advantage of the semantic similarity derived from two types of microblog-specific information, namely the retweet relationship and hashtags. Two types of approaches, including modifying training data and modifying the training objective of deep networks, are proposed to make use of microblog-specific information. Experiment results show that the deep models perform better than traditional dimensionality reduction methods such as latent semantic analysis and latent Dirichlet allocation topic model, and the use of microblog-specific information can help to learn better representations.
引用
收藏
页码:1779 / 1789
页数:11
相关论文
共 50 条
  • [21] Joint Precoding and CSI Dimensionality Reduction: An Efficient Deep Unfolding Approach
    Zhang, Jianjun
    Masouros, Christos
    Hanzo, Lajos
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9502 - 9516
  • [22] Dimensionality Reduction and Learning on Networks
    Duke University
    [J].
  • [23] Dimensionality Reduction in Statistical Learning
    Bernstein, Alexander
    Kuleshov, Alexander
    [J]. 2014 13TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2014, : 330 - 335
  • [24] A fuzzy entropy technique for dimensionality reduction in recommender systems using deep learning
    Saravanan, B.
    Mohanraj, V.
    Senthilkumar, J.
    [J]. SOFT COMPUTING, 2019, 23 (08) : 2575 - 2583
  • [25] Exploring Dimensionality Reduction Techniques for Deep Learning Driven QSAR Models of Mutagenicity
    Kalian, Alexander D.
    Benfenati, Emilio
    Osborne, Olivia J.
    Gott, David
    Potter, Claire
    Dorne, Jean-Lou C. M.
    Guo, Miao
    Hogstrand, Christer
    [J]. TOXICS, 2023, 11 (07)
  • [26] A fuzzy entropy technique for dimensionality reduction in recommender systems using deep learning
    B. Saravanan
    V. Mohanraj
    J. Senthilkumar
    [J]. Soft Computing, 2019, 23 : 2575 - 2583
  • [27] A supervised non-linear dimensionality reduction approach for manifold learning
    Raducanu, B.
    Dornaika, F.
    [J]. PATTERN RECOGNITION, 2012, 45 (06) : 2432 - 2444
  • [28] NDR: Noise and Dimensionality Reduction of CSI for Indoor Positioning using Deep Learning
    Sobehy, Abdallah
    Renault, Eric
    Muhlethaler, Paul
    [J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [29] New Dimensionality Reduction Method of Wind Power Curve Based on Deep Learning
    Zhang, Yujing
    Qiao, Ying
    Lu, Zongxiang
    Wang, Wei
    [J]. 2018 INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2018, : 4357 - 4361
  • [30] Deep Ensemble Transformers for Dimensionality Reduction
    Nareklishvili, Maria
    Geitle, Marius
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12