Microblog Dimensionality Reduction-A Deep Learning Approach

被引:13
|
作者
Xu, Lei [1 ]
Jiang, Chunxiao [2 ]
Ren, Yong [2 ]
Chen, Hsiao-Hwa [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 70101, Taiwan
基金
中国博士后科学基金;
关键词
Microblog mining; dimension reduction; text representation; semantic relatedness; deep autoencoder;
D O I
10.1109/TKDE.2016.2540639
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploring potentially useful information from huge amount of textual data produced by microblogging services has attracted much attention in recent years. An important preprocessing step of microblog text mining is to convert natural language texts into proper numerical representations. Due to the short-length characteristics of microblog texts, using term frequency vectors to represent microblog texts will cause "sparse data" problem. Finding proper representations of microblog texts is a challenging issue. In this paper, we apply deep networks to map the high-dimensional representations of microblog texts to low-dimensional representations. To improve the result of dimensionality reduction, we take advantage of the semantic similarity derived from two types of microblog-specific information, namely the retweet relationship and hashtags. Two types of approaches, including modifying training data and modifying the training objective of deep networks, are proposed to make use of microblog-specific information. Experiment results show that the deep models perform better than traditional dimensionality reduction methods such as latent semantic analysis and latent Dirichlet allocation topic model, and the use of microblog-specific information can help to learn better representations.
引用
收藏
页码:1779 / 1789
页数:11
相关论文
共 50 条
  • [41] DeepMorpher: deep learning-based design space dimensionality reduction for shape optimisation
    Abbas, Asad
    Rafiee, Ashkan
    Haase, Max
    [J]. JOURNAL OF ENGINEERING DESIGN, 2023, 34 (03) : 254 - 270
  • [42] Hyperspectral image classification based on manifold spectral dimensionality reduction and deep learning method
    Shi, Yun
    Ma, Donghui
    Lyu, Jie
    Li, Jie
    Shi, Jingjian
    [J]. Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2020, 36 (06): : 151 - 160
  • [43] Rock lithological instance classification by hyperspectral images using dimensionality reduction and deep learning
    Galdames, Francisco J.
    Perez, Claudio A.
    Estevez, Pablo A.
    Adams, Martin
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 224
  • [44] Circumventing the curse of dimensionality in magnetic resonance fingerprinting through a deep learning approach
    Barbieri, Marco
    Lee, Philip K.
    Brizi, Leonardo
    Giampieri, Enrico
    Solera, Francesco
    Castellani, Gastone
    Hargreaves, Brian A.
    Testa, Claudia
    Lodi, Raffaele
    Remondini, Daniel
    [J]. NMR IN BIOMEDICINE, 2022, 35 (04)
  • [45] PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning
    Levada, Alexandre L. M.
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 829 - 868
  • [46] PCA-KL: a parametric dimensionality reduction approach for unsupervised metric learning
    Alexandre L. M. Levada
    [J]. Advances in Data Analysis and Classification, 2021, 15 : 829 - 868
  • [47] Transfer Learning for Feature Dimensionality Reduction
    Thribhuvan, Nikhila
    Elayidom, Sudheep
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (05) : 721 - 727
  • [48] Kernel dimensionality reduction for supervised learning
    Fukumizu, K
    Bach, FR
    Jordan, MI
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 81 - 88
  • [49] Learning Discriminant Isomap for Dimensionality Reduction
    Yang, Bo
    Xiang, Ming
    Zhang, Yupei
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [50] Dimensionality reduction approach to multivariate prediction
    Merola, GM
    Abraham, B
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2001, 29 (02): : 191 - 200