Microblog Dimensionality Reduction: A Deep Learning Approach

Cited by: 13
Authors
Xu, Lei [1]
Jiang, Chunxiao [2]
Ren, Yong [2]
Chen, Hsiao-Hwa [3]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Natl Cheng Kung Univ, Dept Engn Sci, Tainan 70101, Taiwan
Funding
China Postdoctoral Science Foundation
Keywords
Microblog mining; dimension reduction; text representation; semantic relatedness; deep autoencoder
DOI
10.1109/TKDE.2016.2540639
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Extracting potentially useful information from the huge amount of textual data produced by microblogging services has attracted much attention in recent years. An important preprocessing step in microblog text mining is converting natural language texts into suitable numerical representations. Because microblog texts are short, representing them with term frequency vectors causes a "sparse data" problem, so finding suitable representations is challenging. In this paper, we apply deep networks to map the high-dimensional representations of microblog texts to low-dimensional representations. To improve the result of dimensionality reduction, we take advantage of the semantic similarity derived from two types of microblog-specific information, namely the retweet relationship and hashtags. We propose two types of approaches for making use of this information: modifying the training data and modifying the training objective of the deep networks. Experimental results show that the deep models perform better than traditional dimensionality reduction methods such as latent semantic analysis and the latent Dirichlet allocation topic model, and that the use of microblog-specific information helps to learn better representations.
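The idea of "modifying the training objective" can be illustrated with a small NumPy sketch. This is not the authors' model: the abstract gives no architecture or loss, so the single hidden layer, the `tanh` activation, the squared-distance pair penalty, the weight `lam`, and the toy data below are all assumptions chosen for illustration. The sketch trains an autoencoder on term-frequency vectors and adds a term that pulls together the codes of texts assumed to be related (for example, by a shared hashtag or a retweet link).

```python
import numpy as np

def train_autoencoder(X, pairs, code_dim=2, lam=0.5, lr=0.05, epochs=500, seed=0):
    """Toy one-hidden-layer autoencoder with a pairwise similarity term.

    X     : (n_docs, vocab) term-frequency matrix
    pairs : list of (i, j) row indices assumed to be semantically related
    loss  : mean reconstruction error + lam * mean squared code distance of pairs
    All hyperparameters here are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, code_dim))
    b1 = np.zeros(code_dim)
    W2 = rng.normal(0.0, 0.1, (code_dim, d))
    b2 = np.zeros(d)
    I = np.array([i for i, _ in pairs], dtype=int)
    J = np.array([j for _, j in pairs], dtype=int)
    P = max(len(pairs), 1)  # avoid division by zero when no pairs are given
    history = []
    for _ in range(epochs):
        # Forward pass: encode to low-dimensional codes, decode linearly.
        H = np.tanh(X @ W1 + b1)
        R = H @ W2 + b2
        E = R - X              # reconstruction error
        D = H[I] - H[J]        # code differences of related pairs
        history.append((E ** 2).sum() / n + lam * (D ** 2).sum() / P)
        # Backward pass (manual gradients of the loss above).
        dR = 2.0 * E / n
        dW2 = H.T @ dR
        db2 = dR.sum(axis=0)
        dH = dR @ W2.T
        dD = 2.0 * lam * D / P
        np.add.at(dH, I, dD)   # accumulate, since an index may repeat
        np.add.at(dH, J, -dD)
        dZ = dH * (1.0 - H ** 2)
        dW1 = X.T @ dZ
        db1 = dZ.sum(axis=0)
        # Plain gradient descent step.
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2

    def encode(A):
        return np.tanh(A @ W1 + b1)

    return encode, history

# Hypothetical toy corpus: 4 short texts over a 6-term vocabulary.
X = np.array([[1, 1, 0, 0, 0, 0],
              [0, 1, 1, 0, 0, 0],
              [0, 0, 0, 1, 1, 0],
              [0, 0, 0, 0, 1, 1]], dtype=float)
# Assumed related pairs, e.g. texts sharing a hashtag.
encode, history = train_autoencoder(X, pairs=[(0, 1), (2, 3)])
codes = encode(X)
print(codes.shape)
```

Setting `lam=0` reduces this to an ordinary autoencoder, which makes it easy to compare codes learned with and without the pairwise term on the same data.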
Pages: 1779-1789
Page count: 11