Automated Social Text Annotation With Joint Multilabel Attention Networks

被引:15
|
作者
Dong, Hang [1 ,2 ,3 ]
Wang, Wei [2 ]
Huang, Kaizhu [4 ,5 ]
Coenen, Frans [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZX, Merseyside, England
[2] Xian Jiaotong Liverpool Univ, Dept Comp Sci & Software Engn, Suzhou 215123, Peoples R China
[3] Univ Edinburgh, Ctr Med Informat, Usher Inst, Edinburgh EH16 4UX, Midlothian, Scotland
[4] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[5] Alibaba Zhejiang Univ Joint Inst Frontier Technol, Hangzhou 310000, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention mechanisms; automated social annotation; deep learning; multilabel classification; recurrent neural networks (RNNs); CLASSIFICATION; QUALITY;
D O I
10.1109/TNNLS.2020.3002798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated social text annotation is the task of suggesting a set of tags for shared documents on social media platforms. The automated annotation process can reduce users' cognitive overhead in tagging and improve tag management for better search, browsing, and recommendation of documents. It can be formulated as a multilabel classification problem. We propose a novel deep learning-based method for this problem and design an attention-based neural network with semantic-based regularization, which can mimic users' reading and annotation behavior to formulate better document representation, leveraging the semantic relations among labels. The network separately models the title and the content of each document and injects an explicit, title-guided attention mechanism into each sentence. To exploit the correlation among labels, we propose two semantic-based loss regularizers, i.e., similarity and subsumption, which enforce the output of the network to conform to label semantics. The model with the semantic-based loss regularizers is referred to as the joint multilabel attention network (JMAN). We conducted a comprehensive evaluation study and compared JMAN to the state-of-the-art baseline models, using four large, real-world social media data sets. In terms of F-1, JMAN significantly outperformed bidirectional gated recurrent unit (Bi-GRU) relatively by around 12.8%-78.6% and the hierarchical attention network (HAN) by around 3.9%-23.8%. The JMAN model demonstrates advantages in convergence and training speed. Further improvement of performance was observed against latent Dirichlet allocation (LDA) and support vector machine (SVM). When applying the semantic-based loss regularizers, the performance of HAN and Bi-GRU in terms of F-1 was also boosted. It is also found that dynamic update of the label semantic matrices (JMAN(d)) has the potential to further improve the performance of JMAN but at the cost of substantial memory and warrants further study.
引用
收藏
页码:2224 / 2238
页数:15
相关论文
共 50 条
  • [31] An Annotation Schema for the Detection of Social Bias in Legal Text Corpora
    Gumusel, Ece
    Malic, Vincent Quirante
    Donaldson, Devan Ray
    Ashley, Kevin
    Liu, Xiaozhong
    INFORMATION FOR A BETTER WORLD: SHAPING THE GLOBAL FUTURE, PT I, 2022, 13192 : 185 - 194
  • [32] Development of joint attention and social referencing
    Boucenna, Sofiane
    Gaussier, Philippe
    Hafemeister, Laurence
    2011 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING (ICDL), 2011,
  • [33] JIND: joint integration and discrimination for automated single-cell annotation
    Goyal, Mohit
    Serrano, Guillermo
    Argemi, Josepmaria
    Shomorony, Ilan
    Hernaez, Mikel
    Ochoa, Idoia
    BIOINFORMATICS, 2022, 38 (09) : 2488 - 2495
  • [34] A Social Framework for the Organisation and Automated Annotation of Personal Photo Collections
    Hughes, Mark
    THIRD INTERNATIONAL WORKSHOP ON SEMANTIC MEDIA ADAPTATION AND PERSONALIZATION, PROCEEDINGS, 2008, : 112 - 115
  • [35] JAM: Joint attention model for next event recommendation in event-based social networks
    Liao, Guoqiong
    Yang, Lechuan
    Mao, Mingsong
    Wan, Changxuan
    Liu, Dexi
    Liu, Xiping
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [36] DocRicher: An Automatic Annotation System for Text Documents Using Social Media
    Hu, Qiang
    Liu, Qi
    Wang, Xiaoli
    Tung, Anthony K. H.
    Goyal, Shubham
    Yang, Jisong
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 901 - 906
  • [37] Joint user profiling with hierarchical attention networks
    Liu, Xiaojian
    Zhu, Yi
    Wu, Xindong
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (03)
  • [38] Text Classification Using Gated and Transposed Attention Networks
    He, Kang
    Zhu, Min
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [39] Recurrent Attention Networks for Long-text Modeling
    Li, Xianming
    Li, Zongxi
    Luo, Xiaotian
    Xie, Haoran
    Lee, Xing
    Zhao, Yingbin
    Wang, Fu Lee
    Li, Qing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 3006 - 3019
  • [40] A survey on text mining in social networks
    Irfan, Rizwana
    King, Christine K.
    Grages, Daniel
    Ewen, Sam
    Khan, Samee U.
    Madani, Sajjad A.
    Kolodziej, Joanna
    Wang, Lizhe
    Chen, Dan
    Rayes, Ammar
    Tziritas, Nikolaos
    Xu, Cheng-Zhong
    Zomaya, Albert Y.
    Alzahrani, Ahmed Saeed
    Li, Hongxiang
    KNOWLEDGE ENGINEERING REVIEW, 2015, 30 (02): : 157 - 170