Automated Social Text Annotation With Joint Multilabel Attention Networks

被引:15
|
作者
Dong, Hang [1 ,2 ,3 ]
Wang, Wei [2 ]
Huang, Kaizhu [4 ,5 ]
Coenen, Frans [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 7ZX, Merseyside, England
[2] Xian Jiaotong Liverpool Univ, Dept Comp Sci & Software Engn, Suzhou 215123, Peoples R China
[3] Univ Edinburgh, Ctr Med Informat, Usher Inst, Edinburgh EH16 4UX, Midlothian, Scotland
[4] Xian Jiaotong Liverpool Univ, Dept Elect & Elect Engn, Suzhou 215123, Peoples R China
[5] Alibaba Zhejiang Univ Joint Inst Frontier Technol, Hangzhou 310000, Peoples R China
基金
中国国家自然科学基金;
关键词
Attention mechanisms; automated social annotation; deep learning; multilabel classification; recurrent neural networks (RNNs); CLASSIFICATION; QUALITY;
D O I
10.1109/TNNLS.2020.3002798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated social text annotation is the task of suggesting a set of tags for shared documents on social media platforms. The automated annotation process can reduce users' cognitive overhead in tagging and improve tag management for better search, browsing, and recommendation of documents. It can be formulated as a multilabel classification problem. We propose a novel deep learning-based method for this problem and design an attention-based neural network with semantic-based regularization, which can mimic users' reading and annotation behavior to formulate better document representation, leveraging the semantic relations among labels. The network separately models the title and the content of each document and injects an explicit, title-guided attention mechanism into each sentence. To exploit the correlation among labels, we propose two semantic-based loss regularizers, i.e., similarity and subsumption, which enforce the output of the network to conform to label semantics. The model with the semantic-based loss regularizers is referred to as the joint multilabel attention network (JMAN). We conducted a comprehensive evaluation study and compared JMAN to the state-of-the-art baseline models, using four large, real-world social media data sets. In terms of F-1, JMAN significantly outperformed bidirectional gated recurrent unit (Bi-GRU) relatively by around 12.8%-78.6% and the hierarchical attention network (HAN) by around 3.9%-23.8%. The JMAN model demonstrates advantages in convergence and training speed. Further improvement of performance was observed against latent Dirichlet allocation (LDA) and support vector machine (SVM). When applying the semantic-based loss regularizers, the performance of HAN and Bi-GRU in terms of F-1 was also boosted. It is also found that dynamic update of the label semantic matrices (JMAN(d)) has the potential to further improve the performance of JMAN but at the cost of substantial memory and warrants further study.
引用
收藏
页码:2224 / 2238
页数:15
相关论文
共 50 条
  • [1] Joint Multi-Label Attention Networks for Social Text Annotation
    Dong, Hang
    Wang, Wei
    Huang, Kaizhu
    Coenen, Frans
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1348 - 1354
  • [2] Multilabel Remote Sensing Image Annotation With Multiscale Attention and Label Correlation
    Huang, Rui
    Zheng, Fengcai
    Huang, Wei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 6951 - 6961
  • [3] Automated Text Psychodiagnostics and the Problem of Monitoring Social Networks
    Yu. M. Kuznetsova
    N. V. Chudova
    A. A. Chuganskaya
    Pattern Recognition and Image Analysis, 2023, 33 : 383 - 388
  • [4] Automated Text Psychodiagnostics and the Problem of Monitoring Social Networks
    Kuznetsova, Yu. M.
    Chudova, N. V.
    Chuganskaya, A. A.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 383 - 388
  • [5] Infant joint attention, neural networks and social cognition
    Mundy, Peter
    Jarrold, William
    NEURAL NETWORKS, 2010, 23 (8-9) : 985 - 997
  • [6] A Multilabel Learning-Based Automatic Annotation Method for Semantic Roles in English Text
    Lei, Li
    Wang, Hao
    IEEE ACCESS, 2023, 11 : 106220 - 106231
  • [7] Multilabel neural networks with applications to functional genomics and text categorization
    Zhang, Min-Ling
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (10) : 1338 - 1351
  • [8] Improving Multilabel Text Classification with Stacking and Recurrent Neural Networks
    Mansueli, Rodrigo
    Domingues, Marcos Aurelio
    Feltrim, Valeria Delisandra
    PROCEEDINGS OF THE 28TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, WEBMEDIA 2022, 2022, : 117 - 122
  • [9] KDTA: Automated Knowledge-Driven Text Annotation
    Papantoniou, Katerina
    Tsatsaronis, George
    Paliouras, Georgios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 611 - 614
  • [10] Automated Discovery of Social Networks in Text-Based Online Communities
    Gruzd, Anatoliy
    GROUP 2009 PROCEEDINGS, 2009, : 379 - 380