Extraction of temporal information from social media messages using the BERT model

被引:18
|
作者
Ma, Kai [1 ]
Tan, Yongjian [1 ]
Tian, Miao [1 ]
Xie, Xuejing [2 ]
Qiu, Qinjun [2 ,3 ,4 ]
Li, Sanfeng [5 ]
Wang, Xin [6 ]
机构
[1] China Three Gorges Univ, Coll Comp & Informat Technol, Yichang 443002, Peoples R China
[2] Natl Engn Res Ctr Geog Informat Syst, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Sch Geog & Informat Engn, Wuhan 430074, Peoples R China
[4] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430074, Peoples R China
[5] Wuhan Zondy Cyber Sci & Technol Co Ltd, Wuhan, Peoples R China
[6] Jinan Rail Transit Grp Co Ltd, Jinan, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Temporal information extraction; Temporal expression recognition; BERT; Natural language processing; SYSTEM;
D O I
10.1007/s12145-021-00756-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Temporal information extraction from social media messages is of critical importance to several geographical applications. Combined with the characteristics of temporal information descriptions in Chinese text, different time expression patterns formed by time unit combinations are summarized. A deep learning-based information extraction algorithm (named BERT-BiLSTM-CRF) for automatically extracting temporal information from social media messages is proposed. Based on the bidirectional long short-term memory-conditional random field (BiLSTM-CRF) model, the BERT (bidirectional encoder representations from transformers) pretrained language model was used to enhance the generalization ability of the word vector model to capture long-range contextual information; then, the trained word vector was input into the BiLSTM-CRF model for further training. The proposed model was then evaluated on the constructed corpus, a set of manually annotated Chinese texts from social media messages. Among the basic models, the BERT-BiLSTM-CRF achieved the highest average F1-score of 85%. The experimental results show that the proposed method outperforms the current state-of-the-art models.
引用
收藏
页码:573 / 584
页数:12
相关论文
共 50 条
  • [1] Extraction of temporal information from social media messages using the BERT model
    Kai Ma
    Yongjian Tan
    Miao Tian
    Xuejing Xie
    Qinjun Qiu
    Sanfeng Li
    Xin Wang
    [J]. Earth Science Informatics, 2022, 15 : 573 - 584
  • [2] Topic-BERT: Detecting harmful information from social media
    Gao, Wang
    Deng, Hongtao
    Zhu, Xun
    Fang, Yuan
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (03): : 333 - 342
  • [3] Chinese toponym recognition with variant neural structures from social media messages based on BERT methods
    Kai Ma
    YongJian Tan
    Zhong Xie
    Qinjun Qiu
    Siqiong Chen
    [J]. Journal of Geographical Systems, 2022, 24 : 143 - 169
  • [4] Chinese toponym recognition with variant neural structures from social media messages based on BERT methods
    Ma, Kai
    Tan, YongJian
    Xie, Zhong
    Qiu, Qinjun
    Chen, Siqiong
    [J]. JOURNAL OF GEOGRAPHICAL SYSTEMS, 2022, 24 (02) : 143 - 169
  • [5] Social Media Information Classification of Earthquake Disasters Based on BERT Transfer Learning Model
    Lin, Sen
    Liu, Beibei
    Li, Jianwen
    Liu, Xu
    Qin, Kun
    Guo, Guizhen
    [J]. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (09): : 1661 - 1671
  • [6] Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media
    Rottger, Paul
    Pierrehumbert, Janet B.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 2400 - 2412
  • [7] Personality Identification from Social Media Using Ensemble BERT and RoBERTa
    Tsani E.F.
    Suhartono D.
    [J]. Informatica (Slovenia), 2023, 47 (04): : 537 - 544
  • [8] Extracting temporal information from short messages
    Cooper, Richard
    Manson, Sinclair
    [J]. DATA MANAGEMENT: DATA, DATA EVERYWHERE, PROCEEDINGS, 2007, 4587 : 224 - +
  • [9] Information Extraction from Social Media: Clustering and Labelling Microblogs
    Hemavathi, D.
    Kavitha, M.
    Ahmed, Narjiya Begum
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON IOT AND ITS APPLICATIONS (IEEE ICIOT), 2017,
  • [10] Spatial Information Extraction from Short Messages
    Zenasni, Sarah
    Kergosien, Eric
    Roche, Mathieu
    Teisseire, Maguelonne
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 95 : 351 - 367