Multilingual and cross-domain temporal tagging

被引:96
|
作者
Stroetgen, Jannik [1 ]
Gertz, Michael [1 ]
机构
[1] Heidelberg Univ, Inst Comp Sci, Heidelberg, Germany
关键词
Temporal information; Temporal tagger; Named entity recognition; Named entity normalization; TIMEX2; TIMEX3;
D O I
10.1007/s10579-012-9179-y
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Extraction and normalization of temporal expressions from documents are important steps towards deep text understanding and a prerequisite for many NLP tasks such as information extraction, question answering, and document summarization. There are different ways to express (the same) temporal information in documents. However, after identifying temporal expressions, they can be normalized according to some standard format. This allows the usage of temporal information in a term- and language-independent way. In this paper, we describe the challenges of temporal tagging in different domains, give an overview of existing annotated corpora, and survey existing approaches for temporal tagging. Finally, we present our publicly available temporal tagger HeidelTime, which is easily extensible to further languages due to its strict separation of source code and language resources like patterns and rules. We present a broad evaluation on multiple languages and domains on existing corpora as well as on a newly created corpus for a language/domain combination for which no annotated corpus has been available so far.
引用
收藏
页码:269 / 298
页数:30
相关论文
共 50 条
  • [1] Multilingual and cross-domain temporal tagging
    Jannik Strötgen
    Michael Gertz
    [J]. Language Resources and Evaluation, 2013, 47 : 269 - 298
  • [2] Cross-Domain Dialogue Act Tagging
    Webb, Nick
    Liu, Ting
    Hepple, Mark
    Wilks, Yorick
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1974 - 1981
  • [3] DBpedia: A Multilingual Cross-Domain Knowledge Base
    Mendes, Pablo N.
    Jakob, Max
    Bizer, Christian
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1813 - 1817
  • [4] Misogyny Detection in Twitter: a Multilingual and Cross-Domain Study
    Pamungkas, Endang Wahyu
    Basile, Valerio
    Patti, Viviana
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [5] Cross-domain Recommendation with Semantic Correlation in Tagging Systems
    Zhang, Qian
    Hao, Peng
    Lu, Jie
    Zhang, Guangquan
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [6] Cross-Domain and Cross-Category Emotion Tagging for Comments of Online News
    Zhang, Ying
    Zhang, Ning
    Si, Luo
    Lu, Yanshan
    Wang, Qifan
    Yuan, Xiaojie
    [J]. SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 627 - 636
  • [7] Personalizing Health and Food Advices by Semantic Enrichment of Multilingual Cross-Domain Questions
    Al-Nazer, Ahmed
    Helmy, Tarek
    [J]. 2015 IEEE 8TH GCC CONFERENCE AND EXHIBITION (GCCCE), 2015,
  • [8] Geo-location driven image tagging via cross-domain learning
    Nie, Weizhi
    Liu, Anan
    Wang, Zhongyang
    Su, Yuting
    [J]. MULTIMEDIA SYSTEMS, 2016, 22 (04) : 395 - 404
  • [9] Geo-location driven image tagging via cross-domain learning
    Weizhi Nie
    Anan Liu
    Zhongyang Wang
    Yuting Su
    [J]. Multimedia Systems, 2016, 22 : 395 - 404
  • [10] Cross-Domain NER using Cross-Domain Language Modeling
    Jia, Chen
    Liang, Xiaobo
    Zhang, Yue
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2464 - 2474