Diachronic Analysis of Time References in News Articles

被引:0
|
作者
Jatowt, Adam [1 ]
Doucet, Antoine [2 ]
Campos, Ricardo [3 ]
机构
[1] Univ Innsbruck, Dept Comp Sci & DiSC, Innsbruck, Austria
[2] Univ La Rochelle, La Rochelle, France
[3] Ci2 Polytech Inst Tomar, LIAAD, INESCTEC, Tomar, Portugal
关键词
temporal expressions; news archives; temporal IR;
D O I
10.1145/3487553.3524671
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time expressions embedded in text are important for many down-stream tasks in NLP and IR. They have been, for example, utilized for timeline summarization, named entity recognition, temporal information retrieval, question answering and others. In this paper, we introduce a novel analytical approach to analyzing characteristics of time expressions in diachronic text collections. Based on a collection of news articles published over a 33-years' long time span, we investigate several aspects of time expressions with a focus on their interplay with publication dates of containing documents. We utilize a graph-based representation of temporal expressions to represent them through their co-occurring named entities. The proposed approach results in several observations that could be utilized in automatic systems that rely on processing temporal signals embedded in text. It could be also of importance for professionals (e.g., historians) who wish to understand fluctuations in collective memories and collective expectations based on large-scale, diachronic document collections.
引用
收藏
页码:918 / 923
页数:6
相关论文
共 50 条