Analysing Entity Context in Multilingual Wikipedia to Support Entity-Centric Retrieval Applications

Cited by: 2
Authors
Zhou, Yiwei [1]
Demidova, Elena [2, 3]
Cristea, Alexandra I. [1]
Affiliations
[1] Univ Warwick, Dept Comp Sci, Coventry, W Midlands, England
[2] L3S Res Ctr, Hannover, Germany
[3] Leibniz Univ Hannover, Hannover, Germany
Funding
European Research Council
DOI
10.1007/978-3-319-27932-9_17
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The representation of influential entities, such as famous people and multinational corporations, on the Web can vary across languages, reflecting language-specific entity aspects as well as divergent views on these entities in different communities. A systematic analysis of language-specific entity contexts can provide a better overview of the existing aspects and support entity-centric retrieval applications over multilingual Web data. An important source of cross-lingual information about influential entities is Wikipedia, an online community-created encyclopaedia containing more than 280 language editions. In this paper we focus on the extraction and analysis of language-specific entity contexts from different Wikipedia language editions. We discuss alternative ways in which such contexts can be built, including graph-based and article-based contexts. Furthermore, we analyse the similarities and differences between these contexts in a case study covering 80 entities and five Wikipedia language editions.
Pages: 197-208
Number of pages: 12