Building Support Tools for Russian-Language Information Extraction

Authors
Du, Mian [1]
von Etter, Peter [1]
Kopotev, Mihail [1]
Novikov, Mikhail [1]
Tarbeeva, Natalia [1]
Yangarber, Roman [1]
Affiliations
[1] Univ Helsinki, Dept Comp Sci, FIN-00014 Helsinki, Finland
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
There is currently a paucity of publicly available NLP tools to support analysis of Russian-language text. This especially concerns higher-level applications, such as Information Extraction. We present work on tools for information extraction from text in Russian in the domain of on-line news. On the lower level we employ the AOT toolkit for natural language processing, which provides modules for morphological analysis and partial syntactic chunking. Since the outputs of both lower-level modules contain unresolved ambiguity, we synthesize the outputs and pass the result into a pre-existing English-language analysis pipeline. We describe how the information extraction system is adapted for multi-lingual support, including extensions to the ontologies and to the pattern matching mechanism. While this is work in progress, we present an end-to-end pipeline for event extraction from Russian-language news.
Pages: 380-387
Page count: 8