new/sleak - Information Extraction and Visualization for Investigative Data Journalists

被引:0
|
作者
Yimam, Seid Muhie [1 ]
Ulrich, Heiner [2 ]
von Landesberger, Tatiana [3 ]
Rosenbach, Marcel [2 ]
Regneri, Michaela [2 ]
Panchenko, Alexander [1 ]
Lehmann, Franziska [3 ]
Fahrer, Uli [1 ]
Biemann, Chris [1 ]
Ballweg, Kathrin [3 ]
机构
[1] Tech Univ Darmstadt, Comp Sci Dept, FG Language Technol, Darmstadt, Germany
[2] Tech Univ Darmstadt, Comp Sci Dept, Graph Interact Syst Grp, Darmstadt, Germany
[3] SPIEGEL Verlag, Hamburg, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present new/s/leak, a novel tool developed for and with the help of journalists, which enables the automatic analysis and discovery of newsworthy stories from large textual datasets. We rely on different NLP preprocessing steps such named entity tagging, extraction of time expressions, entity networks, relations and metadata. The system features an intuitive web-based user interface based on network visualization combined with data exploring methods and various search and faceting mechanisms. We report the current state of the software and exemplify it with the WikiLeaks PlusD (Cablegate) data.
引用
收藏
页码:163 / 168
页数:6
相关论文
共 50 条
  • [1] The Need to Help Journalists with Data and Information Visualization
    Reilly, Susan
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2017, 37 (02) : 8 - 10
  • [2] New/s/leak 2.0-Multilingual Information Extraction and Visualization for Investigative Journalism
    Wiedemann, Gregor
    Yimam, Seid Muhie
    Biemann, Chris
    [J]. SOCIAL INFORMATICS (SOCINFO 2018), PT II, 2018, 11186 : 313 - 322
  • [3] Information Extraction and Visualization of Unstructured Textual Data
    Hashmi, Syed Usama
    Bansal, Ajay
    [J]. 2019 13TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2019, : 142 - 145
  • [5] netflower: Dynamic Network Visualization for Data Journalists
    Stoiber, C.
    Rind, A.
    Grassinger, F.
    Gutounig, R.
    Goldgruber, E.
    Sedlmair, M.
    Emrich, S.
    Aigner, W.
    [J]. COMPUTER GRAPHICS FORUM, 2019, 38 (03) : 699 - 711
  • [6] A Multilingual Information Extraction Pipeline for Investigative Journalism
    Wiedemann, Gregor
    Yimam, Seid Muhie
    Biemann, Chris
    [J]. CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2018, : 78 - 83
  • [7] Multi-Source Pandemic Data Visualization and Synchronization for Information Extraction
    Zhang, Qi
    Brokaw, James
    [J]. 2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 140 - 146
  • [8] Database integration for investigative data visualization with the temporal analysis system
    Barth, SW
    [J]. COMMAND, CONTROL, COMMUNICATIONS, AND INTELLIGENCE SYSTEMS FOR LAW ENFORCEMENT, 1997, 2938 : 8 - 15
  • [9] Framework of a Collaborative Audio Analysis and Visualization tool for Data Journalists
    Sidiropoulos, Efstathios A.
    Konstantinidis, Evdokimos I.
    Veglis, Andreas A.
    [J]. 2016 11TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP), 2016, : 156 - 160
  • [10] Metabrain: Web Information Extraction and Visualization
    Teixeira, Joao
    Barata, Gabriel
    Goncalves, Daniel
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKING CONFERENCE ON ADVANCED VISUAL INTERFACES, 2012, : 534 - 537