Story Trees: Representing Documents using Topological Persistence

被引:0
|
作者
Haghighatkhah, Pantea [1 ]
Fokkens, Antske [1 ,2 ]
Sommerauer, Pia [2 ]
Speckmann, Bettina [1 ]
Verbeek, Kevin [1 ]
机构
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Vrije Univ Amsterdam, Computat Linguist & Text Min Lab, Amsterdam, Netherlands
关键词
Topical Data Analysis; Semantic Vectors; Document level discourse;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Topological Data Analysis (TDA) focuses on the inherent shape of (spatial) data. As such, it may provide useful methods to explore spatial representations of linguistic data (embeddings) which have become central in NLP. In this paper we aim to introduce TDA to researchers in language technology. We use TDA to represent document structure as so-called story trees. Story trees are hierarchical representations created from semantic vector representations of sentences via persistent homology. They can be used to identify and clearly visualize prominent components of a story line. We showcase their potential by using story trees to create extractive summaries for news stories.
引用
收藏
页码:2413 / 2429
页数:17
相关论文
共 50 条
  • [21] Distributed Localization of Coverage Holes Using Topological Persistence
    Chintakunta, Harish
    Krim, Hamid
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (10) : 2531 - 2541
  • [22] Statistical Topological Data Analysis using Persistence Landscapes
    Bubenik, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 77 - 102
  • [23] Representing Music with Prefix Trees
    Han, Yan
    Amin, Nada
    Krishnaswami, Neel
    FARM'19: PROCEEDINGS OF THE 7TH ACM SIGPLAN INTERNATIONAL WORKSHOP ON FUNCTIONAL ART, MUSIC, MODELING, AND DESIGN, 2019, : 83 - 94
  • [24] Representing trees of higher degree
    Benoit, D
    Demaine, ED
    Munro, JI
    Raman, R
    Raman, V
    Rao, SS
    ALGORITHMICA, 2005, 43 (04) : 275 - 292
  • [25] Representing Trees of Higher Degree
    David Benoit
    Erik D. Demaine
    J. Ian Munro
    Rajeev Raman
    Venkatesh Raman
    S. Srinivasa Rao
    Algorithmica, 2005, 43 : 275 - 292
  • [26] Representing trees of higher degree
    Benoit, D
    Demaine, ED
    Munro, JI
    Raman, V
    ALGORITHMS AND DATA STRUCTURES, 1999, 1663 : 169 - 180
  • [27] The Story of Trees
    Sears, William P.
    EDUCATION, 1953, 73 (09): : 582 - 582
  • [28] Representing OCRed documents in HTML']HTML
    Hong, T
    Srihari, SN
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 831 - 834
  • [29] Representing Greece: A story on marble
    Kizis, Costandis
    MATERIA ARQUITECTURA, 2014, (10): : 108 - 112
  • [30] Topological Persistence and Simplification
    Discrete & Computational Geometry, 2002, 28 : 511 - 533