Story Trees: Representing Documents using Topological Persistence

Authors
Haghighatkhah, Pantea [1]
Fokkens, Antske [1,2]
Sommerauer, Pia [2]
Speckmann, Bettina [1]
Verbeek, Kevin [1]
Affiliations
[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Eindhoven, Netherlands
[2] Vrije Univ Amsterdam, Computat Linguist & Text Min Lab, Amsterdam, Netherlands
Keywords
Topological Data Analysis; Semantic Vectors; Document level discourse
DOI
Not available
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Topological Data Analysis (TDA) focuses on the inherent shape of (spatial) data. As such, it may provide useful methods to explore spatial representations of linguistic data (embeddings) which have become central in NLP. In this paper we aim to introduce TDA to researchers in language technology. We use TDA to represent document structure as so-called story trees. Story trees are hierarchical representations created from semantic vector representations of sentences via persistent homology. They can be used to identify and clearly visualize prominent components of a story line. We showcase their potential by using story trees to create extractive summaries for news stories.
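The abstract names persistent homology over sentence vectors but this record does not give the construction itself. As a rough illustration only, the sketch below computes 0-dimensional persistence (the scales at which connected components merge) for a small point set using a union-find structure. It is not the authors' story-tree algorithm: the function name `zero_dim_persistence`, the Euclidean metric, and the toy 2-D points standing in for sentence embeddings are all assumptions made for this example.

```python
import math
from itertools import combinations


def zero_dim_persistence(points):
    """Return merge events (scale, i, j) for 0-dimensional persistence.

    Hypothetical helper for illustration. Every point starts as its own
    component at scale 0; a component "dies" when the growing distance
    threshold first connects it to another component.
    """
    parent = list(range(len(points)))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # All pairwise Euclidean distances, processed from smallest to largest.
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(len(points)), 2)
    )

    merges = []
    for scale, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            # Two separate components meet at this scale: record the merge.
            parent[max(root_i, root_j)] = min(root_i, root_j)
            merges.append((scale, i, j))
    return merges


# Toy stand-ins for sentence vectors (assumed data, not from the paper).
toy_vectors = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.5), (20.0, 0.0)]
for scale, i, j in zero_dim_persistence(toy_vectors):
    print(f"points {i} and {j} join at scale {scale:.2f}")
```

Read in order, the merge events form a hierarchy: points that join at small scales are close in the vector space, while a point that joins only at a large scale is isolated. A tree built from such events is one plausible way to picture a hierarchical document representation, under the assumptions stated above.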
Pages: 2413-2429
Page count: 17