TIARA: Interactive, Topic-Based Visual Text Summarization and Analysis

被引:79
|
作者
Liu, Shixia
Zhou, Michelle X. [1 ]
Pan, Shimei [1 ]
Song, Yangqiu
Qian, Weihong [1 ]
Cai, Weijia [1 ]
Lian, Xiaoxiao [1 ]
机构
[1] IBM Res, Yorktown Hts, NY USA
关键词
Design; Human Factors; Text analytics; interactive text visualization; stacked graph; text trend chart; text summarization; topic model; VISUALIZATION;
D O I
10.1145/2089094.2089101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We are building an interactive visual text analysis tool that aids users in analyzing large collections of text. Unlike existing work in visual text analytics, which focuses either on developing sophisticated text analytic techniques or inventing novel text visualization metaphors, ours tightly integrates state-of-the-art text analytics with interactive visualization to maximize the value of both. In this article, we present our work from two aspects. We first introduce an enhanced, LDA-based topic analysis technique that automatically derives a set of topics to summarize a collection of documents and their content evolution over time. To help users understand the complex summarization results produced by our topic analysis technique, we then present the design and development of a time-based visualization of the results. Furthermore, we provide users with a set of rich interaction tools that help them further interpret the visualized results in context and examine the text collection from multiple perspectives. As a result, our work offers three unique contributions. First, we present an enhanced topic modeling technique to provide users with a time-sensitive and more meaningful text summary. Second, we develop an effective visual metaphor to transform abstract and often complex text summarization results into a comprehensible visual representation. Third, we offer users flexible visual interaction tools as alternatives to compensate for the deficiencies of current text summarization techniques. We have applied our work to a number of text corpora and our evaluation shows promise, especially in support of complex text analyses.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Topic-based automatic summarization algorithm for Chinese short text
    Ma, Tinghuai
    Wang, Hongmei
    Zhao, Yuwei
    Tian, Yuan
    Al-Nabhan, Najla
    [J]. Mathematical Biosciences and Engineering, 2020, 17 (04): : 3582 - 3600
  • [2] TASP : Topic-based abstractive summarization of Facebook text posts
    Benedetto, Irene
    La Quatra, Moreno
    Cagliero, Luca
    Vassio, Luca
    Trevisan, Martino
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [3] Topic-based automatic summarization algorithm for Chinese short text
    Ma, Tinghuai
    Wang, Hongmei
    Zhao, Yuwei
    Tian, Yuan
    Al-Nabhan, Najla
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (04) : 3582 - 3600
  • [4] Topic-based web site summarization
    Zhang, Yongzheng
    Milios, Evangelos
    Zincir-Heywood, Nur
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2010, 6 (04) : 266 - +
  • [5] Text summarization using topic-based vector space model and semantic measure
    Belwal, Ramesh Chandra
    Rai, Sawan
    Gupta, Atul
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (03)
  • [6] CATS: Customizable Abstractive Topic-based Summarization
    Bahrainian, Seyed Ali
    Zerveas, George
    Crestani, Fabio
    Eickhoff, Carsten
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (01)
  • [7] An online multi-source summarization algorithm for text readability in topic-based search
    Curiel, Arturo
    Gutierrez-Soto, Claudio
    Rojano-Caceres, Jose-Rafael
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [8] A method for the automatic summarization of topic-based clusters of documents
    Pons-Porrata, A
    Ruiz-Shulcloper, J
    Berlanga-Llavori, R
    [J]. PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 596 - 603
  • [9] ColTop: Visual Topic-based Analysis of Scientific Community Structure
    Abdelaal, Moataz
    Heimerl, Florian
    Koch, Steffen
    [J]. 2017 INTERNATIONAL SYMPOSIUM ON BIG DATA VISUAL ANALYTICS (BDVA), 2017, : 55 - 62
  • [10] Topic-based Coordination for Visual Analysis of Evolving Document Collections
    Eler, Danilo Medeiros
    Paulovich, Fernando Vieira
    Ferreira de Oliveira, Maria Cristina
    Minghim, Rosane
    [J]. INFORMATION VISUALIZATION, IV 2009, PROCEEDINGS, 2009, : 149 - 155