Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ

被引:37
|
作者
Kessler, Jason S. [1 ]
机构
[1] CDK Global, Hoffman Estates, IL 60169 USA
关键词
D O I
10.18653/v1/P17-4015
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scattertext is an open source tool for visualizing linguistic variation between document categories in a language-independent way. The tool presents a scatterplot, where each axis corresponds to the rank-frequency a term occurs in a category of documents. Through a tie-breaking strategy, the tool is able to display thousands of visible term-representing points and find space to legibly label hundreds of them. Scattertext also lends itself to a query-based visualization of how the use of terms with similar embeddings differs between document categories, as well as a visualization for comparing the importance scores of bag-of-words features to univariate metrics.
引用
收藏
页码:85 / 90
页数:6
相关论文
共 50 条
  • [1] Ektron releases browser-based image tool
    不详
    COMPUTER, 2003, 36 (09) : 94 - 94
  • [2] ONZE Miner: the development of a browser-based research tool
    Fromont, Robert
    Hay, Jennifer
    CORPORA, 2008, 3 (02) : 173 - 193
  • [3] A Lightweight Browser-Based Tool for Collaborative and Blinded Image Analysis
    Schippers, Philipp
    Roesch, Gundula
    Sohn, Rebecca
    Holzapfel, Matthias
    Junker, Marius
    Rapp, Anna E.
    Jenei-Lanzl, Zsuzsa
    Drees, Philipp
    Zaucke, Frank
    Meurer, Andrea
    JOURNAL OF IMAGING, 2024, 10 (02)
  • [4] A browser-based tool for visualization and analysis of diffusion MRI data
    Yeatman, Jason D.
    Richie-Halford, Adam
    Smith, Josh K.
    Keshavan, Anisha
    Rokem, Ariel
    NATURE COMMUNICATIONS, 2018, 9
  • [5] Open Meetings as a browser-based teleconferencing tool for EFDA laboratories
    Santos, B.
    Castro, R.
    Santos, J. H.
    Gomes, D.
    Fernandes, H.
    Sousa, J.
    Varandas, C. A. F.
    FUSION ENGINEERING AND DESIGN, 2011, 86 (6-8) : 1282 - 1285
  • [6] Browser-based attacks on Tor
    Abbott, Timothy G.
    Lai, Katherine J.
    Lieberman, Michael R.
    Price, Eric C.
    PRIVACY ENHANCING TECHNOLOGIES, 2007, 4776 : 184 - 199
  • [7] A browser-based tool for visualization and analysis of diffusion MRI data
    Jason D. Yeatman
    Adam Richie-Halford
    Josh K. Smith
    Anisha Keshavan
    Ariel Rokem
    Nature Communications, 9
  • [8] Browser-Based CPU Fingerprinting
    Trampert, Leon
    Rossow, Christian
    Schwarz, Michael
    COMPUTER SECURITY - ESORICS 2022, PT III, 2022, 13556 : 87 - 105
  • [9] Browser model for security analysis of browser-based protocols
    Gross, T
    Pfitzmann, B
    Sadeghi, AR
    COMPUTER SECURITY - ESORICS 2005, PROCEEDINGS, 2005, 3679 : 489 - 508
  • [10] Browser-Based Intrusion Prevention System
    Erete, Ikpeme
    RECENT ADVANCES IN INTRUSION DETECTION, PROCEEDINGS, 2009, 5758 : 371 - 373