'We can't read it all': Theorizing a hermeneutics for large-scale data in the humanities with a case study in stylometry

被引:0
|
作者
Ringler, Hannah [1 ]
机构
[1] Carnegie Mellon Univ, Dept English, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/llc/fqab100
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Computational methods often produce large amounts of data about texts, which create theoretical and practical challenges for textual interpretation. How can we make claims about texts, when we cannot read every text or analyze every piece of data produced? This article draws on rhetorical and literary theories of textual interpretation to develop a hermeneutical theory for gaining insight about texts with large amounts of computational data. It proposes that computational data about texts can be thought of as analytical lenses that make certain textual features salient. Analysts can read texts with these lenses, and argue for interpretations by arguing for how the analyses of many pieces of data support a particular understanding of text(s). By focusing on validating an understanding of the corpus rather than explaining every piece of data, we allow space for close reading by the human reader, focus our contributions on the humanistic insight we can gain from our corpora, and make it possible to glean insight in a way that is feasible for the limited human reader while still having strategies to argue for (or against) certain interpretations. This theory is demonstrated with an analysis of academic writing using stylometry methods, by offering a view of knowledge-making processes in the disciplines through a close analysis of function words.
引用
收藏
页码:1157 / 1171
页数:15
相关论文
共 50 条
  • [21] A Taxi Zoning Analysis Using Large-Scale Probe Data: A Case Study for Metropolitan Bangkok
    Peungnumsai, Apantri
    Witayangkurn, Apichon
    Nagai, Masahiko
    Miyazaki, Hiroyuki
    REVIEW OF SOCIONETWORK STRATEGIES, 2018, 12 (01): : 21 - 45
  • [22] Soil Moisture Data Assimilation in a Hydrological Model: A Case Study in Belgium Using Large-Scale Satellite Data
    Baguis, Pierre
    Roulin, Emmanuel
    REMOTE SENSING, 2017, 9 (08)
  • [23] "Why can't we all get along?" A conceptual analysis and case study of contentious energy problems
    Felder, Frank A.
    ENERGY POLICY, 2016, 96 : 711 - 716
  • [24] Assimilation of satellite data to optimize large-scale hydrological model parameters: a case study for the SWOT mission
    Pedinotti, V.
    Boone, A.
    Ricci, S.
    Biancamaria, S.
    Mognard, N.
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2014, 18 (11) : 4485 - 4507
  • [25] Large-Scale Estimation in Cyberphysical Systems Using Streaming Data: A Case Study With Arterial Traffic Estimation
    Hunter, Timothy
    Das, Tathagata
    Zaharia, Matei
    Abbeel, Pieter
    Bayen, Alexandre M.
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2013, 10 (04) : 884 - 898
  • [26] Procedure for managing large-scale progeny test data:: A case study of Scots pine in Finland.
    Venäläinen, M
    Ruotsalainen, S
    SILVA FENNICA, 2002, 36 (02) : 475 - 487
  • [27] Empirical Study of Feature Selection Methods in Regression for Large-Scale Healthcare Data: A Case Study on Estimating Dental Expenditures
    Mayya, Veena
    King, Christian
    Vu, Giang T.
    Gurupur, Varadraj
    IEEE ACCESS, 2024, 12 : 153564 - 153579
  • [28] Can fisheries yield be enhanced by large-scale feeding of a predatory fish stock?: A case study of the Icelandic cod stock
    Björnsson, B
    CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2001, 58 (10) : 2091 - 2104
  • [29] Can We Define the Risk of Lymph Node Metastasis in Early-Stage Cervical Cancer Patients? A Large-Scale, Retrospective Study
    Gabriella Ferrandina
    Luigi Pedone Anchora
    Valerio Gallotta
    Anna Fagotti
    Enrico Vizza
    Vito Chiantera
    Pierandrea De Iaco
    Alfredo Ercoli
    Giacomo Corrado
    Carolina Bottoni
    Francesco Fanfani
    Giovanni Scambia
    Annals of Surgical Oncology, 2017, 24 : 2311 - 2318
  • [30] Calibrating the building energy model with the short term monitored data A case study of a large-scale residential building
    Tuysuz, Fatih
    Sozer, Hatice
    ENERGY AND BUILDINGS, 2020, 224