'We can't read it all': Theorizing a hermeneutics for large-scale data in the humanities with a case study in stylometry

被引:0
|
作者
Ringler, Hannah [1 ]
机构
[1] Carnegie Mellon Univ, Dept English, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/llc/fqab100
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Computational methods often produce large amounts of data about texts, which create theoretical and practical challenges for textual interpretation. How can we make claims about texts, when we cannot read every text or analyze every piece of data produced? This article draws on rhetorical and literary theories of textual interpretation to develop a hermeneutical theory for gaining insight about texts with large amounts of computational data. It proposes that computational data about texts can be thought of as analytical lenses that make certain textual features salient. Analysts can read texts with these lenses, and argue for interpretations by arguing for how the analyses of many pieces of data support a particular understanding of text(s). By focusing on validating an understanding of the corpus rather than explaining every piece of data, we allow space for close reading by the human reader, focus our contributions on the humanistic insight we can gain from our corpora, and make it possible to glean insight in a way that is feasible for the limited human reader while still having strategies to argue for (or against) certain interpretations. This theory is demonstrated with an analysis of academic writing using stylometry methods, by offering a view of knowledge-making processes in the disciplines through a close analysis of function words.
引用
收藏
页码:1157 / 1171
页数:15
相关论文
共 50 条
  • [31] Can We Define the Risk of Lymph Node Metastasis in Early-Stage Cervical Cancer Patients? A Large-Scale, Retrospective Study
    Ferrandina, Gabriella
    Anchora, Luigi Pedone
    Gallotta, Valerio
    Fagotti, Anna
    Vizza, Enrico
    Chiantera, Vito
    De Iaco, Pierandrea
    Ercoli, Alfredo
    Corrado, Giacomo
    Bottoni, Carolina
    Fanfani, Francesco
    Scambia, Giovanni
    ANNALS OF SURGICAL ONCOLOGY, 2017, 24 (08) : 2311 - 2318
  • [32] Uncovering Representation Bias in Large-scale Cellular Phone-based Data: A Case Study in North Carolina
    Jardel, Hanna V.
    Delamater, Paul L.
    GEOGRAPHICAL ANALYSIS, 2024, 56 (04) : 723 - 745
  • [33] Modelling passenger waiting time using large-scale automatic fare collection data: An Australian case study
    Tavassoli, Ahmad
    Mesbah, Mahmoud
    Shobeirinejad, Ameneh
    TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2018, 58 : 500 - 510
  • [34] Calibration of a large-scale groundwater flow model using GRACE data: a case study in the Qaidam Basin, China
    Hu, Litang
    Jiao, Jiu Jimmy
    HYDROGEOLOGY JOURNAL, 2015, 23 (07) : 1305 - 1317
  • [35] Actionable descriptors of spatiotemporal urban dynamics from large-scale mobile data: A case study in Lisbon city
    Silva, Miguel G.
    Madeira, Sara C.
    Henriques, Rui
    ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2024, 51 (08) : 1725 - 1741
  • [36] ON THE ADEQUACY OF LARGE-SCALE MODELS IDENTIFIED WITH INCOMPLETE FIELD DATA - A CASE STUDY WITH TWO LAKE MODELS.
    Varis, Olli
    Kettunen, Juhani
    Leonov, Alexander V.
    Aqua Fennica, 1986, 16 (02): : 157 - 165
  • [37] A large-scale study of the impact of node behavior on loosely coupled data dissemination: The case of the distributed Arctic observatory
    Guegan, Loic
    Rais, Issam
    Anshus, Otto
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2025, 197
  • [39] A Study of the Relationship Between Driving and Health Based on Large-Scale Data Analysis Using PLSA and t-SNE
    Mera, Mitsugu
    Michida, Nanae
    Honda, Masanori
    Sakamoto, Kazuo
    Tamada, Yoshinori
    Mikami, Tatsuya
    Nakaji, Shigeyuki
    IEEE ACCESS, 2024, 12 : 99614 - 99659
  • [40] Can the journal impact factor be used as a criterion for the selection of junior researchers? A large-scale empirical study based on ResearcherID data
    Bornmann, Lutz
    Williams, Richard
    JOURNAL OF INFORMETRICS, 2017, 11 (03) : 788 - 799