'We can't read it all': Theorizing a hermeneutics for large-scale data in the humanities with a case study in stylometry

被引:0
|
作者
Ringler, Hannah [1 ]
机构
[1] Carnegie Mellon Univ, Dept English, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/llc/fqab100
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Computational methods often produce large amounts of data about texts, which create theoretical and practical challenges for textual interpretation. How can we make claims about texts, when we cannot read every text or analyze every piece of data produced? This article draws on rhetorical and literary theories of textual interpretation to develop a hermeneutical theory for gaining insight about texts with large amounts of computational data. It proposes that computational data about texts can be thought of as analytical lenses that make certain textual features salient. Analysts can read texts with these lenses, and argue for interpretations by arguing for how the analyses of many pieces of data support a particular understanding of text(s). By focusing on validating an understanding of the corpus rather than explaining every piece of data, we allow space for close reading by the human reader, focus our contributions on the humanistic insight we can gain from our corpora, and make it possible to glean insight in a way that is feasible for the limited human reader while still having strategies to argue for (or against) certain interpretations. This theory is demonstrated with an analysis of academic writing using stylometry methods, by offering a view of knowledge-making processes in the disciplines through a close analysis of function words.
引用
收藏
页码:1157 / 1171
页数:15
相关论文
共 50 条
  • [41] Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data
    Becquet, Celine
    Blachon, Sylvain
    Jeudy, Baptiste
    Boulicaut, Jean-Francois
    Gandrillon, Olivier
    GENOME BIOLOGY, 2002, 3 (12):
  • [42] Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data
    Céline Becquet
    Sylvain Blachon
    Baptiste Jeudy
    Jean-Francois Boulicaut
    Olivier Gandrillon
    Genome Biology, 3 (12)
  • [44] Large-scale flood modelling based on LiDAR data: a case study in the Southwest Miramichi watershed, New Brunswick, Canada
    Chone, Guenole
    Mazgareanu, Iulia
    Biron, Pascale M.
    Buffin-Belanger, Thomas
    Larouche-Tremblay, Francois
    Perry, Brian
    Fortin, Maxim
    CANADIAN WATER RESOURCES JOURNAL, 2024, 49 (04) : 460 - 478
  • [45] A case study of text mining tweet data: - How to correct information from the affected area in a large-scale disaster -
    Sakamaki, Yoshikazu
    Kamei, Etsuko
    Journal of Japan Industrial Management Association, 2014, 65 (01) : 39 - 50
  • [46] Using Smart Mobile Devices for Collecting Structured Data in Clinical Trials: Results From a Large-Scale Case Study
    Schobel, Johannes
    Pryss, Ruediger
    Reichert, Manfred
    2015 IEEE 28TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2015, : 13 - 18
  • [47] Using Sensor Data to Dynamically Map Large-Scale Models to Site-Scale Forecasts: A Case Study Using the National Water Model
    Fries, Kevin J.
    Kerkez, Branko
    WATER RESOURCES RESEARCH, 2018, 54 (08) : 5636 - 5653
  • [48] All-Cause Hospitalizations after Large-Scale Hurricanes among Older Adults: A Self-Controlled Case Series Study
    Bell, Sue Anne
    Iwashyna, Theodore J.
    Zhang, Xingyu
    Chen, Bingxin
    Davis, Matthew A.
    PREHOSPITAL AND DISASTER MEDICINE, 2021, 36 (01) : 25 - 31
  • [49] What can we learn about health and healthcare from large-scale, routinely-acquired data in the United Kingdom: studies in fragile X syndrome
    McKechanie, A. G.
    Stanfield, A. C.
    Fisher, L.
    Morgan, C. L. I.
    Cooper, A.
    Conway, P.
    JOURNAL OF INTELLECTUAL DISABILITY RESEARCH, 2024, 68 (09) : 1020 - 1021
  • [50] Can we develop real-world prognostic models using observational healthcare data? Large-scale experiment to investigate model sensitivity to database and phenotypes
    Jenna M. Reps
    Peter R. Rijnbeek
    Patrick B. Ryan
    Diagnostic and Prognostic Research, 9 (1)