'We can't read it all': Theorizing a hermeneutics for large-scale data in the humanities with a case study in stylometry

被引:0
|
作者
Ringler, Hannah [1 ]
机构
[1] Carnegie Mellon Univ, Dept English, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/llc/fqab100
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Computational methods often produce large amounts of data about texts, which create theoretical and practical challenges for textual interpretation. How can we make claims about texts, when we cannot read every text or analyze every piece of data produced? This article draws on rhetorical and literary theories of textual interpretation to develop a hermeneutical theory for gaining insight about texts with large amounts of computational data. It proposes that computational data about texts can be thought of as analytical lenses that make certain textual features salient. Analysts can read texts with these lenses, and argue for interpretations by arguing for how the analyses of many pieces of data support a particular understanding of text(s). By focusing on validating an understanding of the corpus rather than explaining every piece of data, we allow space for close reading by the human reader, focus our contributions on the humanistic insight we can gain from our corpora, and make it possible to glean insight in a way that is feasible for the limited human reader while still having strategies to argue for (or against) certain interpretations. This theory is demonstrated with an analysis of academic writing using stylometry methods, by offering a view of knowledge-making processes in the disciplines through a close analysis of function words.
引用
收藏
页码:1157 / 1171
页数:15
相关论文
共 50 条
  • [1] Intelligent Exploration of Large-Scale Data: What Can We Learn in Two Passes?
    Kamath, Chandrika
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1831 - 1840
  • [2] Can We Bring Culture into the Large-Scale Study of Gentrification? Assessing the Possibilities Using Geodemographic Marketing Data
    Somashekhar, Mahesh
    URBAN AFFAIRS REVIEW, 2021, 57 (05) : 1312 - 1342
  • [3] Towards Large-Scale Meteorological Data Services: A Case Study
    Dimitar Misev
    Peter Baumann
    Jürgen Seib
    Datenbank-Spektrum, 2012, 12 (3) : 183 - 192
  • [4] The Analysis of Large-Scale Climate Data: Jordan Case Study
    Jararweh, Yaser
    Alsmadi, Izzat
    Al-Ayyoub, Mahmoud
    Jenerette, Darrel
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 288 - 294
  • [5] Large-Scale Assessment of Mobile Crowdsensed Data: A Case Study
    Sirocchi, Christel
    Klopfenstein, Lorenz Cuno
    Bogliolo, Alessandro
    IEEE ACCESS, 2022, 10 : 54681 - 54696
  • [6] Full Lifecycle Data Analysis on a Large-scale and Leadership Supercomputer: What Can We Learn from It?
    Yang, Bin
    Wei, Hao
    Zhu, Wenhao
    Zhang, Yuhao
    Liu, Weiguo
    Xue, Wei
    PROCEEDINGS OF THE 2024 USENIX ANNUAL TECHNICAL CONFERENCE, ATC 2024, 2024, : 917 - 933
  • [7] Automated Data Verification in a Large-scale Citizen Science Project: a Case Study
    Yu, Jun
    Kelling, Steve
    Gerbracht, Jeff
    Wong, Weng-Keen
    2012 IEEE 8TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), 2012,
  • [8] Large-scale screening for genes involved in T-cell signaling: do we know all the players now?
    Di Bartolo, V
    Acuto, O
    TRENDS IN IMMUNOLOGY, 2004, 25 (08) : 399 - 402
  • [9] A Case Study of Data Management Challenges Presented in Large-Scale Machine Learning Workflows
    Lee, Claire Songhyun
    Hewes, V.
    Cerati, Giuseppe
    Kowalkowski, Jim
    Aurisano, Adam
    Agrawal, Ankit
    Choudhary, Alok
    Liao, Wei-keng
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 71 - 81
  • [10] CBR Meets Big Data: A Case Study of Large-Scale Adaptation Rule Generation
    Jalali, Vahid
    Leake, David
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, ICCBR 2015, 2015, 9343 : 181 - 196