Computational text analysis within the Humanities: How to combine working practices from the contributing fields?

Cited by: 13
Author
Kuhn, Jonas [1]
Affiliation
[1] Univ Stuttgart, Inst Nat Language Proc IMS, Stuttgart, Germany
Keywords
Digital humanities; Adaptation of NLP tool chains; Interdisciplinary working practice; HISTORY
DOI
10.1007/s10579-019-09459-3
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This position paper is based on a keynote presentation at the COLING 2016 Workshop on Language Technology for Digital Humanities in Osaka, Japan. It starts from observations about working practices in Humanities disciplines following a hermeneutic tradition of text interpretation versus the method-oriented research strategies in Computational Linguistics (CL). The respective praxeological traditions are quite different. Yet more and more researchers are willing to open up towards truly transdisciplinary collaborations, trying to exploit advanced methods from CL within research that ultimately addresses questions from the traditional Humanities disciplines and the Social Sciences. The article identifies two central workflow-related issues for this type of collaborative project in the Digital Humanities (DH) and Computational Social Science: (1) a scheduling dilemma, which affects the point in the course of the project when specifications of the core analysis task are fixed (as early as possible from the computational perspective, but as late as possible from the Humanities perspective) and (2) the subjectivity problem, which concerns the degree of intersubjective stability of the target categories of analysis. CL methodology demands high inter-annotator agreement and theory-independent categories, while the categories in hermeneutic reasoning are often tied to a particular interpretive approach (viz. a theory of literary interpretation) and may bear a non-trivial relation to a reader's pre-understanding. Building a comprehensive methodological framework that helps overcome these issues requires considerable time and patience. The established computational methodology has to be gradually opened up to more hermeneutically oriented research questions; resources and tools for the relevant categories of analysis have to be constructed. This article does not call into question that well-targeted efforts along this path are worthwhile.
Yet, it makes the following additional programmatic point regarding directions for future research: It might be fruitful to explore, in parallel, the potential lying in DH-specific variants of the concept of rapid prototyping from Software Engineering. To get an idea of how computational analysis of some aspect of text might contribute to a hermeneutic research question, a prototypical analysis model is constructed, e.g., from related data collections and analysis categories, using transfer techniques. While the initial quality of analysis may be limited, the idea of rapid probing allows scholars to explore how the analysis fits in an actual workflow on the target text data and it can thus provide early feedback for the process of refining the modeling. If the rapid probing method can indeed be incorporated in a hermeneutic framework to the satisfaction of well-disposed Humanities scholars, a swifter exploration of alternative paths of analysis would become possible. This may generate considerable additional momentum for transdisciplinary integration. It is as yet too early to point to truly Humanities-oriented examples of the proposed rapid probing technique. To nevertheless make the programmatic idea more concrete, the article uses two experimental scenarios to argue how rapid probing might help address the scheduling dilemma and the subjectivity problem, respectively. The first scenario illustrates the transfer of complex analysis pipelines across corpora; the second one addresses rapid annotation experiments targeting character mentions in literary text.
Pages: 565-602
Number of pages: 38
Source
Language Resources and Evaluation, 2019, 53: 565-602