Curatr: A Platform for Semantic Analysis and Curation of Historical Literary Texts

被引:4
|
作者
Leavy, Susan [1 ]
Meaney, Gerardine [1 ]
Wade, Karen [1 ]
Greene, Derek [1 ]
机构
[1] Univ Coll Dublin, Dublin, Ireland
来源
基金
爱尔兰科学基金会;
关键词
Text mining; Digital humanities; Corpus curation;
D O I
10.1007/978-3-030-36599-8_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing availability of digital collections of historical and contemporary literature presents a wealth of possibilities for new research in the humanities. The scale and diversity of such collections however, presents particular challenges in identifying and extracting relevant content. This paper presents Curatr, an online platform for the exploration and curation of literature with machine learning-supported semantic search, designed within the context of digital humanities scholarship. The platform provides a text mining workflow that combines neural word embeddings with expert domain knowledge to enable the generation of thematic lexicons, allowing researches to curate relevant sub-corpora from a large corpus of 18th and 19th century digitised texts.
引用
收藏
页码:354 / 366
页数:13
相关论文
共 50 条
  • [21] Texts and interpretation: Introduction to literary analysis
    Diez Lloris, Irene
    [J]. CASTILLA-ESTUDIOS DE LITERATURA, 2014, 5 : CXXII - CXXIV
  • [22] Texts and interpretation: introduction to literary analysis
    Camarero, Jesus
    [J]. MONTEAGUDO, 2018, (23): : 275 - 278
  • [23] Readability Analysis of Bengali Literary Texts
    Phani, Shanta
    Lahiri, Shibamouli
    Biswas, Arindam
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2019, 26 (04) : 287 - 305
  • [24] Problems in the linguistic analysis of literary texts
    Hundsnurscher, F
    [J]. MUTUAL EXCHANGES: SHEFFIELD-MUNSTER COLLOQUIUM I, 1999, : 38 - 50
  • [25] The Analysis of literary texts: a complete methodology
    Bellos, David
    [J]. FRENCH STUDIES, 2016, 70 (02) : 305 - 305
  • [26] Textuality assumptions and the analysis of literary texts
    Winko, Simone
    [J]. ZEITSCHRIFT FUR GERMANISTISCHE LINGUISTIK, 2008, 36 (03): : 427 - 443
  • [27] Stylometry Analysis of Literary Texts in Polish
    Walkowiak, Tomasz
    Piasecki, Maciej
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2018), PT II, 2018, 10842 : 777 - 787
  • [28] Interactive semantic analysis of technical texts
    Delisle, S
    Barker, K
    Copek, T
    Szpakowicz, S
    [J]. COMPUTATIONAL INTELLIGENCE, 1996, 12 (02) : 273 - 306
  • [29] Semantic analysis of medical free texts
    Romacker, M
    Hahn, U
    Schulz, S
    Klar, R
    [J]. MEDICAL INFOBAHN FOR EUROPE, PROCEEDINGS, 2000, 77 : 438 - 442
  • [30] The Coding of Literary Form Data mining and the information structure of historical texts
    Liddle, Dallas
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1661 - 1666