Digital Approaches to Text Reuse in the Early Chinese Corpus

Cited by: 8

Authors:

Sturgeon, Donald [1]

Affiliations:

[1] Harvard Univ, Dept East Asian Languages & Civilizat, Cambridge, MA 02138 USA

Source:

JOURNAL OF CHINESE LITERATURE AND CULTURE | 2018 / Vol. 5 / Issue 02

Keywords:

text reuse; citation; quotation; similarity; classical Chinese;

DOI:

10.1215/23290048-7256963

Chinese Library Classification (CLC) number:

C [Social Sciences, General];

Discipline classification code:

03 ; 0303 ;

Abstract:

Observed textual similarities between different pieces of writing are frequently cited by textual scholars as grounds for interpretative stances about the meaning of a passage and its authorship, authenticity, and accuracy. Historically, identifying occurrences of such similarities has been a matter of extensive knowledge and recall of the content and locations of passages contained within certain texts, together with painstaking manual comparison by examining printed copies, use of concordances, or more recently, appropriate use of full-text searchable database systems. The development of increasingly comprehensive and accurate digital corpora of early Chinese transmitted writing raises many opportunities to study these phenomena using more systematic digital techniques. These offer the promise of not only vast savings in time and labor but also new insights made possible only through exhaustive comparisons of types that would be entirely impractical without the use of computational methods. This article investigates and contrasts unsupervised techniques for the identification of textual similarities in premodern Chinese works in general, and the classical corpus in particular, taking the text of the Mozi as a concrete example. While specific examples are presented in detail to concretely demonstrate the utility and potential of the techniques discussed, all of the methods described are generally applicable to a wide range of materials. With this in mind, this article also introduces an open-access platform designed to help researchers quickly and easily explore these phenomena within those materials most relevant to their own work.

Pages: 186-213

Page count: 28

