Exploring Themes and Patterns in American Literature Using Text Analysis Techniques

被引:0
|
作者
Cao Y. [1 ]
机构
[1] School of Humanities and Foreign Languages, Huainan Union University, Anhui, Huainan
关键词
American literature; LDA topic modeling; LSTM; Web crawler;
D O I
10.2478/amns-2024-1390
中图分类号
学科分类号
摘要
This study leverages web crawling techniques within the realm of text mining to collect text data from American literary sources. Given the presence of substantial extraneous information in the data, preprocessing is essential to refine the content. Once processed, the data is fed into a Long Short-Term Memory (LSTM) network to generate text vectors that encapsulate characteristics of American literature. Building on this, the Latent Dirichlet Allocation (LDA) topic model extracts salient features from these vectors. An illustrative analysis of American literature is conducted using these methods. Our findings indicate that a significant portion of novels, which mirror the societal life themes prevalent in the United States, aligned with the ideological currents of their time. Notably, these constituted a substantial majority - 62.18% - of all American literary works during the period under review. This research not only facilitates the wider dissemination and exchange of American literary works but also contributes to elevating American literature to a new echelon of global recognition. © 2024 Yu Cao, published by Sciendo.
引用
收藏
相关论文
共 50 条