Computational Thematic Analysis of Poetry via Bimodal Large Language Models

Cited by: 1
Authors
Choi, K. [1]
Affiliation
[1] Indiana University, United States
Keywords
auxiliary data; computational poetry analysis; context-dependent language model; digital libraries; multilabel classification
DOI
10.1002/pra2.812
Abstract
This article proposes a multilabel poem topic classification algorithm that uses large language models and auxiliary data to address the lack of diverse metadata in digital poetry libraries. The study examines the potential of context-dependent language models, specifically bidirectional encoder representations from transformers (BERT), for understanding poetic words and for using auxiliary data, such as authors' notes, to supplement the poetry text. The experimental results demonstrate that the BERT-based model outperforms the traditional support vector machine-based model across all input types and datasets. We also show that incorporating notes as an additional input improves the performance of the poem-only model. Overall, the study suggests that pretrained context-dependent language models and auxiliary data have the potential to enhance the accessibility of various poems within collections. This research can eventually assist in promoting the discovery of underrepresented poems in digital libraries, even if they lack associated metadata, thus enhancing the understanding and appreciation of the literary form.
Annual Meeting of the Association for Information Science & Technology | Oct. 27 – 31, 2023 | London, United Kingdom. Author(s) retain copyright, but ASIS&T receives an exclusive publication license.
Pages: 538-542
Page count: 4
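The abstract describes two ingredients: pairing each poem with auxiliary text such as an author's note, and predicting several topics per poem at once. The following is a minimal sketch of how such inputs and multilabel targets might be prepared before they are handed to a classifier. It is not the paper's implementation: the `Poem` class, the `build_input` and `multi_hot` helpers, the `[SEP]` joining convention, and the sample poems and topics are all assumptions made for illustration.

```python
# Illustrative sketch only. The names, sample data, and the " [SEP] " joining
# convention are assumptions, not details taken from the paper.
from dataclasses import dataclass, field


@dataclass
class Poem:
    text: str
    note: str = ""                      # auxiliary data, e.g. an author's note
    topics: set = field(default_factory=set)


def build_input(poem, use_note=True, sep=" [SEP] "):
    """Join poem text and note into one string; fall back to the poem alone."""
    if use_note and poem.note:
        return poem.text + sep + poem.note
    return poem.text


def multi_hot(topics, label_space):
    """Encode a set of topics as a 0/1 vector over a fixed label order."""
    return [1 if label in topics else 0 for label in label_space]


# Hypothetical two-poem collection; the second poem has no note.
poems = [
    Poem("The tide forgets the names we wrote.",
         "Written after a winter at the coast.",
         {"nature", "memory"}),
    Poem("My father's hands, a map of work.", topics={"family"}),
]

# A fixed, sorted label order keeps target vectors comparable across poems.
label_space = sorted({t for p in poems for t in p.topics})

inputs = [build_input(p) for p in poems]                 # poem + note variant
targets = [multi_hot(p.topics, label_space) for p in poems]
```

Calling `build_input(p, use_note=False)` yields the poem-only variant, so the same targets can be reused to compare the two input types the abstract mentions.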