Does GPT-3 Grasp Metaphors? Identifying Metaphor Mappings with Generative Language Models

被引：0

作者：

Wachowiak, Lennart ^{[1
]}

Gromann, Dagmar ^{[2
]}

机构：

[1] Kings Coll London, London, England

[2] Univ Vienna, Vienna, Austria

来源：

PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Conceptual metaphors present a powerful cognitive vehicle to transfer knowledge structures from a source to a target domain. Prior neural approaches focus on detecting whether natural language sequences are metaphoric or literal. We believe that to truly probe metaphoric knowledge in pre-trained language models, their capability to detect this transfer should be investigated. To this end, this paper proposes to probe the ability of GPT-3 to detect metaphoric language and predict the metaphor's source domain without any pre-set domains. We experiment with different training sample configurations for fine-tuning and few-shot prompting on two distinct datasets. When provided 12 fewshot samples in the prompt, GPT-3 generates the correct source domain for a new sample with an accuracy of 65.15% in English and 34.65% in Spanish. GPT's most common error is a hallucinated source domain for which no indicator is present in the sentence. Other common errors include identifying a sequence as literal even though a metaphor is present and predicting the wrong source domain based on specific words in the sequence that are not metaphorically related to the target domain.

引用

页码：1018 / 1032

页数：15

共 43 条

[41] Comparative analysis of large language models in psychiatry and mental health: A focus on GPT, AYA, and Nemotron-3-8B - 8B
Gargari, Omid Kohandel
Habibi, Gholamreza
Nilchian, Nima
Farzan, Arman Shafiee
ASIAN JOURNAL OF PSYCHIATRY, 2024, 99
[42] Human-Comparable Sensitivity of Large Language Models inIdenti fying Eligible Studies Through Title and Abstract Screening:3-Layer Strategy Using GPT-3.5 and GPT-4 for Systematic Reviews
Matsui, Kentaro
Utsumi, Tomohiro
Aoki, Yumi
Maruki, Taku
Takeshima, Masahiro
Takaesu, Yoshikazu
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
[43] Comparative diagnostic accuracy of GPT-4o and LLaMA 3-70b: Proprietary vs. open-source large language models in radiology☆
Li, David
Gupta, Kartik
Bhaduri, Mousumi
Sathiadoss, Paul
Bhatnagar, Sahir
Chong, Jaron
CLINICAL IMAGING, 2025, 118

← 1 2 3 4 5 →