Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning

被引:26
|
作者
Shutova, Ekaterina [1 ]
Sun, Lin [2 ]
Gutierrez, Elkin Dario [3 ]
Lichtenstein, Patricia [4 ]
Narayanan, Srini [5 ]
机构
[1] Univ Cambridge, Comp Lab, William Gates Bldg, Cambridge CB3 0FD, England
[2] Greedy Intelligence, Hangzhou, Zhejiang, Peoples R China
[3] Univ Calif San Diego, Dept Cognit Sci, 9500 Gilman Dr, La Jolla, CA 92093 USA
[4] Univ Calif Merced, Dept Cognit & Informat Sci, 5200 Lake Rd, Merced, CA 95343 USA
[5] Google Res, Brandschenkestr 110, CH-8002 Zurich, Switzerland
关键词
CONCRETE; ENGLISH; TIME;
D O I
10.1162/COLI_a_00275
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Highly frequent in language and communication, metaphor represents a significant challenge for Natural Language Processing (NLP) applications. Computational work on metaphor has traditionally evolved around the use of hand-coded knowledge, making the systems hard to scale. Recent years have witnessed a rise in statistical approaches to metaphor processing. However, these approaches often require extensive human annotation effort and are predominantly evaluated within a limited domain. In contrast, we experiment with weakly supervised and unsupervised techniqueswith little or no annotationto generalize higher-level mechanisms of metaphor from distributional properties of concepts. We investigate different levels and types of supervision (learning from linguistic examples vs. learning from a given set of metaphorical mappings vs. learning without annotation) in flat and hierarchical, unconstrained and constrained clustering settings. Our aim is to identify the optimal type of supervision for a learning algorithm that discovers patterns of metaphorical association from text. In order to investigate the scalability and adaptability of our models, we applied them to data in three languages from different language groupsEnglish, Spanish, and Russianachieving state-of-the-art results with little supervision. Finally, we demonstrate that statistical methods can facilitate and scale up cross-linguistic research on metaphor.
引用
收藏
页码:71 / 123
页数:53
相关论文
共 50 条
  • [1] Semi-Supervised and Unsupervised Extreme Learning Machines
    Huang, Gao
    Song, Shiji
    Gupta, Jatinder N. D.
    Wu, Cheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2405 - 2417
  • [2] Ensemble learning with trees and rules: Supervised, semi-supervised, unsupervised
    Akdemir, Deniz
    Jannink, Jean-Luc
    [J]. INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 857 - 872
  • [3] Multiview Semi-supervised Learning for Ranking Multilingual Documents
    Usunier, Nicolas
    Amini, Massih-Reza
    Goutte, Cyril
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 443 - 458
  • [4] Disfluency Correction using Unsupervised and Semi-supervised Learning
    Saini, Nikhil
    Trivedi, Drumil
    Khare, Shreya
    Dhamecha, Tejas, I
    Jyothi, Preethi
    Bharadwaj, Samarth
    Bhattacharyya, Pushpak
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 3421 - 3427
  • [5] Unsupervised identification of points of interest for semi-supervised learning
    Frigui, H
    [J]. FUZZ-IEEE 2005: Proceedings of the IEEE International Conference on Fuzzy Systems: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 91 - 96
  • [6] Federated Learning in Healthcare with Unsupervised and Semi-Supervised Methods
    Panos-Basterra, Juan
    Dolores Ruiz, M.
    Martin-Bautista, Maria J.
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2023, 2023, 14113 : 182 - 193
  • [7] Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
    Chen, Yanbei
    Mancini, Massimiliano
    Zhu, Xiatian
    Akata, Zeynep
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) : 1327 - 1347
  • [8] Iterative double clustering for unsupervised and semi-supervised learning
    El-Yaniv, R
    Souroujon, O
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1025 - 1032
  • [9] COMBINED UNSUPERVISED AND SEMI-SUPERVISED LEARNING FOR DATA CLASSIFICATION
    Breve, Fabricio Aparecido
    Guimaraes Pedronette, Daniel Carlos
    [J]. 2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [10] Unsupervised Selective Labeling for More Effective Semi-supervised Learning
    Wang, Xudong
    Lian, Long
    Yu, Stella X.
    [J]. COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 427 - 445