An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [1] Character-Aware Neural Morphological Disambiguation
    Toleu, Alymzhan
    Tolegen, Gulmira
    Makazhanov, Aibek
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 666 - 671
  • [2] Character-Aware Neural Language Models
    Kim, Yoon
    Jernite, Yacine
    Sontag, David
    Rush, Alexander M.
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2741 - 2749
  • [3] Character-Aware Convolutional Neural Networks for Paraphrase Identification
    Huang, Jiangping
    Ji, Donghong
    Yao, Shuxin
    Huang, Wenzhi
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 177 - 184
  • [4] Explaining Character-Aware Neural Networks for Word-Level Prediction: Do They Discover Linguistic Rules?
    Godin, Frederic
    Demuynck, Kris
    Dambre, Joni
    De Neve, Wesley
    Demeester, Thomas
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3275 - 3284
  • [5] Character-Aware Sub-word Level Language Modeling for Uyghur and Turkish ASR
    Liu, Chang
    Zhang, Zhen
    Zhang, Pengyuan
    Yan, Yonghong
    INTERSPEECH 2019, 2019, : 3495 - 3499
  • [6] Gated Character-aware Convolutional Neural Network for Effective Automated Essay Scoring
    Bai, Huanyu
    Huang, Zhilin
    Hao, Anran
    Hiu, Siu Cheung
    2021 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2021), 2021, : 351 - 359
  • [7] Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition
    Liu, Wei
    Chen, Chaofeng
    Wong, Kwan-Yee K.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7154 - 7161
  • [8] Learning Sense Representation from Word Representation for Unsupervised Word Sense Disambiguation
    Wang, Jie
    Fu, Zhenxin
    Li, Moxin
    Zhang, Haisong
    Zhao, Dongyan
    Yan, Rui
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13947 - 13948
  • [9] Unsupervised Visual Representation Learning by Context Prediction
    Doersch, Carl
    Gupta, Abhinav
    Efros, Alexei A.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1422 - 1430
  • [10] Unsupervised Cross-Domain Word Representation Learning
    Bollegala, Danushka
    Maehara, Takanori
    Kawarabayashi, Ken-Ichi
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 730 - 740