An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

Cited by: 8

Authors:

Marra, Giuseppe [1,2]

Zugarini, Andrea [1,2]

Melacci, Stefano [2]

Maggini, Marco [2]

Affiliations:

[1] Univ Firenze, DINFO, Florence, Italy

[2] Univ Siena, DIISM, Siena, Italy

Source:

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018 / Vol. 11141

Keywords:

Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning

DOI:

10.1007/978-3-030-01424-7_13

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In the last few years, neural networks have been used intensively to learn meaningful distributed representations of words and of the contexts around them. When these representations, also known as "embeddings", are learned in an unsupervised way from large corpora, they can be transferred to different tasks with positive effects on performance, especially when only limited supervision is available. In this work, we extend this concept further and present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to capture the regularities that are due to word morphology and removes the need for a fixed-size input vocabulary of words. We show that we can learn compact encoders that, despite their relatively small number of parameters, reach high performance in downstream tasks, and we compare them with related state-of-the-art approaches or with fully supervised methods.

Pages: 126-136

Number of pages: 11
