An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引：8

作者：

Marra, Giuseppe ^{[1
,2
]}

Zugarini, Andrea ^{[1
,2
]}

Melacci, Stefano ^{[2
]}

Maggini, Marco ^{[2
]}

机构：

[1] Univ Firenze, DINFO, Florence, Italy

[2] Univ Siena, DIISM, Siena, Italy

来源：

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III | 2018年 / 11141卷

关键词：

Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;

D O I：

10.1007/978-3-030-01424-7_13

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.

引用

页码：126 / 136

页数：11

共 50 条

[31] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
Wang, Ru
Li, Lin
Wang, Peipei
Tao, Xiaohui
Liu, Peiyu
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34309 - 34324
[32] Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations
Noguchi, Atsuhiro
Sun, Xiao
Lin, Stephen
Harada, Tatsuya
COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 : 597 - 614
[33] Fully Unsupervised Machine Translation Using Context-Aware Word Translation and Denoising Autoencoder
Chauhan, Shweta
Daniel, Philemon
Saxena, Shefali
Sharma, Ayush
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[34] Recommendations with context aware framework using particle swarm optimization and unsupervised learning
Jain, Parul
Dixit, Veer Sain
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4479 - 4490
[35] Coreference aware representation learning for neural named entity recognition
Dai, Zeyu
Fei, Hongliang
Li, Ping
IJCAI International Joint Conference on Artificial Intelligence, 2019, 2019-August : 4946 - 4953
[36] Coreference Aware Representation Learning for Neural Named Entity Recognition
Dai, Zeyu
Fei, Hongliang
Li, Ping
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4946 - 4953
[37] Unsupervised Point Cloud Representation Learning With Deep Neural Networks: A Survey
Xiao, Aoran
Huang, Jiaxing
Guan, Dayan
Zhang, Xiaoqin
Lu, Shijian
Shao, Ling
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11321 - 11339
[38] Variational approach to unsupervised learning algorithms of neural networks
Likhovidov, V
NEURAL NETWORKS, 1997, 10 (02) : 273 - 289
[39] A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning
Yang, Shijie
Li, Liang
Wang, Shuhui
Zhang, Weigang
Huang, Qingming
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7053 - 7061
[40] CONTEXT-AWARE NEURAL CONFIDENCE ESTIMATION FOR RARE WORD SPEECH RECOGNITION
Qiu, David
Munkhdalai, Tsendsuren
He, Yanzhang
Sim, Khe Chai
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 31 - 37

← 1 2 3 4 5 →