An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [31] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
    Wang, Ru
    Li, Lin
    Wang, Peipei
    Tao, Xiaohui
    Liu, Peiyu
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34309 - 34324
  • [32] Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations
    Noguchi, Atsuhiro
    Sun, Xiao
    Lin, Stephen
    Harada, Tatsuya
    COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 : 597 - 614
  • [33] Fully Unsupervised Machine Translation Using Context-Aware Word Translation and Denoising Autoencoder
    Chauhan, Shweta
    Daniel, Philemon
    Saxena, Shefali
    Sharma, Ayush
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [34] Recommendations with context aware framework using particle swarm optimization and unsupervised learning
    Jain, Parul
    Dixit, Veer Sain
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4479 - 4490
  • [35] Coreference aware representation learning for neural named entity recognition
    Dai, Zeyu
    Fei, Hongliang
    Li, Ping
    IJCAI International Joint Conference on Artificial Intelligence, 2019, 2019-August : 4946 - 4953
  • [36] Coreference Aware Representation Learning for Neural Named Entity Recognition
    Dai, Zeyu
    Fei, Hongliang
    Li, Ping
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4946 - 4953
  • [37] Unsupervised Point Cloud Representation Learning With Deep Neural Networks: A Survey
    Xiao, Aoran
    Huang, Jiaxing
    Guan, Dayan
    Zhang, Xiaoqin
    Lu, Shijian
    Shao, Ling
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) : 11321 - 11339
  • [38] Variational approach to unsupervised learning algorithms of neural networks
    Likhovidov, V
    NEURAL NETWORKS, 1997, 10 (02) : 273 - 289
  • [39] A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning
    Yang, Shijie
    Li, Liang
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 7053 - 7061
  • [40] CONTEXT-AWARE NEURAL CONFIDENCE ESTIMATION FOR RARE WORD SPEECH RECOGNITION
    Qiu, David
    Munkhdalai, Tsendsuren
    He, Yanzhang
    Sim, Khe Chai
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 31 - 37