An Unsupervised Character-Aware Neural Approach to Word and Context Representation Learning

被引:8
|
作者
Marra, Giuseppe [1 ,2 ]
Zugarini, Andrea [1 ,2 ]
Melacci, Stefano [2 ]
Maggini, Marco [2 ]
机构
[1] Univ Firenze, DINFO, Florence, Italy
[2] Univ Siena, DIISM, Siena, Italy
关键词
Recurrent Neural Networks; Unsupervised learning; Word and context embeddings; Natural Language Processing; Deep learning;
D O I
10.1007/978-3-030-01424-7_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years, neural networks have been intensively used to develop meaningful distributed representations of words and contexts around them. When these representations, also known as "embeddings", are learned from unsupervised large corpora, they can be transferred to different tasks with positive effects in terms of performances, especially when only a few supervisions are available. In this work, we further extend this concept, and we present an unsupervised neural architecture that jointly learns word and context embeddings, processing words as sequences of characters. This allows our model to spot the regularities that are due to the word morphology, and to avoid the need of a fixed-sized input vocabulary of words. We show that we can learn compact encoders that, despite the relatively small number of parameters, reach high-level performances in downstream tasks, comparing them with related state-of-the-art approaches or with fully supervised methods.
引用
收藏
页码:126 / 136
页数:11
相关论文
共 50 条
  • [21] Cluster-aware multiplex InfoMax for unsupervised graph representation learning
    Xu, Xin
    Du, Junping
    Song, Jie
    Xue, Zhe
    Li, Ang
    Guan, Zeli
    NEUROCOMPUTING, 2023, 532 : 94 - 105
  • [22] Fast and Unsupervised Neural Architecture Evolution for Visual Representation Learning
    Xue, Song
    Chen, Hanlin
    Xie, Chunyu
    Zhang, Baochang
    Gong, Xuan
    Doermann, David
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2021, 16 (03) : 22 - 32
  • [23] Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering
    Mei, Guofeng
    Saltori, Cristiano
    Ricci, Elisa
    Sebe, Nicu
    Wu, Qiang
    Zhang, Jian
    Poiesi, Fabio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 3251 - 3269
  • [24] HCUKE: A Hierarchical Context-aware approach for Unsupervised Keyphrase Extraction
    Xu, Chun
    Mao, Xian-Ling
    Xin, Cheng-Xin
    Shang, Yu-Ming
    Che, Tian-Yi
    Mao, Hong-Li
    Huang, Heyan
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [25] Temporal Context-Aware Representation Learning for Question Routing
    Zhang, Xuchao
    Cheng, Wei
    Zong, Bo
    Chen, Yuncong
    Xu, Jianwu
    Li, Ding
    Chen, Haifeng
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 753 - 761
  • [26] Chinese word segmentation with local and global context representation learning
    School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing
    100083, China
    不详
    100190, China
    High Technol Letters, 1 (71-77):
  • [27] Chinese word segmentation with local and global context representation learning
    李岩
    Zhang Yinghua
    Huang Xiaoping
    Yin Xucheng
    Hao Hongwei
    High Technology Letters, 2015, 21 (01) : 71 - 77
  • [28] A Trust Aware Unsupervised Learning Approach for Insider Threat Detection
    Aldairi, Maryam
    Karimi, Leila
    Joshi, James
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 89 - 98
  • [29] An Unsupervised Learning and Statistical Approach for Vietnamese Word Recognition and Segmentation
    Trung, Hieu Le
    Vu Le Anh
    Trung, Kien Le
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 195 - +
  • [30] Deep boundary-aware clustering by jointly optimizing unsupervised representation learning
    Ru Wang
    Lin Li
    Peipei Wang
    Xiaohui Tao
    Peiyu Liu
    Multimedia Tools and Applications, 2022, 81 : 34309 - 34324