Effect of Text Color on Word Embeddings

被引:4
|
作者
Ikoma, Masaya [1 ]
Iwana, Brian Kenji [1 ]
Uchida, Seiichi [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
来源
DOCUMENT ANALYSIS SYSTEMS | 2020年 / 12116卷
关键词
Word embedding; Text color;
D O I
10.1007/978-3-030-57058-3_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In natural scenes and documents, we can find a correlation between text and its color. For instance, the word, "hot," is often printed in red, while "cold" is often in blue. This correlation can be thought of as a feature that represents the semantic difference between the words. Based on this observation, we propose the idea of using text color for word embeddings. While text-only word embeddings (e.g. word2vec) have been extremely successful, they often represent antonyms as similar since they are often interchangeable in sentences. In this paper, we try two tasks to verify the usefulness of text color in understanding the meanings of words, especially in identifying synonyms and antonyms. First, we quantify the color distribution of words from the book cover images and analyze the correlation between the color and meaning of the word. Second, we try to retrain word embeddings with the color distribution of words as a constraint. By observing the changes in the word embeddings of synonyms and antonyms before and after re-training, we aim to understand the kind of words that have positive or negative effects in their word embeddings when incorporating text color information.
引用
收藏
页码:341 / 355
页数:15
相关论文
共 50 条
  • [41] Dataless Short Text Classification Based on Biterm Topic Model and Word Embeddings
    Yang, Yi
    Wang, Hongan
    Zhu, Jiaqi
    Wu, Yunkun
    Jiang, Kailong
    Guo, Wenli
    Shi, Wandong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3969 - 3975
  • [42] Extending Full Text Search for Legal Document Collections Using Word Embeddings
    Landthaler, Joerg
    Waltl, Bernhard
    Holl, Patrick
    Matthes, Florian
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 294 : 73 - 82
  • [43] Emotion Detection from Text via Ensemble Classification Using Word Embeddings
    Herzig, Jonathan
    Shmueli-Scheuer, Michal
    Konopnicki, David
    ICTIR'17: PROCEEDINGS OF THE 2017 ACM SIGIR INTERNATIONAL CONFERENCE THEORY OF INFORMATION RETRIEVAL, 2017, : 269 - 272
  • [44] Combining Dual Word Embeddings with Open Directory Project based Text Classification
    Aliyeva, Dinara
    Kim, Kang-Min
    Choi, Byung-Ju
    Lee, SangKeun
    PROCEEDINGS OF 2018 IEEE 17TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2018), 2018, : 179 - 186
  • [45] Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models
    Watson, Daniel
    Zalmout, Nasser
    Habash, Nizar
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 837 - 843
  • [46] Interpretable segmentation of medical free-text records based on word embeddings
    Dobrakowski, Adam Gabriel
    Mykowiecka, Agnieszka
    Marciniak, Malgorzata
    Jaworski, Wojciech
    Biecek, Przemyslaw
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2021, 57 (03) : 447 - 465
  • [47] Interpretable segmentation of medical free-text records based on word embeddings
    Adam Gabriel Dobrakowski
    Agnieszka Mykowiecka
    Małgorzata Marciniak
    Wojciech Jaworski
    Przemysław Biecek
    Journal of Intelligent Information Systems, 2021, 57 : 447 - 465
  • [48] Comparison of Word Embeddings of Unaligned Audio and Text Data Using Persistent Homology
    Yessenbayev, Zhandos
    Kozhirbayev, Zhanibek
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 700 - 711
  • [49] Examining the effect of whitening on static and contextualized word embeddings
    Sasaki, Shota
    Heinzerling, Benjamin
    Suzuki, Jun
    Inui, Kentaro
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
  • [50] Query-oriented text summarization based on multiobjective evolutionary algorithms and word embeddings
    Fors-Isalguez, Yanet
    Hermosillo-Valadez, Jorge
    Montes-y-Gomez, Manuel
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3235 - 3244