An Eclectic Approach for Enhancing Language Models Through Rich Embedding Features

Authors
Aldana-Bobadilla, Edwin [1 ,2 ]
Sosa-Sosa, Victor Jesus [2 ]
Molina-Villegas, Alejandro [1 ,3 ]
Gazca-Hernandez, Karina [2 ]
Olivas, Jose Angel [4 ]
Affiliations
[1] CONAHCYT, Mexico City 03940, Mexico
[2] Cinvestav, Unidad Tamaulipas, Ciudad Victoria 87130, Tamaulipas, Mexico
[3] Ctr Invest Ciencias Invest Geoespacial, Mexico City 14240, Mexico
[4] Univ Castilla La Mancha, Grp SMILe, Ciudad Real 13071, Spain
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Task analysis; Semantics; Transformers; Neurons; Linguistics; Self-organizing feature maps; Text analysis; Self-organizing map; word embeddings; feature extraction; natural language processing;
DOI
10.1109/ACCESS.2024.3422971
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Text processing is a fundamental aspect of Natural Language Processing (NLP) and is crucial for various applications in fields such as artificial intelligence, data science, and information retrieval. It plays a core role in language models. Most text-processing approaches focus on describing and synthesizing, to a greater or lesser degree, lexical, syntactic, and semantic properties of text in the form of numerical vectors that induce a metric space in which it is possible to find underlying patterns and structures related to the original text. Since each approach has strengths and weaknesses, finding a single approach that perfectly extracts representative text properties for every task and application domain is hard. This paper proposes a novel approach capable of synthesizing information from heterogeneous state-of-the-art text-processing approaches into a unified representation. Encouraging results demonstrate that using this representation in popular machine-learning tasks not only leads to superior performance but also offers notable advantages in memory efficiency and in preserving the underlying information of the distinct sources involved in such a representation.
Pages: 100921-100938
Page count: 18