An Eclectic Approach for Enhancing Language Models Through Rich Embedding Features

被引:0
|
作者
Aldana-Bobadilla, Edwin [1 ,2 ]
Sosa-Sosa, Victor Jesus [2 ]
Molina-Villegas, Alejandro [1 ,3 ]
Gazca-Hernandez, Karina [2 ]
Olivas, Jose Angel [4 ]
机构
[1] CONAHCYT, Mexico City 03940, Mexico
[2] Cinvestav, Unidad Tamaulipas, Ciudad Victoria 87130, Tamaulipas, Mexico
[3] Ctr Invest Ciencias Invest Geoespacial, Mexico City 14240, Mexico
[4] Univ Castilla La Mancha, Grp SMILe, Ciudad Real 13071, Spain
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Task analysis; Semantics; Transformers; Neurons; Linguistics; Self-organizing feature maps; Text analysis; Self-organizing map; word embeddings; feature extraction; natural language processing;
D O I
10.1109/ACCESS.2024.3422971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text processing is a fundamental aspect of Natural Language Processing (NLP) and is crucial for various applications in fields such as artificial intelligence, data science, and information retrieval. It plays a core role in language models. Most text-processing approaches focus on describing and synthesizing, to a greater or lesser degree, lexical, syntactic, and semantic properties of text in the form of numerical vectors that induce a metric space, in which, it is possible to find underlying patterns and structures related to the original text. Since each approach has strengths and weaknesses, finding a single approach that perfectly extracts representative text properties for every task and application domain is hard. This paper proposes a novel approach capable of synthesizing information from heterogeneous state-of-the-art text processing approaches into a unified representation. Encouraging results demonstrate that using this representation in popular machine-learning tasks not only leads to superior performance but also offers notable advantages in memory efficiency and preservation of underlying information of the distinct sources involved in such a representation.
引用
收藏
页码:100921 / 100938
页数:18
相关论文
共 50 条
  • [22] Enhancing Relation Extraction Through Augmented Data: Large Language Models Unleashed
    Ali, Manzoor
    Nisar, Muhammad Sohail
    Saleem, Muhammad
    Moussallem, Diego
    Ngomo, Axel-Cyrille Ngonga
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT II, NLDB 2024, 2024, 14763 : 68 - 78
  • [23] Enhancing radiology report generation through pre-trained language models
    Leonardi, Giorgio
    Portinale, Luigi
    Santomauro, Andrea
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2024,
  • [24] Enhancing Small Language Models for Graph Tasks Through Graph Encoder Integration
    Oh, Dongryul
    Kang, Sujin
    Kim, Heejin
    Oh, Dongsuk
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [25] A Novel Approach to Analyzing Defects: Enhancing Knowledge Graph Embedding Models for Main Electrical Equipment
    Chen, Yanyu
    Huang, Jianye
    Qian, Jian
    Yi, Longqiang
    Li, Jinhu
    Huang, Jiangsheng
    Zhang, Zhihong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 715 - 725
  • [26] Eclectic approach and curriculum guidelines for teaching oral language: "Me first"
    Bueno Gonzalez, Antonio
    PORTA LINGUARUM, 2009, (11) : 209 - 210
  • [27] Enhancing Neural Decoding with Large Language Models: A GPT-Based Approach
    Lee, Dong Hyeok
    Chung, Chun Kee
    2024 12TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI 2024, 2024,
  • [28] The Eclectic Approach in English Language Teaching Applications: A Qualitative Synthesis of the Literature
    Valledor, Adrian
    Olmedo, Alvaro
    Hellin, Carlos J. J.
    Tayebi, Abdelhamid
    Oton-Tortosa, Salvador
    Gomez, Josefa
    SUSTAINABILITY, 2023, 15 (15)
  • [29] Enhancing Data Retrieval with Custom Embedding Models and ChatGPT
    Toftdahl, Jens Christian
    Shahin, Kamrul Islam
    Hylle, Thomas
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 455 - 461
  • [30] Enhancing Video Summarization via Vision-Language Embedding
    Plummer, Bryan A.
    Brown, Matthew
    Lazebnik, Svetlana
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1052 - 1060