Automated Generation of Chinese Text-Image Summaries Using Deep Learning Techniques

Cited by: 0
Authors
Xu, Meiling [1, 2]
Abd Rahman, Hayati [1]
Li, Feng [1, 2]
Affiliations
[1] Univ Teknol MARA, Coll Comp Informat & Math, Shah Alam 40450, Malaysia
[2] Hebei Finance Univ, Coll Comp & Informat Engn, Baoding 071051, Peoples R China
Keywords
Chinese text-image summaries; automated summary generation; deep learning; MaliGAN; cross-modal similarity retrieval; adaptive fusion strategy
DOI
10.18280/ts.400644
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the internet era, Chinese text-image content is produced continuously and in large volume, which calls for effective automated technologies to process and summarize it. Automatically generated Chinese text-image summaries let readers grasp key information quickly, making information consumption more efficient. Because of the particular characteristics of the Chinese language, traditional automatic summarization techniques transfer poorly, which motivates text-image summary generation technologies tailored to Chinese. Research indicates that although existing natural language processing and deep learning techniques have advanced text summarization, they remain deficient in deep semantic mining and in integrating text and image content. This study focuses on two aspects. The first is a generative approach based on an enhanced MaliGAN model, which uses deep learning models to improve the quality of generated text. The second is a retrieval-based approach, which uses cross-modal similarity retrieval to extract the text most relevant to the image content and uses it to guide summary generation. The study also proposes a model architecture comprising segmentation, cross-modal retrieval, and adaptive fusion strategy modules, which the authors report significantly improves the accuracy and reliability of text-image summary generation.
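The retrieval step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that an image and each candidate sentence have already been mapped into a shared embedding space by some encoder (not shown), and it ranks sentences by cosine similarity to the image vector. The names `cosine` and `retrieve_top_k` and the toy vectors are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def retrieve_top_k(image_vec, sentence_vecs, k=2):
    """Return indices of the k sentences most similar to the image embedding."""
    scored = [(cosine(image_vec, vec), idx) for idx, vec in enumerate(sentence_vecs)]
    # Highest similarity first; ties are broken by original sentence order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


# Toy embeddings (assumed values, standing in for real encoder outputs).
image_vec = [1.0, 0.0, 1.0]
sentence_vecs = [
    [0.9, 0.1, 0.8],  # sentence 0: close to the image
    [0.0, 1.0, 0.0],  # sentence 1: orthogonal to the image
    [0.5, 0.5, 0.5],  # sentence 2: partially related
]
top = retrieve_top_k(image_vec, sentence_vecs, k=2)
print(top)
```

In a pipeline of the kind the abstract outlines, the retrieved sentences would then serve as guidance for the summary generator; how the paper combines them in its adaptive fusion module is not specified in this record.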
Pages: 2835-2843
Number of pages: 9
Related papers
50 in total
  • [31] Automated ontology generation from a plain text using statistical and NLP techniques
    Kumar, Naresh
    Kumar, Minakshi
    Singh, Manjeet
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2016, 7 (01) : 282 - 293
  • [32] A Deep Learning Approach for Text Generation
    Elmogy, Ahmed
    Mahmoud, Belal
    Saleh, Mohamed
    29TH INTERNATIONAL CONFERENCE ON COMPUTER THEORY AND APPLICATIONS (ICCTA 2019), 2019, : 102 - 106
  • [33] Adversarial Image Generation using Evolution and Deep Learning
    Soderlund, Jacob
    Blair, Alan
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 1384 - 1391
  • [34] Automatic image caption generation using deep learning
    Verma, Akash
    Yadav, Arun Kumar
    Kumar, Mohit
    Yadav, Divakar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 5309 - 5325
  • [35] Automatic image caption generation using deep learning
    Akash Verma
    Arun Kumar Yadav
    Mohit Kumar
    Divakar Yadav
    Multimedia Tools and Applications, 2024, 83 : 5309 - 5325
  • [36] Image Caption Generation using Deep Learning Technique
    Amritkar, Chetan
    Jabade, Vaishali
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [37] Synthetic Face Image Generation Using Deep Learning
    Sireesha, C.
    Venunath, P. Sai
    Surya, N. Sri
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 231 - 240
  • [38] Chinese Text Detection Using Deep Learning Model and Synthetic Data
    Gao, Wei-wei
    Zhang, Jun
    Chen, Peng
    Wang, Bing
    Xia, Yi
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT I, 2018, 10954 : 503 - 512
  • [39] Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings
    Carvalho, Micael
    Cadene, Remi
    Picard, David
    Soulier, Laure
    Thome, Nicolas
    Cord, Matthieu
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 35 - 44
  • [40] A Novel System for Image Text Recognition and Classification using Deep Learning
    Manzoor, Syed Ishfaq
    Singla, Jimmy
    2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 61 - 64