Automated Generation of Chinese Text-Image Summaries Using Deep Learning Techniques

被引:0
|
作者
Xu, Meiling [1 ,2 ]
Abd Rahman, Hayati [1 ]
Li, Feng [1 ,2 ]
机构
[1] Univ Teknol MARA, Coll Comp Informat & Math, Shah Alam 40450, Malaysia
[2] Hebei Finance Univ, Coll Comp & Informat Engn, Baoding 071051, Peoples R China
关键词
Chinese text-image summaries; automated summary generation; deep learning; MaliGAN; cross-modal similarity retrieval; adaptive fusion strategy;
D O I
10.18280/ts.400644
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the era of the internet, an abundance of Chinese text-image content is continuously produced, necessitating effective automated technologies for processing and summarizing these materials. Automated generation of Chinese text-image summaries facilitates rapid comprehension of key information, thereby enhancing the efficiency of information consumption. Due to the unique characteristics of the Chinese language, traditional automatic summarization techniques are inadequately transferable, prompting the development of text-image summary generation technologies tailored to Chinese features. Research indicates that while existing natural language processing and deep learning techniques have made strides in text summarization, deficiencies remain in the deep semantic mining and integration of text-image content. This study primarily focuses on two aspects: Firstly, a generative approach based on an enhanced MaliGAN model, employing deep learning models to improve text generation quality. Secondly, a retrieval-based approach, utilizing cross-modal similarity retrieval to extract text information most relevant to the image content, guiding summary generation. Additionally, this study innovatively proposes a model architecture comprising segmentation, cross-modal retrieval, and adaptive fusion strategy modules, significantly augmenting the accuracy and reliability of text-image summary generation.
引用
收藏
页码:2835 / 2843
页数:9
相关论文
共 50 条
  • [41] A Survey on Image Semantic Segmentation Using Deep Learning Techniques
    Cheng, Jieren
    Li, Hua
    Li, Dengbo
    Hua, Shuai
    Sheng, Victor S.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1941 - 1957
  • [42] Safe overtaking using image processing and deep learning techniques
    Perepu, Satheesh K.
    Kumar, Prasanna P.
    2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTING (ICOCO), 2021, : 55 - 60
  • [43] Study of CT image texture using deep learning techniques
    Dutta, Sandeep
    Fan, Jiahua
    Chevalier, David
    MEDICAL IMAGING 2018: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2018, 10577
  • [44] Different Techniques of Image SR Using Deep Learning: A Review
    Sharma, Ajay
    Shrivastava, Bhavana P.
    IEEE SENSORS JOURNAL, 2023, 23 (03) : 1724 - 1733
  • [45] Thermal Face Image Classification using Deep Learning Techniques
    Chatterjee, Prosenjit
    Zaman, A. N. K.
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1208 - 1213
  • [46] Deep learning in automated text classification: a case study using toxicological abstracts
    Varghese A.
    Agyeman-Badu G.
    Cawley M.
    Environment Systems and Decisions, 2020, 40 (4) : 465 - 479
  • [47] Automated Detection of Cystitis in Ultrasound Images Using Deep Learning Techniques
    Sankari, V. M. Raja
    Raykar, Dattanand Arun
    Snekhalatha, U.
    Karthik, Varshini
    Shetty, Veerendra
    IEEE ACCESS, 2023, 11 : 104179 - 104190
  • [48] Automated Molecular Subtyping of Breast Carcinoma Using Deep Learning Techniques
    Niyas, S.
    Bygari, Ramya
    Naik, Rachita
    Viswanath, Bhavishya
    Ugwekar, Dhananjay
    Mathew, Tojo
    Kavya, J.
    Kini, Jyoti R.
    Rajan, Jeny
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2023, 11 : 161 - 169
  • [50] Automated Generation of Lung Cytological Images from Image Findings Using Text-to-Image Technology
    Teramoto, Atsushi
    Kiriyama, Yuka
    Michiba, Ayano
    Yazawa, Natsuki
    Tsukamoto, Tetsuya
    Imaizumi, Kazuyoshi
    Fujita, Hiroshi
    COMPUTERS, 2024, 13 (11)