Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning

被引:10
|
作者
Oluwasammi, Ariyo [1 ]
Aftab, Muhammad Umar [2 ]
Qin, Zhiguang [1 ]
Son Tung Ngo [3 ]
Thang Van Doan [3 ]
Son Ba Nguyen [3 ]
Son Hoang Nguyen [3 ]
Giang Hoang Nguyen [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 610054, Peoples R China
[2] Natl Univ Comp & Emerging Sci, Dept Comp Sci, Chiniot Faisalabad Campus, Islamabad 35400, Chiniot, Pakistan
[3] FPT Univ, ICT Dept, Hanoi 10000, Vietnam
基金
中国国家自然科学基金;
关键词
RANDOM-FIELDS; CLASSIFICATION; CONNECTIONS; ATTENTION; NETWORKS; LANGUAGE; VISION; FUSION; MODELS;
D O I
10.1155/2021/5538927
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
With the emergence of deep learning, computer vision has witnessed extensive advancement and has seen immense applications in multiple domains. Specifically, image captioning has become an attractive focal direction for most machine learning experts, which includes the prerequisite of object identification, location, and semantic understanding. In this paper, semantic segmentation and image captioning are comprehensively investigated based on traditional and state-of-the-art methodologies. In this survey, we deliberate on the use of deep learning techniques on the segmentation analysis of both 2D and 3D images using a fully convolutional network and other high-level hierarchical feature extraction methods. First, each domain's preliminaries and concept are described, and then semantic segmentation is discussed alongside its relevant features, available datasets, and evaluation criteria. Also, the semantic information capturing of objects and their attributes is presented in relation to their annotation generation. Finally, analysis of the existing methods, their contributions, and relevance are highlighted, informing the importance of these methods and illuminating a possible research continuation for the application of semantic image segmentation and image captioning approaches.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Deep Learning for Military Image Captioning
    Das, Subrata
    Jain, Lalit
    Das, Amp
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2165 - 2171
  • [32] Image Captioning using Deep Learning
    Jain, Yukti Sanjay
    Dhopeshwar, Tanisha
    Chadha, Supreet Kaur
    Pagire, Vrushali
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021,
  • [33] Image Captioning Using Deep Learning
    Adithya, Paluvayi Veera
    Kalidindi, Mourya Viswanadh
    Swaroop, Nallani Jyothi
    Vishwas, H. N.
    [J]. ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 42 - 58
  • [34] Research of animals image semantic segmentation based on deep learning
    Liu, Shouqiang
    Li, Miao
    Li, Min
    Xu, Qingzhen
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (01):
  • [35] Deep CRF-Graph Learning for Semantic Image Segmentation
    Ding, Fuguang
    Wang, Zhenhua
    Guo, Dongyan
    Chen, Shengyong
    Zhang, Jianhua
    Shao, Zhanpeng
    [J]. PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 360 - 368
  • [36] Semantic Image Segmentation with Deep Learning for Vine Leaf Phenotyping
    Tamvakis, Petros N.
    Kiourt, Chairi
    Solomou, Alexandra D.
    Ioannakis, George
    Tsirliganis, Nestoras C.
    [J]. IFAC PAPERSONLINE, 2022, 55 (32): : 83 - 88
  • [37] Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation
    Nimma, Divya
    Uddagiri, Arjun
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 1172 - 1185
  • [38] An Innovative Deep Learning Approach for Image Semantic and Instance Segmentation
    Chen C.
    Gao G.
    Liu L.
    Qiao Y.
    [J]. Journal of Computing and Information Technology, 2023, 31 (03) : 167 - 183
  • [39] Semantic image segmentation algorithm in a deep learning computer network
    Defu He
    Chao Xie
    [J]. Multimedia Systems, 2022, 28 : 2065 - 2077
  • [40] Semantic image segmentation algorithm in a deep learning computer network
    He, Defu
    Xie, Chao
    [J]. MULTIMEDIA SYSTEMS, 2022, 28 (06) : 2065 - 2077