Dense Image Captioning Based on Precise Feature Extraction

Cited by: 4

Authors:

Zhang, Zhiqiang ^{[1]}

Zhang, Yunye ^{[1]}

Shi, Yan ^{[1]}

Yu, Wenxin ^{[1]}

Nie, Li ^{[1]}

He, Gang ^{[2]}

Fan, Yibo ^{[3]}

Yang, Zhuo ^{[4]}

Affiliations:

[1] Southwest Univ Sci & Technol, Mianyang, Sichuan, Peoples R China

[2] Xidian Univ, Xian, Peoples R China

[3] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China

[4] Guangdong Univ Technol, Guangzhou, Peoples R China

Source:

NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V | 2019 / Vol. 1143

Funding:

National Natural Science Foundation of China;

Keywords:

Dense captioning; Computer vision; Feature extraction; Location and description; Deep learning;

DOI:

10.1007/978-3-030-36802-9_10

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Image captioning is a challenging problem in computer vision with numerous practical applications. Recently, dense image captioning has emerged, which aims at a full understanding of an image by localizing and describing multiple salient regions covering it. Although state-of-the-art approaches have made encouraging progress, their ability to localize target regions and describe them accordingly still falls short of expectations. To address this challenge, this paper proposes a precise feature extraction (PFE) method to further improve dense image captioning. Our model is evaluated on the Visual Genome dataset, and the results show that our method outperforms other state-of-the-art methods.


Pages: 83 - 90

Page count: 8
