Automatic image annotation method based on a convolutional neural network with threshold optimization

被引:14
|
作者
Cao, Jianfang [1 ,2 ]
Zhao, Aidi [1 ]
Zhang, Zibang [1 ]
机构
[1] Taiyuan Univ Sci & Technol, Sch Comp Sci & Technol, Taiyuan, Peoples R China
[2] Xinzhou Teachers Univ, Dept Comp Sci & Technol, Xinzhou, Peoples R China
来源
PLOS ONE | 2020年 / 15卷 / 09期
关键词
OBJECT RECOGNITION; CLASSIFICATION; MODEL; CNN;
D O I
10.1371/journal.pone.0238956
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, a convolutional neural network with threshold optimization (CNN-THOP) is proposed to solve the issue of overlabeling or downlabeling arising during the multilabel image annotation process in the use of a ranking function for label annotation along with prediction probability. This model fuses the threshold optimization algorithm to the CNN structure. First, an optimal model trained by the CNN is used to predict the test set images, and batch normalization (BN) is added to the CNN structure to effectively accelerate the convergence speed and obtain a group of prediction probabilities. Second, threshold optimization is performed on the obtained prediction probability to derive an optimal threshold for each class of labels to form a group of optimal thresholds. When the prediction probability for this class of labels is greater than or equal to the corresponding optimal threshold, this class of labels is used as the annotation result for the image. During the annotation process, the multilabel annotation for the image to be annotated is realized by loading the optimal model and the optimal threshold. Verification experiments are performed on the MIML, COREL5K, and MSRC datasets. Compared with the MBRM, the CNN-THOP increases the average precision on MIML, COREL5K, and MSRC by 27%, 28% and 33%, respectively. Compared with the E2E-DCNN, the CNN-THOP increases the average recall rate by 3% on both COREL5K and MSRC. The most precise annotation effect for CNN-THOP is observed on the MIML dataset, with a complete matching degree reaching 64.8%.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Large scale automatic image annotation based on convolutional neural network
    Wang, Ronggui
    Xie, Yunfei
    Yang, Juan
    Xue, Lixia
    Hu, Min
    Zhang, Qingyang
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 49 : 213 - 224
  • [2] Deep Convolutional Neural Network with KNN Regression for Automatic Image Annotation
    Bensaci, Ramla
    Khaldi, Belal
    Aiadi, Oussama
    Benchabana, Ayoub
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [3] Image Annotation Based on Convolutional Neural Network and Topic Model
    Zhang Lei
    Cai Ming
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (20)
  • [4] Rock Image Automatic Annotation based on Deep Neural Network
    Zhao, Huitao
    Zhao, Xiaohui
    Liu, Wenxiang
    Wei, Junjie
    Cheng, Guojian
    Qiang, Xinjian
    [J]. PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 1824 - 1828
  • [5] Improving loss function for deep convolutional neural network applied in automatic image annotation
    Salar, Ali
    Ahmadi, Ali
    [J]. VISUAL COMPUTER, 2024, 40 (03): : 1617 - 1629
  • [6] Improving loss function for deep convolutional neural network applied in automatic image annotation
    Ali Salar
    Ali Ahmadi
    [J]. The Visual Computer, 2024, 40 : 1617 - 1629
  • [7] Progressive medical image annotation with convolutional neural network-based interactive segmentation method
    Bai, Yunkun
    Sun, Guangmin
    Li, Yu
    Le Shen
    Li Zhang
    [J]. MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [8] Fully automatic image colorization based on Convolutional Neural Network
    Varga, Domonkos
    Sziranyi, Tamas
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3691 - 3696
  • [9] An Improved Automatic Image Annotation Approach Using Convolutional Neural Network-Slantlet Transform
    Adnan, Myasar Mundher
    Rahim, Mohd Shafry Mohd
    Khan, Amjad Rehman
    Saba, Tanzila
    Fati, Suliman Mohamed
    Bahaj, Saeed Ali
    [J]. IEEE ACCESS, 2022, 10 : 7520 - 7532
  • [10] A method of image classification based on convolutional neural network
    Dong, Zhe
    Jiang, Mingyang
    Pei, Zhili
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2018, 124 : 47 - 48