You Only Look Once: Unified, Real-Time Object Detection

Cited by: 19113
Authors
Redmon, Joseph [1]
Divvala, Santosh [1, 2]
Girshick, Ross [3]
Farhadi, Ali [1, 2]
Affiliations
[1] Univ Washington, Seattle, WA 98195 USA
[2] Allen Inst AI, Seattle, WA USA
[3] Facebook Res, Menlo Pk, CA USA
DOI
10.1109/CVPR.2016.91
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background. Finally, YOLO learns very general representations of objects. It outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Pages: 779-788
Page count: 10
Related papers
50 in total
  • [1] Transformers only look once with nonlinear combination for real-time object detection
    Xia, Ruiyang
    Li, Guoquan
    Huang, Zhengwen
    Pang, Yu
    Qi, Man
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (15): : 12571 - 12585
  • [2] Transformers only look once with nonlinear combination for real-time object detection
    Ruiyang Xia
    Guoquan Li
    Zhengwen Huang
    Yu Pang
    Man Qi
    [J]. Neural Computing and Applications, 2022, 34 : 12571 - 12585
  • [3] Polyp Recognition and Localization with You-Only-Look-Once (YOLO) Real-Time Object Detection System
    Li, Weiquan James
    Ang, Tiing Leong
    Chong, Dewei
    Chia, Tiongsun
    Fock, Kwong Ming
    [J]. DIGESTION, 2021, 102 (01) : 101 - 101
  • [4] YOLOH: You Only Look One Hourglass for Real-Time Object Detection
    Wang, Shaobo
    Chen, Renhai
    Wu, Hongyue
    Li, Xiaozhe
    Feng, Zhiyong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2104 - 2115
  • [5] Making You Only Look Once Faster: Toward Real-Time Intelligent Transportation Detection
    Dai, Yuan
    Liu, Weiming
    Xie, Wei
    Liu, Ruikang
    Zheng, Zhongxing
    Long, Kejun
    Wang, Liang
    Mao, Liang
    Qiu, Qisheng
    Ling, Guangzheng
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2023, 15 (03) : 8 - 25
  • [6] Research on the Real-Time Detection of Red Fruit Based on the You Only Look Once Algorithm
    Mei, Song
    Ding, Wenqin
    Wang, Jinpeng
    [J]. PROCESSES, 2024, 12 (01)
  • [7] You Only Look at Once for Real-Time and Generic Multi-Task
    Wang, Jiayuan
    Wu, Q. M. Jonathan
    Zhang, Ning
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (09) : 12625 - 12637
  • [8] IYOLO-NL: An improved you only look once and none left object detector for real-time face mask detection
    Zhou, Yan
    [J]. HELIYON, 2023, 9 (08)
  • [9] Jensen-Shannon Divergence You Only Look Once: A Real-Time Robotic Grasp Detection Network
    Han, Tianjiao
    Yu, Dan
    [J]. ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (05)
  • [10] Real-time detection of concrete cracks via enhanced You Only Look Once Network: Algorithm and software
    Fu, Ronghua
    Zhang, Yufeng
    Zhu, Kai
    Strauss, Alfred
    Cao, Maosen
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2024, 195