A Hybrid Framework for Object Distance Estimation using a Monocular Camera

Cited by: 2
Authors
Patel, Vaibhav [1]
Mehta, Varun [2]
Bolic, Miodrag [1]
Mantegh, Iraj [2]
Affiliations
[1] Univ Ottawa, Sch Elect Engn & Comp Sci SEECS, 800 King Edward, Ottawa, ON, Canada
[2] Natl Res Council Canada, Montreal, PQ, Canada
Keywords
Object distance estimation; Monocular camera; Hybrid framework; Object detection
DOI
10.1109/DASC58513.2023.10311189
Chinese Library Classification (CLC) number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
Object distance estimation using a monocular camera is a challenging problem in computer vision with many practical applications. Various algorithms have been developed for this task; some rely on traditional techniques, while others are based on Deep Learning (DL). Both approaches have limitations, such as requiring camera calibration parameters, supporting only a limited distance estimation range, or requiring the object of interest to be relatively large for accurate distance estimation. Because of these drawbacks, such algorithms cannot be easily generalized to many practical applications. In this paper, we propose a hybrid monocular distance estimation framework that consists of the You Only Look Once version 7 (YOLOv7) algorithm for visual object detection and a linear regression model for distance estimation. For our use case, the framework is trained on our field-captured Unmanned Aerial Vehicle (UAV) dataset to detect UAVs and estimate their distance. The dataset includes videos of UAVs obtained from different Points of View (POV) using a Pan-Tilt-Zoom (PTZ) camera that captures and tracks UAVs in a large field of view. Video frames are synchronized with the range data obtained from a Radio Detection and Ranging (RADAR) sensor, which acts as ground truth for the regression model. The regression model is trained on input features such as bounding box coordinates, the average number of red, green, and blue pixels within the bounding box, and embedded features of detected objects obtained from YOLOv7; the regression targets are the RADAR range measurements. The trained UAV detection network has an mAP(0.5) of 0.854 and an mAP(0.5:0.95) of 0.595, and the distance estimation regressor has a Mean Squared Error (MSE) of 0.06375 on an independent test set. We validated this framework on our field dataset and demonstrated that our approach can detect UAVs and estimate their distance efficiently and accurately. This framework can be extended to any real-world monocular distance estimation use case simply by retraining the YOLOv7 model for the desired object detection class and the regression model for object-specific distance estimation.
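The regression stage described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`build_features`, `fit_linear`, `predict`, `mse`), the embedding length, the reading of the color feature as a per-channel mean over the cropped box, and the synthetic data are all assumptions made for illustration. No YOLOv7 call is made; the detector outputs are replaced by random placeholder vectors, and the RADAR ranges by a noise-free synthetic target.

```python
import numpy as np

def build_features(box, crop_rgb, embedding):
    """Concatenate bounding-box coordinates, mean R/G/B of the crop,
    and a detector embedding into one feature vector (hypothetical layout)."""
    box = np.asarray(box, dtype=float)
    mean_rgb = np.asarray(crop_rgb, dtype=float).reshape(-1, 3).mean(axis=0)
    embedding = np.asarray(embedding, dtype=float)
    return np.concatenate([box, mean_rgb, embedding])

def add_bias(X):
    """Append a column of ones so the model learns an intercept."""
    return np.hstack([X, np.ones((X.shape[0], 1))])

def fit_linear(X, y):
    """Ordinary least-squares fit; returns weights with the bias last."""
    w, *_ = np.linalg.lstsq(add_bias(X), y, rcond=None)
    return w

def predict(w, X):
    return add_bias(X) @ w

def mse(y_true, y_pred):
    return float(np.mean((y_true - y_pred) ** 2))

# Synthetic stand-ins for detector outputs (assumption: 4 box values,
# an 8x8 RGB crop, and a 4-dimensional embedding per detection).
rng = np.random.default_rng(0)
n_samples, emb_dim = 200, 4
X = np.stack([
    build_features(rng.random(4), rng.random((8, 8, 3)), rng.random(emb_dim))
    for _ in range(n_samples)
])

# Synthetic, noise-free "range" target standing in for RADAR measurements.
true_w = rng.normal(size=X.shape[1] + 1)
y = add_bias(X) @ true_w

# Fit on one split and evaluate on a held-out split.
X_train, y_train = X[:150], y[:150]
X_test, y_test = X[150:], y[150:]
w = fit_linear(X_train, y_train)
test_mse = mse(y_test, predict(w, X_test))
print(f"held-out MSE on synthetic data: {test_mse:.3e}")
```

Because the synthetic target is exactly linear in the features, the held-out error here only checks that the fitting code works; it says nothing about the MSE reported in the abstract, which depends on the real detections and RADAR data.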
Pages: 7