Efficient and Scalable Object Localization in 3D on Mobile Device

被引:4
|
作者
Gupta, Neetika [1 ]
Khan, Naimul Mefraz [1 ]
机构
[1] Toronto Metropolitan Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
object localization; object detection; ARCore;
D O I
10.3390/jimaging8070188
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Two-Dimensional (2D) object detection has been an intensely discussed and researched field of computer vision. With numerous advancements made in the field over the years, we still need to identify a robust approach to efficiently conduct classification and localization of objects in our environment by just using our mobile devices. Moreover, 2D object detection limits the overall understanding of the detected object and does not provide any additional information in terms of its size and position in the real world. This work proposes an object localization solution in Three-Dimension (3D) for mobile devices using a novel approach. The proposed method works by combining a 2D object detection Convolutional Neural Network (CNN) model with Augmented Reality (AR) technologies to recognize objects in the environment and determine their real-world coordinates. We leverage the in-built Simultaneous Localization and Mapping (SLAM) capability of Google's ARCore to detect planes and know the camera information for generating cuboid proposals from an object's 2D bounding box. The proposed method is fast and efficient for identifying everyday objects in real-world space and, unlike mobile offloading techniques, the method is well designed to work with limited resources of a mobile device.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Rapid and scalable 3D object recognition using LIDAR data
    Matei, Bogdan C.
    Tan, Yi
    Sawhney, Harpreet S.
    Kumar, Rakesh
    AUTOMATIC TARGET RECOGNITION XVI, 2006, 6234
  • [22] A Scalable Modular Architecture of 3D Object Acquisition for Manufacturing Automation
    Luo, Ren C.
    Kuo, Chia-Wen
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2015, : 269 - 274
  • [23] SliceNets - A Scalable Approach for Object Detection in 3D CT Scans
    Yang, Anqi
    Pan, Feng
    Saragadam, Vishwanath
    Duy Dao
    Hui, Zhuo
    Chang, Jen-Hao Rick
    Sankaranarayanan, Aswin C.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 335 - 344
  • [24] Efficient object detection by prediction in 3D space
    Pang, Yanwei
    Jiang, Xiaoheng
    Li, Xuelong
    Pan, Jing
    SIGNAL PROCESSING, 2015, 112 : 64 - 73
  • [25] 3D object watermarking by a 3D hidden object
    Kishk, S
    Javidi, B
    OPTICS EXPRESS, 2003, 11 (08): : 874 - 888
  • [26] MeshReduce: Scalable and Bandwidth Efficient 3D Scene Capture
    Jin, Tao
    Dasari, Mallesham
    Smith, Connor
    Apicharttrisorn, Kittipat
    Seshan, Srinivasan
    Rowe, Anthony
    2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES, VR 2024, 2024, : 20 - 30
  • [27] Automatic 3D Object Recognition and Localization for Robotic Grasping
    Santo, Bruno
    Antao, Liliana
    Goncalves, Gil
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (ICINCO), 2021, : 416 - 425
  • [28] Delving into Localization Errors for Monocular 3D Object Detection
    Ma, Xinzhu
    Zhang, Yinmin
    Xu, Dan
    Zhou, Dongzhan
    Yi, Shuai
    Li, Haojie
    Ouyang, Wanli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4719 - 4728
  • [29] Monocular Visual Object 3D Localization in Road Scenes
    Wang, Yizhou
    Huang, Yen-Ting
    Hwang, Jenq-Neng
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 917 - 925
  • [30] Automatic localization of curvilinear object in 3D ultrasound images
    Barva, M
    Kybic, J
    Mari, JB
    Cachard, CB
    Hlavác, V
    MEDICAL IMAGING 2005: ULTRASONIC IMAGING AND SIGNAL PROCESSING, 2005, 5750 : 455 - 462