TR3D: TOWARDS REAL-TIME INDOOR 3D OBJECT DETECTION

被引:7
|
作者
Rukhovich, Danila [1 ]
Vorontsova, Anna [1 ]
Konushin, Anton [1 ]
机构
[1] Samsung Res, Beijing, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
3D object detection; indoor scene understanding; point clouds;
D O I
10.1109/ICIP49359.2023.10222644
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, sparse 3D convolutions have changed 3D object detection. Performing on par with the voting-based approaches, 3D CNNs are memory-efficient and scale to large scenes better. However, there is still room for improvement. With a conscious, practice-oriented approach to problem-solving, we analyze the performance of such methods and localize the weaknesses. Applying modifications that resolve the found issues one by one, we end up with TR3D: a fast fully-convolutional 3D object detection model trained end-to-end, that achieves state-of-the-art results on the standard benchmarks, ScanNet v2, SUN RGB-D, and S3DIS. Moreover, to take advantage of both point cloud and RGB inputs, we introduce an early fusion of 2D and 3D features. We employ our fusion module to make conventional 3D object detection methods multimodal and demonstrate an impressive boost in performance. Our model with early feature fusion, which we refer to as TR3D+FF, outperforms existing 3D object detection approaches on the SUN RGB-D dataset. Overall, besides being accurate, both TR3D and TR3D+FF models are lightweight, memory-efficient, and fast, thereby marking another milestone on the way toward real-time 3D object detection. Code is available at https://github.com/SamsungLabs/tr3d.
引用
收藏
页码:281 / 285
页数:5
相关论文
共 50 条
  • [41] Real-Time Plane Segmentation and Obstacle Detection of 3D Point Clouds for Indoor Scenes
    Wang, Zhe
    Liu, Hong
    Qian, Yueliang
    Xu, Tao
    COMPUTER VISION - ECCV 2012, PT II, 2012, 7584 : 22 - 31
  • [42] Real-time 3D Object Detection from Point Clouds using an RGB-D Camera
    Wang, Ya
    Xu, Shu
    Zell, Andreas
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 407 - 414
  • [43] A survey on 3D object detection in real time for autonomous driving
    Contreras, Marcelo
    Jain, Aayush
    Bhatt, Neel P.
    Banerjee, Arunava
    Hashemi, Ehsan
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [44] Real-time 3D object recognition for automatic tracker initialization
    Blaskó, G
    Fua, P
    IEEE AND ACM INTERNATIONAL SYMPOSIUM ON AUGMENTED REALITY, PROCEEDINGS, 2001, : 175 - 176
  • [45] 3D gesture based real-time object selection and recognition
    Raheja, Jagdish Lal
    Chandra, Mona
    Chaudhary, Ankit
    PATTERN RECOGNITION LETTERS, 2018, 115 : 14 - 19
  • [46] Real-Time 3D Visual Sensor for Robust Object Recognition
    Attamimi, Muhammad
    Mizutani, Akira
    Nakamura, Tomoaki
    Nagai, Takayuki
    Funakoshi, Kotaro
    Nakano, Mikio
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 4560 - 4565
  • [47] Towards Real-Time 3D Editable Model Generation for Existing Indoor Building Environments on a Tablet
    Arnaud, Adrien
    Gouiffes, Michele
    Ammi, Mehdi
    FRONTIERS IN VIRTUAL REALITY, 2022, 3
  • [48] Real-Time 3D Object Detection From Point Cloud Through Foreground Segmentation
    Wang, Bo
    Zhu, Ming
    Lu, Ying
    Wang, Jiarong
    Gao, Wen
    Wei, Hua
    IEEE ACCESS, 2021, 9 : 84886 - 84898
  • [49] 3D Attention Based YOLO-SWINF for Real-Time Video Object Detection
    Moturi, Pradeep
    Khanna, Mukund
    Singh, Kunal
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 491 - 502
  • [50] A real-time 3D multi-view rendering from a real-time 3D capture
    1600, Blackwell Publishing Ltd (44):