Learning graph-based representations for scene flow estimation

Authors
Mingliang Zhai
Hao Gao
Ye Liu
Jianhui Nie
Kang Ni
Affiliations
[1] Nanjing University of Posts and Telecommunications, School of Automation
[2] Nanjing University of Posts and Telecommunications, School of Computer Science
Source
Multimedia Tools and Applications, 2024, 83 (03)
Keywords
Deep learning; Scene flow estimation; Graph convolutional networks; 3D point cloud; Scene understanding
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Scene flow estimation is a fundamental task in autonomous driving. Compared with optical flow, scene flow provides 3D motion information about the dynamic scene. With the increasing popularity of 3D LiDAR sensors and deep learning, LiDAR-based scene flow estimation methods have achieved strong results on public benchmarks. Current methods usually adopt multilayer perceptrons (MLPs) or conventional convolution-like operations for feature extraction. However, these methods do not adequately exploit the characteristics of point clouds, so some key semantic and geometric structures are not well captured. To address this issue, we propose using graph convolution to exploit structural features adaptively. In particular, multiple graph-based feature generators and a graph-based flow refinement module are deployed to encode geometric relations among points. Furthermore, residual connections are used in the graph-based feature generator to enhance feature representation and deep supervision of the graph-based network. In addition, to focus on short-term dependencies, we introduce a single gate-based recurrent unit to refine scene flow predictions iteratively. The proposed network is trained on the FlyingThings3D dataset and evaluated on the FlyingThings3D, KITTI, and Argoverse datasets. Comprehensive experiments show that all proposed components contribute to scene flow estimation performance, and that our method achieves competitive performance compared with recent approaches.
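The abstract does not specify the graph convolution operator, so the following is only a minimal NumPy sketch of the general idea it describes: build a k-nearest-neighbour graph over the points, aggregate edge features, and add a residual connection. The EdgeConv-style edge feature, the max aggregation, and the names `knn_indices`, `graph_conv_residual`, and `weight` are illustrative assumptions, not the paper's implementation.

```python
import numpy as np


def knn_indices(points, k):
    """Indices of the k nearest neighbours of each point, excluding the point itself."""
    diff = points[:, None, :] - points[None, :, :]   # (N, N, 3) pairwise offsets
    dist2 = np.sum(diff * diff, axis=-1)             # (N, N) squared distances
    np.fill_diagonal(dist2, np.inf)                  # a point is not its own neighbour
    return np.argsort(dist2, axis=1)[:, :k]          # (N, k)


def graph_conv_residual(points, feats, weight, k=4):
    """One EdgeConv-style graph convolution with a residual connection (assumed design).

    points: (N, 3) coordinates, feats: (N, C) features, weight: (2C, C) linear map.
    """
    idx = knn_indices(points, k)                               # (N, k)
    neigh = feats[idx]                                         # (N, k, C) neighbour features
    center = np.broadcast_to(feats[:, None, :], neigh.shape)   # (N, k, C) centre features
    edge = np.concatenate([center, neigh - center], axis=-1)   # (N, k, 2C) edge features
    msg = np.maximum(edge @ weight, 0.0)                       # linear map + ReLU per edge
    agg = msg.max(axis=1)                                      # max over neighbours, (N, C)
    return feats + agg                                         # residual connection


# Hypothetical toy input: 16 random points with 8-dimensional features.
rng = np.random.default_rng(0)
points = rng.normal(size=(16, 3))
feats = rng.normal(size=(16, 8))
weight = rng.normal(size=(16, 8)) * 0.1
out = graph_conv_residual(points, feats, weight, k=4)
print(out.shape)
```

The residual form keeps the input features intact when the learned update is zero, which is one common reason to use skip connections when stacking several graph layers.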
Pages: 7317 - 7334
Page count: 17
Related papers
50 in total
  • [1] Learning graph-based representations for scene flow estimation
    Zhai, Mingliang
    Gao, Hao
    Liu, Ye
    Nie, Jianhui
    Ni, Kang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 7317 - 7334
  • [2] Video scene detection using graph-based representations
    Sakarya, Ufuk
    Telatar, Ziya
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (10) : 774 - 783
  • [3] Similarity learning for graph-based image representations
    de Mauro, C
    Diligenti, M
    Gori, M
    Maggini, M
    PATTERN RECOGNITION LETTERS, 2003, 24 (08) : 1115 - 1122
  • [4] Learning Fair Representations for Recommendation: A Graph-based Perspective
    Wu, Le
    Chen, Lei
    Shao, Pengyang
    Hong, Richang
    Wang, Xiting
    Wang, Meng
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2198 - 2208
  • [5] Learning Graph-based Disentangled Representations for Next POI Recommendation
    Wang, Zhaobo
    Zhu, Yanmin
    Liu, Haobing
    Wang, Chunyang
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1154 - 1163
  • [6] Learning Tactile Models for Factor Graph-based Estimation
    Sodhi, Paloma
    Kaess, Michael
    Mukadam, Mustafa
    Anderson, Stuart
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13686 - 13692
  • [7] Graph-based representations of point clouds
    Natali, Mattia
    Biasotti, Silvia
    Patane, Giuseppe
    Falcidieno, Bianca
    GRAPHICAL MODELS, 2011, 73 : 151 - 164
  • [8] A Graph-Based Approach for Video Scene Detection
    Sakarya, Ufuk
    Telatar, Ziya
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 34 - +
  • [9] Complexities of Graph-Based Representations for Elementary Functions
    Nagayama, Shinobu
    Sasao, Tsutomu
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (01) : 106 - 119
  • [10] Graph-based Relational Learning
    NEC Laboratories Europe GmbH, Germany
    Unknown
    Unknown
    Unknown
    NEC Tech. J., 1 (101-105):