Multi-view PointNet for 3D Scene Understanding

Maximilian Jaritz; Jiayuan Gu; Hao Su

Conference paper. Year: 2019

Maximilian Jaritz

Role: Author
PersonId : 1021301

Robotics & Intelligent Transportation Systems

Valeo Driving Assistance Domain

Jiayuan Gu

Role: Author

Department of Computer Science and Engineering [Univ California San Diego]

Hao Su

Role: Author

Department of Computer Science and Engineering [Univ California San Diego]

Abstract

Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds. However, fusion is challenging because 2D and 3D data live in different spaces. In this work, we propose MVPNet (Multi-View PointNet), which aggregates 2D multi-view image features into 3D point clouds and then uses a point-based network to fuse the features in 3D canonical space and predict 3D semantic labels. To this end, we introduce view selection along with a 2D-3D feature aggregation module. Extensive experiments show the benefit of leveraging features from dense images and reveal superior robustness to varying point cloud density compared to 3D-only methods. On the ScanNetV2 benchmark, MVPNet significantly outperforms prior point-cloud-based approaches on the task of 3D semantic segmentation. It is also much faster to train than the large networks used by sparse voxel approaches. We provide solid ablation studies to ease the future design of 2D-3D fusion methods and their extension to other tasks, as we showcase for 3D instance segmentation.
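
To make the idea of lifting multi-view image features onto a point cloud more concrete, here is a minimal NumPy sketch. It is not the aggregation module of the paper, whose details are not given in the abstract; it assumes a plain pinhole projection, nearest-pixel sampling, averaging over views, and no occlusion test. The function name `aggregate_view_features` and all of its parameters are hypothetical.

```python
import numpy as np


def aggregate_view_features(points, feature_maps, intrinsics, world_to_cam):
    """Average per-view 2D features onto 3D points (hypothetical helper).

    points:       (N, 3) points in world coordinates
    feature_maps: (V, H, W, C) one 2D feature map per view
    intrinsics:   (V, 3, 3) pinhole camera matrices
    world_to_cam: (V, 4, 4) rigid transforms from world to camera frame
    Returns (N, C) averaged features and (N,) count of views hitting each point.
    """
    num_points = points.shape[0]
    num_views, height, width, channels = feature_maps.shape
    feat_sum = np.zeros((num_points, channels))
    hits = np.zeros(num_points, dtype=np.int64)
    homog = np.concatenate([points, np.ones((num_points, 1))], axis=1)

    for v in range(num_views):
        # Move the points into the camera frame of view v.
        cam = (world_to_cam[v] @ homog.T).T[:, :3]
        in_front = cam[:, 2] > 1e-6
        depth = np.where(in_front, cam[:, 2], 1.0)  # avoid division by zero

        # Pinhole projection to pixel coordinates, rounded to the nearest pixel.
        pix = (intrinsics[v] @ cam.T).T
        cols = np.round(pix[:, 0] / depth).astype(np.int64)
        rows = np.round(pix[:, 1] / depth).astype(np.int64)

        # Assumption: a point counts as seen if it projects inside the image.
        # Occlusion is ignored; a depth-map check would be needed in practice.
        visible = (in_front & (cols >= 0) & (cols < width)
                   & (rows >= 0) & (rows < height))
        feat_sum[visible] += feature_maps[v, rows[visible], cols[visible]]
        hits += visible

    # Points seen by no view keep a zero feature vector.
    return feat_sum / np.maximum(hits, 1)[:, None], hits
```

The returned per-point features could then be concatenated with the point coordinates and passed to a point-based network, which is the role the abstract assigns to the 3D fusion stage.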

Domains

Computer Vision and Pattern Recognition [cs.CV]

https://inria.hal.science/hal-02387461

Submitted on: Friday, 29 November 2019, 17:55:39

Last modified on: Thursday, 11 January 2024, 11:24:04

Dates and versions

hal-02387461 , version 1 (29-11-2019)

Identifiers

HAL Id: hal-02387461, version 1
arXiv: 1909.13603

Cite

Maximilian Jaritz, Jiayuan Gu, Hao Su. Multi-view PointNet for 3D Scene Understanding. Proceedings of the IEEE International Conference on Computer Vision Workshops, Oct 2019, Seoul, South Korea. ⟨hal-02387461⟩

Collections

INRIA INRIA2
