Robust 3D Object Detection from LiDAR Point Cloud Data with Spatial Information Aggregation

Nerea Aranjuelo Ansa; Guus Engels; Luis Unzueta Irurtia; Ignacio Arganda-Carreras; Marcos Nieto Doncel; Oihana Otaegui Madurga

Ayuda

Robust 3D Object Detection from LiDAR Point Cloud Data with Spatial Information Aggregation

Nerea Aranjuelo ^[1] ^[3] ; Guus Engels ^[3] ; Luis Unzueta ^[3] ; Ignacio Arganda-Carreras ^[1] ^[4] ^[2] ; Marcos Nieto ^[3] ; Oihana Otaegui ^[3]
1. [1] Universidad del País Vasco/Euskal Herriko Unibertsitatea
  
  Universidad del País Vasco/Euskal Herriko Unibertsitatea
  
  Leioa, España
2. [2] Donostia International Physics Center
  
  Donostia International Physics Center
  
  San Sebastián, España
3. [3] Vicomtech, Basque Research and Technology Alliance (BRTA, San Sebastian)
4. [4] Ikerbasque, Basque Foundation for Science (Bilbao)
Mostrar afiliaciones +
Localización: 15th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2020): Burgos, Spain ; September 2020 / coord. por Álvaro Herrero Cosío, Carlos Cambra Baseca, Daniel Urda Muñoz, Javier Sedano Franco, Héctor Quintián Pardo, Emilio Santiago Corchado Rodríguez, 2021, ISBN 978-3-030-57802-2, págs. 813-823
Idioma: inglés
Texto completo no disponible (Saber más ...)
Resumen
- Current 3D object detectors from Bird’s Eye View (BEV) LiDAR point cloud data rely on Convolutional Neural Networks (CNNs), which have originally been designed for camera images. Therefore, they look for the same target features, regardless of the position of the objects with respect to the sensor. Discarding this spatial information makes 3D object detection unreliable and not robust, because objects in LiDAR point clouds contain distance dependent features. The position of a group of points can be decisive to know if they represent an object or not. To solve this, we propose a network extension called FeatExt operation that enables the model to be aware of both the target objects features and their spatial location. FeatExt operation expands a group of feature maps extracted from a BEV representation to include the distance to a specific position of interest in the scene, in this case the distance with respect to the LiDAR. When adding the proposed operation to a baseline network in an intermediate fusion fashion, it shows up to an 8.9 average precision boost in the KITTI BEV benchmark. Our proposal can be easily added to improve existing object detection networks.