Ayuda
Ir al contenido

Dialnet


Robust 3D Object Detection from LiDAR Point Cloud Data with Spatial Information Aggregation

    1. [1] Universidad del País Vasco/Euskal Herriko Unibertsitatea

      Universidad del País Vasco/Euskal Herriko Unibertsitatea

      Leioa, España

    2. [2] Donostia International Physics Center

      Donostia International Physics Center

      San Sebastián, España

    3. [3] Vicomtech, Basque Research and Technology Alliance (BRTA, San Sebastian)
    4. [4] Ikerbasque, Basque Foundation for Science (Bilbao)
  • Localización: 15th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2020): Burgos, Spain ; September 2020 / coord. por Álvaro Herrero Cosío, Carlos Cambra Baseca, Daniel Urda Muñoz, Javier Sedano Franco, Héctor Quintián Pardo, Emilio Santiago Corchado Rodríguez, 2021, ISBN 978-3-030-57802-2, págs. 813-823
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • Current 3D object detectors from Bird’s Eye View (BEV) LiDAR point cloud data rely on Convolutional Neural Networks (CNNs), which have originally been designed for camera images. Therefore, they look for the same target features, regardless of the position of the objects with respect to the sensor. Discarding this spatial information makes 3D object detection unreliable and not robust, because objects in LiDAR point clouds contain distance dependent features. The position of a group of points can be decisive to know if they represent an object or not. To solve this, we propose a network extension called FeatExt operation that enables the model to be aware of both the target objects features and their spatial location. FeatExt operation expands a group of feature maps extracted from a BEV representation to include the distance to a specific position of interest in the scene, in this case the distance with respect to the LiDAR. When adding the proposed operation to a baseline network in an intermediate fusion fashion, it shows up to an 8.9 average precision boost in the KITTI BEV benchmark. Our proposal can be easily added to improve existing object detection networks.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno