pith. sign in

arxiv: 1710.07368 · v1 · pith:B7E4TZNEnew · submitted 2017-10-19 · 💻 cs.CV

SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud

classification 💻 cs.CV
keywords datalidarpointsegmentationsqueezesegtrainingaccuracycloud
0
0 comments X
read the original abstract

In this paper, we address semantic segmentation of road-objects from 3D LiDAR point clouds. In particular, we wish to detect and categorize instances of interest, such as cars, pedestrians and cyclists. We formulate this problem as a point- wise classification problem, and propose an end-to-end pipeline called SqueezeSeg based on convolutional neural networks (CNN): the CNN takes a transformed LiDAR point cloud as input and directly outputs a point-wise label map, which is then refined by a conditional random field (CRF) implemented as a recurrent layer. Instance-level labels are then obtained by conventional clustering algorithms. Our CNN model is trained on LiDAR point clouds from the KITTI dataset, and our point-wise segmentation labels are derived from 3D bounding boxes from KITTI. To obtain extra training data, we built a LiDAR simulator into Grand Theft Auto V (GTA-V), a popular video game, to synthesize large amounts of realistic training data. Our experiments show that SqueezeSeg achieves high accuracy with astonishingly fast and stable runtime (8.7 ms per frame), highly desirable for autonomous driving applications. Furthermore, additionally training on synthesized data boosts validation accuracy on real-world data. Our source code and synthesized data will be open-sourced.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments

    cs.CV 2026-05 unverdicted novelty 5.0

    Gaussian and linear cropping strategies for large point clouds improve 3D neural network performance over spherical crops, especially in outdoor scenes, and achieve new state-of-the-art results.

  2. From Spherical to Gaussian: A Comparative Analysis of Point Cloud Cropping Strategies in Large-Scale 3D Environments

    cs.CV 2026-05 unverdicted novelty 5.0

    Gaussian and related cropping strategies for point cloud subclouds improve 3D neural network performance over spherical cropping on large outdoor scenes.

  3. SynthCity: A large scale synthetic point cloud

    cs.CV 2019-07 unverdicted novelty 5.0

    SynthCity is a 367.9M point synthetic full-colour Mobile Laser Scanning point cloud with per-point labels from nine categories, generated in Blender for an urban environment.

  4. How much real data do we actually need: Analyzing object detection performance using synthetic and real data

    cs.CV 2019-07 unverdicted novelty 3.0

    Synthetic data can partially substitute for real data in object detection training, with performance tied to domain similarity and the volume of real data included.