HybridPillars: Hybrid Point-Pillar Network for Real-time Two-stage 3D Object Detection

Abstract: LiDAR-based 3D object detection is an important perceptual task in various fields such as intelligent transportation, autonomous driving and robotics. Existing two-stage point-voxel methods contribute to the boost of accuracy on 3D object detection by utilizing precise point-wise features to refine 3D proposals. Although obtaining promising results, these methods are not suitable for real-time applications. Firstly, the inference speed of existing point-voxel hybrid frameworks is slow because the acquisition of point features from voxel features consumes a lot of time. Secondly, existing point-voxel methods rely on 3D convolution for voxel feature learning, which increases the difficulty of deployment on embedded computing platforms. To address these issues, we propose a real-time two-stage detection network, named HybridPillars. We first propose a novel hybrid framework by integrating a point feature encoder into a point-pillar pipeline efficiently. By combining point-based and pillar-based networks, our method can discard 3D convolution to reduce computational complexity. Furthermore, we propose a novel pillar feature aggregation network to efficiently extract BEV features from point-wise features, thereby significantly enhancing the performance of our network. Extensive experiments demonstrate that our proposed HybridPillars not only boosts the inference speed, but also achieves competitive detection performance compared with other methods.

1. Recommended Environment

We have tested this project with the following environments:

Ubuntu 18.04
Python 3.7.13
PyTorch 1.7.0, cuda 11.0 version
CUDA NVCC 11.1
Spconv 2.1.21

2. Installation

pip install -r requirement.txt
bash compile.sh

3. Prepare Data

Prepare KITTI dataset and road planes

# Download KITTI and organize it into the following form:
├── data
│   ├── kitti
│   │   │── ImageSets
│   │   │── training
│   │   │   ├──calib & velodyne & label_2 & image_2 & (optional: planes)
│   │   │── testing
│   │   │   ├──calib & velodyne & image_2

# Generatedata infos:
python -m pcdet.datasets.kitti.kitti_dataset create_kitti_infos tools/cfgs/dataset_configs/kitti_dataset.yaml

4. Train

cd tools
# a. train the two-stage model
python ./train.py --cfg_file ./cfg/kitti_models/hybridpillars.yaml

or

# b. train the single-stage model
python ./train.py --cfg_file ./cfg/kitti_models/hybridpillars-ssd.yaml

Support single or multiple GPUs training.

5. Test

python test.py --cfg-file ${CONFIG_FILE} --ckpt ${CKPT}

6. FLOPs Calculation Method

Please following link 1 and link 2 to install thop with SPCONV extension
We provide an API for FLOPs Calculation

from pcdet.utils.spconv_utils import spconv
from thop import profile, clever_format, profile_acts

def cal_flops(model, batch_dict):
    macs, params, acts = profile_acts(model, inputs=(batch_dict,),
                           custom_ops={
                            spconv.SubMConv3d: spconv.SubMConv3d.count_your_model,
                            spconv.SparseConv3d: spconv.SparseConv3d.count_your_model,
                            spconv.SubMConv2d: spconv.SubMConv2d.count_your_model,
                            spconv.SparseConv2d: spconv.SparseConv2d.count_your_model}
                           )
    return macs, params, acts

...

macs, params, acts = cal_flops(model, data_dict)

7. Acknowledgement

Thanks for the OpenPCDet, this implementation is mainly based on the pcdet v0.6.0.
Parts of our code refer to the excellent work IA-SSD.

License

This project is released under the Apache 2.0 license.

Citation

Coming soon.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
docs		docs
pcdet		pcdet
tools		tools
.gitignore		.gitignore
README.md		README.md
compile.sh		compile.sh
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HybridPillars: Hybrid Point-Pillar Network for Real-time Two-stage 3D Object Detection

1. Recommended Environment

2. Installation

3. Prepare Data

4. Train

5. Test

6. FLOPs Calculation Method

7. Acknowledgement

License

Citation

About

Releases

Packages

Languages

huangzhicong3/HybridPillars

Folders and files

Latest commit

History

Repository files navigation

HybridPillars: Hybrid Point-Pillar Network for Real-time Two-stage 3D Object Detection

1. Recommended Environment

2. Installation

3. Prepare Data

4. Train

5. Test

6. FLOPs Calculation Method

7. Acknowledgement

License

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages