# SparseDrive
**Repository Path**: wenb11/SparseDrive
## Basic Information
- **Project Name**: SparseDrive
- **Description**: Comprises a symmetric sparse perception module and a parallel motion planner. The sparse perception module unifies detection, tracking, and online mapping through a symmetric model architecture, learning a fully sparse representation of the driving scene. For motion prediction and planning, planning is modeled as a multi-modal problem, and a hierarchical planning selection strategy is proposed, including a collision-aware rescore module, to select a reasonable and safe trajectory as the final planning output. SparseDrive greatly surpasses previous state-of-the-art methods on all tasks while achieving much higher training and inference efficiency.
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 1
- **Created**: 2024-12-20
- **Last Updated**: 2025-01-22
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
[Demo video](https://github.com/swc-17/SparseDrive/assets/64842878/867276dc-7c19-4e01-9a8e-81c4ed844745)
## News
* **`24 June, 2024`:** We reorganize the code for better readability. Code & Models are released.
* **`31 May, 2024`:** We release the SparseDrive paper on [arXiv](https://arxiv.org/abs/2405.19620). Code & Models will be released in June, 2024. Please stay tuned!
## Introduction
> SparseDrive is a Sparse-Centric paradigm for end-to-end autonomous driving.
- We explore a sparse scene representation for end-to-end autonomous driving and propose a Sparse-Centric paradigm named SparseDrive, which unifies multiple tasks with a sparse instance representation.
- We revisit the great similarity shared between motion prediction and planning, which leads to a parallel design for the motion planner. We further propose a hierarchical planning selection strategy incorporating a collision-aware rescore module to boost planning performance (see the sketch after this list).
- On the challenging nuScenes benchmark, SparseDrive surpasses previous SOTA methods on all metrics, especially the safety-critical collision rate, while achieving much higher training and inference efficiency.
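As a rough illustration only, here is a minimal sketch of what a collision-aware rescore can look like, under simplifying assumptions of ours: axis-aligned boxes, hypothetical shapes, and hypothetical names (`collision_aware_rescore`, `ego_half_lw`); the repository's actual module is more involved.

```python
import numpy as np

def collision_aware_rescore(trajs, scores, obstacles, ego_half_lw=(2.0, 1.0)):
    """Pick the highest-confidence candidate that avoids predicted agents.

    trajs:       (M, T, 2) candidate ego trajectories in BEV (x, y), metres.
    scores:      (M,) planner confidence per candidate.
    obstacles:   (N, T, 2) predicted agent centres per future timestep.
    ego_half_lw: hypothetical ego half-length / half-width, metres.
    """
    half_l, half_w = ego_half_lw
    rescored = scores.copy()
    for m in range(trajs.shape[0]):
        # Axis-aligned overlap test per timestep; a faithful check would
        # use oriented boxes for both the ego vehicle and the agents.
        dx = np.abs(obstacles[:, :, 0] - trajs[m, :, 0])  # (N, T)
        dy = np.abs(obstacles[:, :, 1] - trajs[m, :, 1])
        if np.any((dx < half_l) & (dy < half_w)):
            rescored[m] = 0.0  # colliding candidate: drop its score
    # If every candidate collides, argmax falls back to index 0.
    return trajs[int(np.argmax(rescored))]
```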
*Overview of SparseDrive.* SparseDrive first encodes multi-view images into feature maps, then learns a sparse scene representation through symmetric sparse perception, and finally performs motion prediction and planning in a parallel manner. An instance memory queue is devised for temporal modeling.

*Model architecture of symmetric sparse perception*, which unifies detection, tracking and online mapping in a symmetric structure.

*Model structure of the parallel motion planner*, which performs motion prediction and planning simultaneously and outputs a safe planning trajectory.
padding: 2px;">Overview of SparseDrive. SparseDrive first encodes multi-view images into feature maps,
then learns sparse scene representation through symmetric sparse perception, and finally perform
motion prediction and planning in a parallel manner. An instance memory queue is devised for
temporal modeling.
Model architecture of symmetric sparse perception, which unifies detection, tracking and
online mapping in a symmetric structure.
Model structure of parallel motion planner, which performs motion prediction and planning
simultaneously and outputs safe planning trajectory.
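To make the three-stage data flow in the overview caption concrete, here is a minimal sketch; the class name, constructor arguments, and tensor shapes below are illustrative assumptions of ours, not the repository's actual API.

```python
import torch.nn as nn

class SparseDrivePipelineSketch(nn.Module):
    """Illustrative three-stage data flow; not the repo's real modules."""

    def __init__(self, img_encoder, sparse_perception, motion_planner, queue_len=4):
        super().__init__()
        self.img_encoder = img_encoder              # multi-view backbone + neck
        self.sparse_perception = sparse_perception  # detection / tracking / mapping
        self.motion_planner = motion_planner        # parallel motion + planning heads
        self.queue_len = queue_len
        self.memory_queue = []                      # instance memory for temporal modeling

    def forward(self, multi_view_imgs):
        # 1) Encode all camera views into feature maps, e.g. (B, N_cam, C, H, W).
        feats = self.img_encoder(multi_view_imgs)
        # 2) Learn sparse instance representations (agents and map elements),
        #    conditioned on instances cached from previous frames.
        instances = self.sparse_perception(feats, self.memory_queue)
        # 3) Predict agent motion and plan the ego trajectory in parallel.
        motion, plan = self.motion_planner(instances)
        # Cache current instances for temporal modeling on the next frame.
        self.memory_queue = (self.memory_queue + [instances])[-self.queue_len:]
        return motion, plan
```

Any callables with matching signatures can stand in for the three stages when tracing shapes through this sketch.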
## Results in the paper
- Comprehensive results for all tasks on [nuScenes](https://github.com/nutonomy/nuscenes-devkit).
| Method | NDS | AMOTA | minADE (m) | L2 (m) Avg | Col. (%) Avg | Training Time (h) | FPS |
| :---: | :---:| :---: | :---: | :---: | :---: | :---: | :---: |
| UniAD | 0.498 | 0.359 | 0.71 | 0.73 | 0.61 | 144 | 1.8 |
| SparseDrive-S | 0.525 | 0.386 | 0.62 | 0.61 | 0.08 | **20** | **9.0** |
| SparseDrive-B | **0.588** | **0.501** | **0.60** | **0.58** | **0.06** | 30 | 7.3 |
- Open-loop planning results on [nuScenes](https://github.com/nutonomy/nuscenes-devkit).
| Method | L2 (m) 1s | L2 (m) 2s | L2 (m) 3s | L2 (m) Avg | Col. (%) 1s | Col. (%) 2s | Col. (%) 3s | Col. (%) Avg | FPS |
| :---: | :---: | :---: | :---: | :---:| :---: | :---: | :---: | :---: | :---: |
| UniAD | 0.45 | 0.70 | 1.04 | 0.73 | 0.62 | 0.58 | 0.63 | 0.61 | 1.8 |
| VAD | 0.41 | 0.70 | 1.05 | 0.72 | 0.03 | 0.19 | 0.43 | 0.21 |4.5 |
| SparseDrive-S | **0.29** | 0.58 | 0.96 | 0.61 | 0.01 | 0.05 | 0.18 | 0.08 | **9.0** |
| SparseDrive-B | **0.29** | **0.55** | **0.91** | **0.58** | **0.01** | **0.02** | **0.13** | **0.06** | 7.3 |
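Conventions for these open-loop numbers vary slightly across codebases (error taken at the horizon step vs. averaged up to it); as a minimal sketch under our own assumptions (2 Hz sampling, error at the horizon step, hypothetical function name), the L2 columns above could be computed like this:

```python
import numpy as np

def l2_at_horizons(plan, gt, hz=2, horizons_s=(1, 2, 3)):
    """Open-loop planning L2 errors at fixed horizons.

    plan, gt: (T, 2) planned / ground-truth ego positions in BEV, metres,
              sampled `hz` times per second (nuScenes planning uses 2 Hz).
    Returns the per-horizon errors and their average, in metres.
    """
    dists = np.linalg.norm(plan - gt, axis=-1)        # (T,) per-step L2
    errs = [float(dists[h * hz - 1]) for h in horizons_s]
    return errs, float(np.mean(errs))
```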
## Results of released checkpoint
We found some missed detections in the collision-rate evaluation of the previous code, so we re-implemented the evaluation metric in the released code and provide updated results below.
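For context on what such an evaluation checks, here is a minimal sketch of a BEV occupancy collision test; the grid resolution, frame convention, centre-only lookup, and function name are simplifying assumptions of ours (a faithful implementation rasterizes oriented ground-truth boxes and checks the full ego footprint):

```python
import numpy as np

def collided(plan, occ, grid_res=0.5):
    """True if the planned ego positions touch occupied BEV cells.

    plan: (T, 2) planned ego centres, metres, in the occupancy frame.
    occ:  (T, H, W) boolean occupancy rasterized from ground-truth boxes,
          one map per future timestep, origin at grid cell (0, 0).
    """
    T, H, W = occ.shape
    for t in range(T):
        i = int(plan[t, 1] / grid_res)  # row index from y, metres -> cells
        j = int(plan[t, 0] / grid_res)  # column index from x
        if 0 <= i < H and 0 <= j < W and occ[t, i, j]:
            return True
    return False

# Collision rate = mean of `collided(...)` over all evaluated samples.
```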
### Main results
| Model | config | ckpt | log | det: NDS | mapping: mAP | track: AMOTA | track: AMOTP | motion: EPA_car | motion: minADE_car | motion: minFDE_car | motion: MissRate_car | planning: CR (%) | planning: L2 (m) |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Stage1 | [cfg](projects/configs/sparsedrive_small_stage1.py) | [ckpt](https://github.com/swc-17/SparseDrive/releases/download/v1.0/sparsedrive_stage1.pth) | [log](https://github.com/swc-17/SparseDrive/releases/download/v1.0/sparsedrive_stage1_log.txt) | 0.5260 | 0.5689 | 0.385 | 1.260 | | | | | | |
| Stage2 | [cfg](projects/configs/sparsedrive_small_stage2.py) | [ckpt](https://github.com/swc-17/SparseDrive/releases/download/v1.0/sparsedrive_stage2.pth) | [log](https://github.com/swc-17/SparseDrive/releases/download/v1.0/sparsedrive_stage2_log.txt) | 0.5257 | 0.5656 | 0.372 | 1.248 | 0.492 | 0.61 | 0.95 | 0.133 | 0.097 | 0.61 |
### Detailed results for planning
| Method | L2 (m) 1s | L2 (m) 2s | L2 (m) 3s | L2 (m) Avg | Col. (%) 1s | Col. (%) 2s | Col. (%) 3s | Col. (%) Avg |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| UniAD | 0.45 | 0.70 | 1.04 | 0.73 | 0.66 | 0.66 | 0.72 | 0.68 |
| UniAD-wo-post-optim | 0.32 | 0.58 | 0.94 | 0.61 | 0.17 | 0.27 | 0.42 | 0.29 |
| VAD | 0.41 | 0.70 | 1.05 | 0.72 | 0.03 | 0.21 | 0.49 | 0.24 |
| SparseDrive-S | 0.30 | 0.58 | 0.95 | 0.61 | 0.01 | 0.05 | 0.23 | 0.10 |
## Quick Start
[Quick Start](docs/quick_start.md)
## Citation
If you find SparseDrive useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.
```bibtex
@article{sun2024sparsedrive,
title={SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation},
author={Sun, Wenchao and Lin, Xuewu and Shi, Yining and Zhang, Chuang and Wu, Haoran and Zheng, Sifa},
journal={arXiv preprint arXiv:2405.19620},
year={2024}
}
```
## Acknowledgement
- [Sparse4D](https://github.com/HorizonRobotics/Sparse4D)
- [UniAD](https://github.com/OpenDriveLab/UniAD)
- [VAD](https://github.com/hustvl/VAD)
- [StreamPETR](https://github.com/exiawsh/StreamPETR)
- [StreamMapNet](https://github.com/yuantianyuan01/StreamMapNet)
- [mmdet3d](https://github.com/open-mmlab/mmdetection3d)