# DHD
**Repository Path**: chen-suzeyu/dhd
## Basic Information
- **Project Name**: DHD
- **Description**: 基于视觉的三维占位预测任务中,二维到三维的视图转换是必不可少的步骤。以前的方法大多是前向投影,如BEVPooling和VoxelPooling,它们都是将2D图像特征映射到3D网格中。然而,当前表示某一高度范围内特征的网格通常会引入许多属于其他高度范围的混淆特征。为了应对这一挑战,我们提出了深度高度解耦(DHD),这是一个新颖的框架,在过滤掉令人困惑的特征之前,它包含了显式高度。
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-07-01
- **Last Updated**: 2025-07-01
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
Accepted to ICRA 2025!
Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction
Yuan Wu1*;,
Zhiqiang Yan1*† ,
Zhengxue Wang1,
Xiang Li2,
Le Hui3,
Jian Yang1†
*equal contribution
†corresponding author
1Nanjing University of Science and Technology
2Nankai University
3Northwestern Polytechnical University
[Paper]
[Project Page]
## Method

**DHD** comprises a feature extractor, HeightNet, DepthNet, MGHS, SFA, and predictor. The feature extractor first acquires 2D image feature. Then, DepthNet extracts context feature and depth prediction. HeightNet generates the height map to determine the height value at each pixel. Next, MGHS integrates the output of HeightNet and DepthNet, acquiring height-refined feature and depth-based feature. Finally, the dual features are fed into the SFA to obtain the aggregated feature, which serves as input for the predictor.
## Get Started
### Installation and Data Preparation
Step1、Prepare environment as that in [Install](doc/install.md).
Step2、Prepare nuScene and generate pkl file by runing:
```python
python tools/create_data_bevdet.py
```
The final directory structure for 'data' folder is like
```shell
└── data
└── nuscenes
├── v1.0-trainval
├── maps
├── sweeps
├── samples
├── gts
├── bevdetv2-nuscenes_infos_train.pkl
└── bevdetv2-nuscenes_infos_val.pkl
```
### Train & Test
```shell
# train:
tools/dist_train.sh ${config} ${num_gpu}
# train DHD-S:
tools/dist_train.sh projects/configs/DHD/DHD-S.py 4
# test:
tools/dist_test.sh ${config} ${ckpt} ${num_gpu} --eval mAP
# test DHD-S:
tools/dist_test.sh projects/configs/DHD/DHD-S.py model_weight/DHD-S.pth 4 --eval mAP
```
### Model weights
The pretrained weights in 'ckpts' folder can be found here.
All DHD model weights can be found here.
## Experiment
### Quantitative comparison
### Visual comparison

## Acknowledgements
Many thanks to the authors of BEVDet and FlashOcc.
## Citation
```
@article{wu2024deep,
title={Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction},
author={Wu, Yuan and Yan, Zhiqiang and Wang, Zhengxue and Li, Xiang and Hui, Le and Yang, Jian},
journal={arXiv preprint arXiv:2409.07972},
year={2024}
}
```