The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion

💡DistillationDPO💡

by An Zhao¹, Shengyuan Zhang¹, Ling Yang², Zejian Li*¹, Jiale Wu¹, Haoran Xu³, AnYang Wei³, Perry Pengyun GU³, Lingyun Sun¹

¹Zhejiang University  ²Peking University  ³Zhejiang Green Zhixing Technology Co., Ltd.

Abstract

The application of diffusion models in 3D LiDAR scene completion is limited by diffusion's slow sampling speed. Score distillation accelerates diffusion sampling but degrades performance, while post-training with direct preference optimization (DPO) boosts performance using preference data. This paper proposes Distillation-DPO, a novel diffusion distillation framework for LiDAR scene completion with preference alignment. First, the student model generates paired completion scenes from different initial noises. Second, using LiDAR scene evaluation metrics as the preference signal, we construct winning and losing sample pairs. This construction is reasonable, since most LiDAR scene metrics are informative but non-differentiable and therefore cannot be optimized directly. Third, Distillation-DPO optimizes the student model by exploiting the difference in score functions between the teacher and student models on the paired completion scenes. This procedure is repeated until convergence. Extensive experiments demonstrate that, compared to state-of-the-art LiDAR scene completion diffusion models, Distillation-DPO achieves higher-quality scene completion while accelerating the completion speed by more than 5-fold. To the best of our knowledge, our method is the first to explore adopting preference learning in distillation, and it provides insights into preference-aligned distillation.
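
For intuition only, the sketch below mirrors the procedure described in the abstract: sample a pair of completions, rank them with a scene metric, and push the student's score function toward the teacher's more strongly on the winning sample. All names (`student.sample`, `teacher.score`, `metric`, `beta`) are hypothetical placeholders, not the repo's actual API or loss.

```python
# Illustrative sketch of one Distillation-DPO update as described in the abstract.
# `student`, `teacher`, and `metric` are hypothetical stand-ins, not the repo's API.
import torch

def distillation_dpo_step(student, teacher, partial_scan, metric, beta=1.0):
    # 1) The student completes the same partial scan from two different initial noises.
    x_a = student.sample(partial_scan, noise=torch.randn(student.noise_shape))
    x_b = student.sample(partial_scan, noise=torch.randn(student.noise_shape))

    # 2) A (possibly non-differentiable) LiDAR scene metric ranks the pair.
    win, lose = (x_a, x_b) if metric(x_a) >= metric(x_b) else (x_b, x_a)

    # 3) The loss exploits the teacher-student score gap on the paired scenes,
    #    weighting the winning completion against the losing one.
    t = torch.randint(0, student.num_steps, (1,))
    gap = lambda x: (teacher.score(x, t) - student.score(x, t)).pow(2).mean()
    return gap(win) - beta * gap(lose)
```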

Environment setup

The following commands are tested with Python 3.8 and CUDA 11.1.

Install required packages:

sudo apt install build-essential python3-dev libopenblas-dev

pip3 install -r requirements.txt

Install MinkowskiEngine for sparse tensor processing:

pip3 install -U MinkowskiEngine==0.5.4 --install-option="--blas=openblas" -v --no-deps

Set up the code from the repository's root directory:

pip3 install -U -e .
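
If the installation succeeded, a quick import check (optional, just a convenience) should run without errors:

```python
# Optional sanity check: both packages import and CUDA is visible to PyTorch.
import torch
import MinkowskiEngine as ME

print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("MinkowskiEngine", ME.__version__)
```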

Training

We use the SemanticKITTI dataset for training.

The SemanticKITTI dataset has to be downloaded from the official site and extracted in the following structure:

DistillationDPO/
└── datasets/
    └── SemanticKITTI/
        └── dataset/
            └── sequences/
                ├── 00/
                │   ├── velodyne/
                │   │   ├── 000000.bin
                │   │   ├── 000001.bin
                │   │   └── ...
                │   └── labels/
                │       ├── 000000.label
                │       ├── 000001.label
                │       └── ...
                ├── 01/
                │   ...
                ...
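
A small script like the following (illustrative, not part of the repo) can confirm the layout matches the structure above:

```python
# Illustrative check that each sequence folder contains velodyne scans and labels.
from pathlib import Path

root = Path("datasets/SemanticKITTI/dataset/sequences")
for seq in sorted(p for p in root.iterdir() if p.is_dir()):
    n_scans = len(list((seq / "velodyne").glob("*.bin")))
    n_labels = len(list((seq / "labels").glob("*.label")))
    print(f"sequence {seq.name}: {n_scans} scans, {n_labels} labels")
```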

Ground truth scenes are not provided explicitly in SemanticKITTI. To generate the complete ground truth scenes, you can run the map_from_scans.py script. It uses the dataset scans and poses to generate the sequence map used as ground truth during training:

python map_from_scans.py --path datasets/SemanticKITTI/dataset/sequences/
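
Conceptually, building the ground truth map amounts to transforming each scan into the world frame with its pose and accumulating the points; the sketch below only illustrates that idea (the actual script may additionally filter, downsample, or split the map).

```python
# Conceptual sketch of scan aggregation with poses; not the actual map_from_scans.py.
import numpy as np

def aggregate_scans(scans, poses):
    """scans: list of (N_i, 3) point arrays; poses: list of (4, 4) scan-to-world matrices."""
    world_points = []
    for pts, pose in zip(scans, poses):
        homo = np.hstack([pts, np.ones((pts.shape[0], 1))])  # homogeneous coords, (N_i, 4)
        world_points.append((homo @ pose.T)[:, :3])          # transform into the world frame
    return np.vstack(world_points)
```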

We use the Diffusion-DPO fine-tuned version of LiDiff as the teacher model as well as the teacher assistant model. Download the pre-trained weights from here and place them at checkpoints/lidiff_ddpo_refined.ckpt.

Once the sequence maps are generated and the teacher model is downloaded, you can train the model. Start training with:

python trains/DistillationDPO.py --SemanticKITTI_path datasets/SemanticKITTI --pre_trained_diff_path checkpoints/lidiff_ddpo_refined.ckpt

Inference & Visualization

We use pyrender for offscreen rendering. Please see this guide for installation of pyrender.
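
On a headless machine you typically also need to select an offscreen backend before importing pyrender, for example (as described in the pyrender offscreen rendering docs):

```python
# Select an offscreen rendering backend before pyrender is imported.
import os
os.environ["PYOPENGL_PLATFORM"] = "egl"  # or "osmesa", depending on your setup
import pyrender  # noqa: E402
```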

After pyrender is installed correctly, download the refinement model 'refine_net.ckpt' from here and place it at checkpoints/refine_net.ckpt. We also provide the pre-trained weights of Distillation-DPO: download 'distillationdpo_st.ckpt' from here and place it at checkpoints/distillationdpo_st.ckpt.

Then run the inference script with the following command:

python utils/eval_path_get_pics.py --diff checkpoints/distillationdpo_st.ckpt --refine checkpoints/refine_net.ckpt

This script reads all scenes in a specified sequence of the SemanticKITTI dataset; the resulting images are saved under exp/.

Citation

If you find our paper useful or relevant to your research, please kindly cite our paper:

@misc{zhao2025diffusiondistillationdirectpreference,
    title={Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion}, 
    author={An Zhao and Shengyuan Zhang and Ling Yang and Zejian Li and Jiale Wu and Haoran Xu and AnYang Wei and Perry Pengyun GU and Lingyun Sun},
    year={2025},
    eprint={2504.11447},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2504.11447}, 
}

Credits

DistillationDPO builds heavily on the following amazing open-source project:

LiDiff: Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion
