Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache).
- [2025/05/23] The code for our paper has been released.
- [2025/05/22] Our paper has been released.
- Speedup: Achieves up to 9.1x speedup over standard dLLM pipelines, with no performance loss on most tasks.
- Evaluation: Evaluated on LLaDA 8B and Dream 7B.
- Latency: Approaches the inference speed of autoregressive models (ARMs) in many scenarios.
Here's an overview of the process behind our dLLM-Cache method:
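At a high level, the idea is to avoid rerunning the full transformer at every denoising step by reusing cached intermediate features. The sketch below is illustrative only, not the dLLM-Cache API: all names (`compute_features`, `denoise_step`, `refresh_interval`) are hypothetical placeholders, and the fixed refresh interval stands in for the adaptive update rule described in the paper.

```python
# Illustrative sketch only -- NOT the actual dLLM-Cache implementation or API.
# It conveys the generic idea: run the expensive transformer forward pass only
# occasionally and reuse the cached features for the cheap denoising updates
# in between. dLLM-Cache itself decides adaptively which features to refresh.
from typing import Any, Callable


def cached_denoising_loop(
    x: Any,                                   # current (partially masked) sequence state
    compute_features: Callable[[Any], Any],   # hypothetical: full transformer forward pass
    denoise_step: Callable[[Any, Any], Any],  # hypothetical: one reverse-diffusion update
    num_steps: int = 64,
    refresh_interval: int = 8,                # hypothetical knob: recompute every K steps
) -> Any:
    cache = None
    for step in range(num_steps):
        if cache is None or step % refresh_interval == 0:
            cache = compute_features(x)       # expensive: full network forward
        x = denoise_step(x, cache)            # cheap: reuse cached features
    return x
```

The real method replaces the fixed interval with the adaptive caching policy described in the paper; refer to the paper and the code in this repository for the exact policy.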
To get started with dLLM-Cache, follow the installation instructions below.
- Clone the Repository:
  ```bash
  git clone https://github.com/maomaocun/dLLM-Cache.git
  cd dLLM-Cache
  ```
- Set Up the Environment: Create a Python environment with `conda` or `virtualenv` and install dependencies:
  ```bash
  bash install.sh
  ```
- Run the Demo:
  ```bash
  python demo_{model_name}.py
  ```
- Running Experiments: Launch experiments with the provided scripts:
  ```bash
  bash scripts/run_{model_name}_{task_name}_base.sh
  ```
  For example:
  - GSM8K with LLaDA:
    ```bash
    bash scripts/run_LLaDA_gsm8k_base.sh
    ```
  - BBH with Dream:
    ```bash
    bash scripts/run_Dream_bbh_base.sh
    ```
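If you want to queue several of the example runs above from Python, here is a minimal, optional sketch using only the standard library; it simply shells out to the two scripts listed in this README:

```python
# Minimal sketch: batch the example experiment scripts listed above.
# Uses only the Python standard library; stops on the first failing run.
import subprocess

SCRIPTS = [
    "scripts/run_LLaDA_gsm8k_base.sh",  # GSM8K with LLaDA
    "scripts/run_Dream_bbh_base.sh",    # BBH with Dream
]

for script in SCRIPTS:
    print(f"Running {script} ...")
    subprocess.run(["bash", script], check=True)
```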
If you have any questions, please email yangyicun187@gmail.com.
This repository was built on top of LLaDA, Dream, and lm-evaluation-harness.
If you find dLLM-Cache useful for your research and applications, please cite using this BibTeX:
```bibtex
@misc{liu2025dllm,
  title={dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching},
  author={Zhiyuan Liu and Yicun Yang and Yaojie Zhang and Junjie Chen and Chang Zou and Qingyan Wei and Shaobo Wang and Linfeng Zhang},
  year={2025},
  url={https://github.com/maomaocun/dLLM-cache},
}
```