# DreamLite **Repository Path**: thzsen/DreamLite ## Basic Information - **Project Name**: DreamLite - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-04-30 - **Last Updated**: 2026-04-30 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing

[![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/carlofkl/DreamLite) [![Paper](https://img.shields.io/badge/Paper-DreamLite-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2603.28713) [![Project Page](https://img.shields.io/badge/Project_page-More_visualizations-green?logo=bytedance)](https://carlofkl.github.io/dreamlite/) [![Visitors](https://visitor-badge.laobi.icu/badge?page_id=carlofkl.DreamLite)](https://github.com/ByteVisionLab/DreamLite)

## 🌿 Overview We introduce **DreamLite**, a compact and unified on-device diffusion model (**0.39B**) that seamlessly supports both **text-to-image generation** and **text-guided image editing** within a single network architecture. Built upon a pruned mobile U-Net backbone, DreamLite unifies multimodal conditioning through **In-Context Spatial Concatenation** directly in the latent space. By leveraging progressive step distillation, DreamLite achieves ultra-fast **4-step inference**, capable of generating or editing a **1024×1024** image in ~**3 seconds** on an iPhone 17 Pro (powered by 4-bit Qwen-VL and fp16 VAE+UNet) — operating **fully on-device with zero cloud dependency**.

Figure 1. The overall unified architecture of DreamLite.

--- ## 📰 News - **[2026.04]** 🎉🎉🎉 We officially released the inference code. - **[2026.03]** 🎉🎉🎉 DreamLite is publicly announced! Check out our [project page](https://carlofkl.github.io/dreamlite/) and [arXiv paper](https://arxiv.org/abs/2603.28713). --- ## 🎬 On-Device Demo Experience real-time generation and editing on an iPhone 17 Pro. No internet connection or cloud processing required.

Human Portrait & Style Transfer	Nature Landscape & Background Swap	Product & Object Replacement

> **Note**: If demos fail to render natively on GitHub, please visit our [Project Page](https://carlofkl.github.io/dreamlite/) to watch the full demonstrations. --- ## ⚙️ Getting Started ### 1. Environment Setup ```bash # Clone the repository git clone https://github.com/ByteVisionLab/DreamLite.git cd DreamLite # Create and activate a conda environment conda create -n dreamlite python=3.10 -y conda activate dreamlite # Install dependencies pip install -r requirements.txt ``` Ensure the model weights (DreamLite-base and DreamLite-mobile) are placed in the following directory structure: ``` DreamLite/ ├── models/ │ ├── DreamLite-base/ │ └── DreamLite-mobile/ ``` ### 2. Inference via CLI You can readily generate or edit images utilizing our provided command-line interfaces. ```bash # ========================================== # DreamLite-base: 28 Steps (High Fidelity) # ========================================== # Text-to-Image Generation python infer.py --prompt "A close-up of a fire spitting dragon cinematic shot." # Text-guided Image Editing python infer.py --prompt "Transfer this image to oil-painting style." --image_path ./inputs/source.png # ========================================== # DreamLite-mobile: 4 Steps (Ultra Fast) # ========================================== # Text-to-Image Generation python infer_mobile.py --prompt "A portrait of a young woman with flowers." # Text-guided Image Editing python infer_mobile.py --prompt "Change the background to a dense forest." --image_path ./inputs/source.png ``` ### 3. Benchmark Evaluation We provide comprehensive benchmark evaluation scripts (GenEval & ImgEdit) to facilitate performance comparisons between DreamLite and other state-of-the-art models. Please configure your local dataset paths within `tools/benchmark/infer_geneval.py` and `tools/benchmark/infer_imgedit.py` prior to execution. ```bash # Run the benchmark evaluation python tools/benchmark/infer_geneval.py --save_dir ./output/benchmark/geneval_output --geneval_json "YOUR_GENEVAL/evaluation_metadata.jsonl" python tools/benchmark/infer_imgedit.py --save_dir ./output/benchmark/imgedit_output --json_path "YOUR_IMGEDIT_PATH/ImgEdit/Benchmark/Basic/basic_edit.json" --img_root "YOUR_IMGEDIT_IMAGES_PATH/ImgEdit/Benchmark/singleturn" ``` ### 4. Interactive Gradio Demo We provide a user-friendly web interface powered by Gradio. You can try our live demo on Hugging Face Spaces, or deploy it locally on your own machine (GPU/CPU). [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/carlofkl/DreamLite) To run the interactive demo locally: ```bash # Launch the local web server python tools/app.py ``` ## 🤗 Checkpoints We offer two distinct variants of the DreamLite model to provide an optimal balance between visual fidelity and on-device inference latency. > [!NOTE] > **Model Access:** Model weights are currently undergoing safety review. To request early access, please contact us at 📧 klfeng1206@outlook.com with an email titled **"DreamLite Access Request"**. > > In your email, please ensure to include: > 1. **Your Name & Affiliation** (e.g., University, Company, or personal portfolio). > 2. **Intended Use Case** (Please briefly describe how you plan to use the DreamLite model). ⚠️ **Important Usage and Compliance Notice**: By accessing and using these models, you agree to abide by our ethical guidelines. These models must **NOT** be used to generate, edit, or distribute any content that is sexually explicit, pornographic, violent, discriminatory, or otherwise illegal. We strictly prohibit the use of DreamLite for malicious purposes.

Model Variant	Params	Resolution	Steps	Guidance
DreamLite (Base)	0.39B	1024×1024	28	CFG & IMG_CFG
DreamLite (Mobile)	0.39B	1024×1024	4	No CFG

## 📊 Main Results Quantitative comparison with state-of-the-art methods on generation and editing benchmarks.

Text-to-Image generation comparison.

Text-guided image editing comparison.

Method	Params	GenEval ↑	DPG ↑	ImgEdit ↑	GEdit-EN-Q ↑
FLUX.1-Dev / Kontext	12B	0.67	84.0	3.76	6.79
BAGEL	7B	0.82	85.1	3.42	7.20
OmniGen2	4B	0.80	83.6	3.44	6.79
LongCat-Image / Edit	6B	0.87	86.6	4.49	7.55
DeepGen1.0	2B	0.83	84.6	4.03	7.54
SANA-1.6B	1.6B	0.67	84.8	-	-
SANA-0.6B	0.6B	0.64	83.6	-	-
SnapGen++ (small)	0.4B	0.66	85.2	-	-
VIBE	1.6B	-	-	3.85	7.28
EditMGT	0.96B	-	-	2.89	6.33
DreamLite (Ours)	0.39B	0.72	85.8	4.11	6.88

## 🎛️ LoRA Fine-tuning We provide comprehensive support for LoRA fine-tuning and inference, enabling lightweight customization of DreamLite on your own domain-specific datasets. For detailed instructions, training scripts, and examples, please refer to our dedicated **[LoRA Fine-Tuning Guide](lora/README.md)**. ## 📑 Open-Source Plan - [X] Release paper on arXiv - [X] Release inference code - [X] Release LoRA training - [ ] Release model weights on HuggingFace - [X] Release online demo - [ ] On-device Deployment Reference ## 🙏 Acknowledgement We thank the great work from [SDXL](https://github.com/Stability-AI/generative-models), [SnapGen](https://snap-research.github.io/snapgen/), [Qwen](https://qwen.ai/home) and [TAESDXL](https://github.com/madebyollin/taesd). The work is under supervision from Prof. Wangmeng Zuo. ## 🪪 License **Code**: Apache-2.0 **Model weights**: see WEIGHTS_LICENSE, CC BY-NC 4.0 ## 📄 Citation If our work assists your research, feel free to give us a star ⭐ or cite us using: ```bibtex @article{feng2026dreamlite, title={DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing}, author={Kailai Feng and Yuxiang Wei and Bo Chen and Yang Pan and Hu Ye and Songwei Liu and Chenqian Yan and Yuan Gao}, journal={arXiv preprint arXiv:2603.28713}, year={2026} } ```