⚡️ Z-Image — A New Era of Ultra-Efficient Image Generation
Experience next-generation AI image generation with Z-Image, a powerful 6B-parameter foundation model built on single-stream diffusion transformers. Designed for speed, quality, and control, Z-Image delivers photorealistic visuals, precise instruction following, and bilingual text rendering—all in one highly optimized framework.
🚀 Z-Image-Turbo
The distilled, high-efficiency version of Z-Image, which needs only 8 NFEs (number of function evaluations, i.e. forward passes of the model) to generate an image. Built for extreme performance:
- ⚡ Sub-second inference on enterprise GPUs
- 💻 Runs comfortably on 16GB VRAM consumer devices
- 📸 Exceptional photorealistic quality
- 🌏 Accurate English & Chinese text generation
- 🎯 Strong instruction adherence
Perfect for teams and creators who require speed + reliability at scale.
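As a rough sanity check on the 16GB figure, the sketch below estimates the memory taken by the weights alone for a 6B-parameter model at a few common precisions. The precisions listed are illustrative assumptions (nothing above states which one Z-Image-Turbo ships in), and the estimate ignores any other pipeline components and activations, so real usage will be higher.

```python
# Back-of-the-envelope estimate of weight memory for a 6B-parameter model.
# Assumption: only the transformer weights are counted; activations and any
# other pipeline components are ignored, so this is a lower bound.

def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed to hold the weights, in GiB (2**30 bytes)."""
    return num_params * bytes_per_param / 2**30

NUM_PARAMS = 6e9   # parameter count quoted for Z-Image
VRAM_GIB = 16      # consumer VRAM budget quoted for Z-Image-Turbo

# Hypothetical precision choices, listed for comparison only.
PRECISIONS = {"fp32": 4, "bf16/fp16": 2, "int8": 1}

for name, nbytes in PRECISIONS.items():
    gib = weight_memory_gib(NUM_PARAMS, nbytes)
    verdict = "fits" if gib < VRAM_GIB else "does not fit"
    print(f"{name:>9}: {gib:5.2f} GiB of weights ({verdict} in {VRAM_GIB} GiB)")
```

Under these assumptions, 16-bit weights leave a few GiB of headroom on a 16GB card, while 32-bit weights would not fit at all.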
🧱 Z-Image-Base
The full, non-distilled foundation model—released openly to empower community innovation. Ideal for:
- Fine-tuning
- Custom pipelines
- Research & development
- Specialized downstream tasks
Unlock the full potential of large-scale diffusion transformers.
✍️ Z-Image-Edit
A specialized variant fine-tuned for image editing and image-to-image generation.
- Natural-language-driven edits
- Creative transformations
- Style changes
- High-fidelity content preservation
Designed for creators who need precision editing powered by AI.
FAQs
What is Z-Image?
Z-Image is a highly efficient 6B-parameter AI image generation model built with a single-stream diffusion transformer architecture.
What makes Z-Image-Turbo fast?
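The Turbo model uses distillation to cut generation to only 8 NFEs (function evaluations, i.e. forward passes of the model). This enables sub-second inference on enterprise GPUs and lets the model run within 16GB of VRAM on consumer devices.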
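To make the NFE count concrete, here is a minimal toy sketch of how a sampler's cost is tied to the number of model calls. The `CountingModel` class, the `euler_sample` function, and the velocity field are all hypothetical stand-ins for illustration; this is not Z-Image's sampler or API.

```python
# Toy illustration of NFEs (number of function evaluations).
# Everything here is a hypothetical stand-in: a real diffusion transformer
# would replace CountingModel, and x would be an image latent, not a float.

class CountingModel:
    """Wraps a toy velocity field and counts how often it is evaluated."""

    def __init__(self) -> None:
        self.calls = 0

    def __call__(self, x: float, t: float) -> float:
        self.calls += 1
        return -x  # toy dynamics; t is unused in this stand-in


def euler_sample(model, x0: float, num_steps: int) -> float:
    """Integrate dx/dt = model(x, t) from t=0 to t=1 with fixed Euler steps.

    Each step calls the model exactly once, so NFEs == num_steps here.
    """
    x = x0
    dt = 1.0 / num_steps
    for i in range(num_steps):
        x = x + dt * model(x, i * dt)
    return x


model = CountingModel()
sample = euler_sample(model, x0=1.0, num_steps=8)
print(f"NFEs used: {model.calls}")
```

In this toy setup the output value is irrelevant. The point is that cost grows linearly with the number of model calls, so a sampler that needs 8 calls does roughly one sixth of the network work of one that needs 50 (a hypothetical comparison figure, not one taken from the Z-Image release).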
Does Z-Image support bilingual text rendering?
Yes. Z-Image excels at generating English and Chinese text inside images with high accuracy.
Is fine-tuning supported?
Yes. The Z-Image-Base checkpoint is released specifically for community R&D and custom fine-tuning.

