⚡️ Z-Image — A New Era of Ultra-Efficient Image Generation

Experience next-generation AI image generation with Z-Image, a powerful 6B-parameter foundation model built on single-stream diffusion transformers. Designed for speed, quality, and control, Z-Image delivers photorealistic visuals, precise instruction following, and bilingual text rendering—all in one highly optimized framework.

🚀 Z-Image-Turbo

The distilled, high-efficiency version of Z-Image, which needs only 8 NFEs (number of function evaluations) per image. Built for extreme performance:


  • ⚡ Sub-second inference on enterprise GPUs
  • 💻 Runs comfortably on 16GB VRAM consumer devices
  • 📸 Exceptional photorealistic quality
  • 🌏 Accurate English & Chinese text generation
  • 🎯 Strong instruction adherence

Perfect for teams and creators who require speed + reliability at scale.
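As a rough sanity check on the 16GB figure, the arithmetic below estimates how much memory the weights of a 6B-parameter model occupy at different precisions. It is a back-of-the-envelope sketch, not a measurement: the helper `weight_memory_gib` is hypothetical, 16-bit weights are an assumption, and the estimate ignores activations and any companion components such as a text encoder or image decoder.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GiB (hypothetical helper)."""
    return num_params * bytes_per_param / 2**30


NUM_PARAMS = 6e9  # 6B parameters, as stated for Z-Image

# Assumed storage precisions; the page does not say which one is used.
for name, nbytes in [("fp32", 4), ("bf16/fp16", 2), ("int8", 1)]:
    print(f"{name:>9}: {weight_memory_gib(NUM_PARAMS, nbytes):.1f} GiB")
```

At 16-bit precision the weights alone come to about 11.2 GiB, which leaves some headroom on a 16GB card; at 32-bit precision they would not fit.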


🧱 Z-Image-Base

The full, non-distilled foundation model—released openly to empower community innovation. Ideal for:


  • Fine-tuning
  • Custom pipelines
  • Research & development
  • Specialized downstream tasks

Unlock the full potential of large-scale diffusion transformers.


✍️ Z-Image-Edit

A specialized variant fine-tuned for image editing and image-to-image generation.


  • Natural-language-driven edits
  • Creative transformations
  • Style changes
  • High-fidelity content preservation

Designed for creators who need precision editing powered by AI.

FAQs

What is Z-Image?

Z-Image is a highly efficient 6B-parameter AI image generation model built with a single-stream diffusion transformer architecture.

What makes Z-Image-Turbo fast?

The Turbo model uses advanced distillation techniques and requires only 8 NFEs, enabling sub-second inference on enterprise GPUs while still running comfortably on consumer devices with 16GB of VRAM.
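NFE stands for number of function evaluations, that is, how many times the network is run to produce one image. The toy sketch below shows why a first-order sampler with 8 steps costs exactly 8 evaluations. Everything in it is an illustrative assumption: `CountingModel` and `euler_sample` are hypothetical names, the "model" is a trivial velocity field rather than a neural network, and this is not Z-Image's actual sampler.

```python
class CountingModel:
    """Toy stand-in for a denoising network; counts how often it is called."""

    def __init__(self) -> None:
        self.nfe = 0

    def __call__(self, x: float, t: float) -> float:
        self.nfe += 1
        return -x  # toy velocity field dx/dt = -x, NOT a real network


def euler_sample(model, x0: float, num_steps: int) -> float:
    """Euler integration from t=0 to t=1: exactly one model call per step."""
    x, dt = x0, 1.0 / num_steps
    for i in range(num_steps):
        x = x + dt * model(x, i * dt)
    return x


model = CountingModel()
result = euler_sample(model, x0=1.0, num_steps=8)
print(model.nfe)  # 8
```

Because generation time is dominated by these network calls, cutting the count to 8 reduces latency roughly in proportion compared with samplers that need several times as many steps.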

Does Z-Image support bilingual text rendering?

Yes. Z-Image excels at generating English and Chinese text inside images with high accuracy.

Is fine-tuning supported?

Yes. The Z-Image-Base checkpoint is released specifically for community R&D and custom fine-tuning.