models/z-image-turbo-flow-dpo - v1.0

z-image-turbo-flow-dpo - v1.0

|
4/4/2026
|
12:22:28 PM
| Discussion|
0
A massive planet-sized stone cracked with glowing lava impacts a rocky mountainous wasteland under bright moonlight, with debris floating and a dark figure clinging to a ledge nearby.

Recommended Parameters

samplers

FlowMatchEulerDiscreteScheduler

steps

8

Tips

Keep LoRA scale between 0.6 and 1.0 for most photorealistic results.

Do not use this LoRA as an image-to-image restorer; it modifies the prior distribution of text-to-image generation.

Pushing LoRA scale beyond 1.5 may cause over-sharpened or overly saturated images.

Z-Image-Turbo Photorealistic Lighting LoRA (Flow-DPO)

This is a specialized LoRA adapter for Alibaba-Tongyi/Z-Image-Turbo, finetuned using Flow-DPO (Direct Preference Optimization for Flow Matching) to significantly enhance photorealistic lighting, cinematic shadows, and overall image quality.

By utilizing Flow-DPO on perfectly spatially-aligned image pairs, this LoRA fixes the common "flat," "washed-out," or "plastic" artifacts often found in ultra-fast distilled models, delivering stunning, physically accurate lighting in just 8 inference steps.

🧠 Training Details & Methodology

This model was trained using a custom implementation of Flow-DPO (Improving Video Generation with Human Feedback, arXiv:2501.13918).

1. The Dataset (Strict Spatial Alignment)

To prevent the model from hallucinating or altering image structures (Catastrophic Forgetting), the preference dataset was constructed using strict spatial alignment:

  • Win (Chosen): High-quality, professional photographs with perfect lighting and textures.

  • Lose (Rejected): The exact same images degraded programmatically (Gaussian blur, lowered contrast, extreme exposure shifts, gaussian noise, and heavy JPEG compression artifacts).

  • Alignment: No cropping or warping was applied, ensuring the Flow Matching trajectory learned to solely correct lighting and texture.

2. Discrete Timestep Distillation Preservation

Unlike standard diffusion models where $t$ is sampled continuously $t \in [0, 1]$, Z-Image-Turbo is a distilled model specifically optimized for 8 fixed timesteps. During the Flow-DPO training, we dynamically extracted the exact discrete $t$-distribution from the FlowMatchEulerDiscreteScheduler and restricted the random sampling to these exact 8 nodes. This ensures the LoRA retains the turbo model's extreme speed without causing output blurriness.

3. Hyperparameters

  • Base Model: Alibaba-Tongyi/Z-Image-Turbo (6B Single-Stream DiT)

  • Learning Rate: 1e-4

  • KL Penalty ($\beta$): 1.0

  • Effective Batch Size: 1

  • Mixed Precision: bfloat16

⚠️ Limitations

  • Not an Image-to-Image Restorer: This LoRA changes the prior distribution of the Text-to-Image generation. It is designed to generate better original images from text prompts, not to be used as an img2img filter to fix user-uploaded bad photos (unless combined with RF-Inversion techniques, which are highly unstable for 8-step models).

  • Color Saturation: Pushing the LoRA scale too high (e.g., > 1.5) might result in over-sharpened or overly saturated images due to the nature of DPO margin maximization. Keep the scale around 0.6 - 1.0 for the most photorealistic results.

Contributor

Previous
Ray Zimage base NSFW - v2
Next
Porcelain Art - V1

Model Details

Model type

LoCon

Base model

ZImageTurbo

Model version

v1.0

Model hash

1fd3c728ad

Creator

Discussion

Please log in to leave a comment.

Model Collection - z-image-turbo-flow-dpo

Images by z-image-turbo-flow-dpo - v1.0

A massive planet-sized stone cracked with glowing lava impacts a rocky mountainous wasteland under bright moonlight, with debris floating and a dark figure clinging to a ledge nearby.

photorealistic lighting Images

Portrait of a slender woman with white hair and red eyes wearing a black bodysuit and metal cuffs, posed with red claws and a red and white abstract background with crosses.
Hyperrealistic gecko with obsidian-black skin and glowing orange patterned scales, positioned diagonally on a neutral beige background with soft shadows.
Close-up biomechanical cyberpunk face constructed from polished copper alloy mosaic tiles featuring electric blue circuitry and golden seams under spotlight lighting.
Photorealistic colossal Ekranoplan warship gliding over dark, turbulent ocean waves under stormy skies with lightning and glowing cybernetic plating.
Architectural rendering of a French modern style building with red brick facade, tall arched window, blue metal roof, and iron door, surrounded by landscaped garden.
A hyper-realistic portrait of a woman wearing a glowing ethereal dress composed of luminous clouds and crackling lightning, illuminated with blue-white light in a softly blurred bedroom.
Photorealistic split-frame image of a giraffe wearing a diving mask submerged underwater with calm lakebed and bubbles visible, while above water a storm rages with lightning and a lighthouse in the distance.
Two female barbarian warriors walking towards a large cave entrance topped by a giant humanoid skull with prominent canines in a dusty, arid mountainous landscape under bright late afternoon sun.
A highly detailed photorealistic image of a cartoon cat sitting on a wooden kitchen countertop with ambient sunlight coming through the window, showing intricate lighting and sharp focus.

style Images

A dynamic portrait of a Polish archer holding a royal scimitar and drawing a bow, wrapped in the Polish flag with detailed armor, set against a dark fantasy background.
3D rendered giant robot standing in a field at night, glowing eyes, facing a boy in a yellow hoodie with starry sky and forest background.
Detailed illustration of a girl with long black hair and a curvy figure, rendered in an artistic chalk and coal style with dynamic composition.
Oil painting of a black raven perched on a gnarled tree branch against a vibrant sunset sky with dramatic clouds and distant mountains.
A striking upper body portrait of a gothic woman with vibrant orange and black abstract patterns overlaying her face, glowing eyes, and a sensual, sinister smile, depicted in a highly detailed charcoal drawing style with intense, vibrant colors and volumetric lighting.
Stylized Parisian woman in black haute couture dress and hat walking a white Scottish terrier on a yellow background, shown in side profile with minimalist pop art style.
Full-body shot of a ballet dancer with violet hair in a pleated dress on a polished floor with ribbons and petals.
Woman with short blue hair in black mini dress at amusement park.
Woman in animal print leggings and satin camisole in a handcrafted living room.
Lower body wearing gold ankle boots and multicolor dress.