UDCAI/Z-Image-Fun-Distill-ComfyUI

UDCAI

Text to image

Text-to-image LoRA for Z-Image, converted for ComfyUI compatibility. It is intended for use with the Tongyi-MAI/Z-Image base model and its derivatives. The Fun-distill-2602 version was adjusted by selectively removing some feed-forward keys to improve compatibility with schedulers and samplers while retaining the contrast and brightness benefits.

How to use

Installation and use with Diffusers:
pip install -U diffusers transformers accelerate peft

import torch
from diffusers import DiffusionPipeline

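# Load the base model in bfloat16 on the GPU; switch device_map to "mps" on Apple devices.
pipe = DiffusionPipeline.from_pretrained("Tongyi-MAI/Z-Image", dtype=torch.bfloat16, device_map="cuda")
# Attach the LoRA weights from this repository to the base pipeline.
pipe.load_lora_weights("UDCAI/Z-Image-Fun-Distill-ComfyUI")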

prompt = "A realistic photo of a young woman with short black hair and vibrant pink tips, wearing a metallic spiked headband and a matching spiked choker. She has large, expressive red eyes and is depicted with a slightly surprised or breathless expression, her mouth parted. She is dressed in a black and white cheerleader uniform, including a cropped tank top with a bold graphic print that says \"PHAETHON\" and a pleated mini-skirt. She is posing with her hands holding yellow pom-poms. Her skin is glistening with droplets of sweat or water, and there are floating water bubbles around her. Attached to her backside is a large, dark grey shark-like tail fin decorated with white graffiti. The background shows a sunlit athletic stadium with a running track under a clear blue sky. The overall style is high-detail with vibrant colors and dynamic lighting."
image = pipe(prompt).images[0]
image.save("z-image-fun-distill.png")  # write the generated image to disk

Author's recommendations: use with Z-Image, preferably at 8 steps and CFG 1. Avoid sageattention. Samplers that add noise at each step, such as LCM, seem to work well.
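
The recommended settings could be passed to the Diffusers pipeline roughly as follows. This is a minimal sketch, not the author's own script: the helper names `recommended_settings` and `generate` are hypothetical, the keyword names `num_inference_steps` and `guidance_scale` follow the usual Diffusers convention and are assumed to be accepted by the Z-Image pipeline, and whether `guidance_scale=1.0` matches CFG 1 in ComfyUI exactly depends on the pipeline's guidance convention, so check its documentation.

```python
def recommended_settings(steps: int = 8, cfg: float = 1.0) -> dict:
    """Build call keyword arguments from the author's recommendation.

    The keyword names are an assumption based on the common Diffusers
    convention; they are not confirmed by the model card itself.
    """
    if steps < 1:
        raise ValueError("steps must be at least 1")
    return {"num_inference_steps": steps, "guidance_scale": cfg}


def generate(prompt: str, out_path: str = "output.png") -> None:
    """Hypothetical helper: load base model plus LoRA and render one image."""
    # Heavy imports stay local so recommended_settings() works without a GPU.
    import torch
    from diffusers import DiffusionPipeline

    pipe = DiffusionPipeline.from_pretrained(
        "Tongyi-MAI/Z-Image", dtype=torch.bfloat16, device_map="cuda"
    )
    pipe.load_lora_weights("UDCAI/Z-Image-Fun-Distill-ComfyUI")

    # 8 steps and CFG 1 by default, as recommended above.
    image = pipe(prompt, **recommended_settings()).images[0]
    image.save(out_path)
```

Calling `generate("a red bicycle leaning against a brick wall")` would then write `output.png`, assuming a CUDA device and the packages from the install line are available.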

Features

Text-to-image generation using Diffusers and LoRA.
Compatible with ComfyUI.
Designed for the Tongyi-MAI/Z-Image base model and its derivatives.
Optimized to work well at 8 steps with CFG 1.
Selective adjustments to feed-forward weights so that different schedulers and samplers work correctly.
Apache 2.0 license.
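
The selective feed-forward adjustment can be pictured as filtering a LoRA state dict by key name. The sketch below is only an illustration under stated assumptions: the card does not say which keys were removed, so the key names and the `layers.1.feed_forward` pattern are hypothetical, and plain lists stand in for the tensors a real weights file would hold.

```python
def drop_keys(state_dict: dict, patterns: list[str]) -> dict:
    """Return a copy of state_dict without keys that contain any pattern."""
    return {
        key: value
        for key, value in state_dict.items()
        if not any(pattern in key for pattern in patterns)
    }


# Toy stand-in for LoRA weights; the key names are hypothetical.
toy_lora = {
    "layers.0.attention.to_q.lora_A.weight": [0.1],
    "layers.0.feed_forward.w1.lora_A.weight": [0.2],
    "layers.1.feed_forward.w2.lora_B.weight": [0.3],
}

# Remove only a subset of the feed-forward keys and keep everything else.
filtered = drop_keys(toy_lora, ["layers.1.feed_forward"])
print(sorted(filtered))
```

Filtering on key names leaves the remaining weights untouched, which fits the card's description of removing only some keys rather than retraining.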

Use cases

Create high-definition images from text prompts using Z-Image in ComfyUI.
Run generation workflows with a distilled LoRA in few steps.
Try a variety of schedulers and samplers in ComfyUI with a conversion adapted for better compatibility.
Use Z-Image-Fun-Lora-Distill-8-Steps in local environments such as ComfyUI, Diffusers, Draw Things, or DiffusionBee.