3h ago

SEGA Enables High-Resolution Generation In Diffusion Transformers Without Fine-Tuning

0337222.9K

——0——

Original post

Image diffusion models like Flux natively output at 1k resolution, but what if we want to generate much higher resolution images (6k+)? SEGA modifies the RoPE encodings during the diffusion process to generate high-resolution images---no fine-tuning required!

8:52 AM · May 23, 2026