This repository curates research on Diffusion and Flow-based models, offering a structured overview of key concepts, advancements, and applications in generative AI. It covers diverse topics from inference acceleration and model optimization to image editing and personalized generation. Contributions are welcome to keep this resource comprehensive and up-to-date.
- Inference Acceleration
- Quantization
- Optimization
- Model Distillation
- Latent Domain Applications
- Flow Matching Concepts
- Diffusion Concepts
- Image Editing
- Personalized - ID Preserving Image Generation
Training-free Diffusion Acceleration with Bottleneck Sampling
[Paper]
[Project Page]
NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers
[Paper]
SVDQUANT: ABSORBING OUTLIERS BY LOW-RANK COMPONENTS FOR 4-BIT DIFFUSION MODELS
[Paper]
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
[Paper]
[Project Page]
SAGEATTENTION: ACCURATE 8-BIT ATTENTION FOR PLUG-AND-PLAY INFERENCE ACCELERATION
[Paper]
[Project Page]
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
[Paper]
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
[Paper]
[Project Page]
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
[Paper]
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
[Paper]
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
[Paper]
[Project Page]
FLOW MATCHING FOR GENERATIVE MODELING
[Paper]
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing
[Paper]
[Project Page]
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models
[Paper]
[Project Page]
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer
[Paper]
[Project Page]
Qwen-Image
[Paper]
[Project Page]
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
[Paper]
[Project Page]
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers
[Paper]
[Project Page]
FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers
[Paper]