Collections
Discover the best community collections!
Collections including paper arxiv:2403.03206
-
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
Paper • 2408.14176 • Published • 61 -
Diffusion Models Are Real-Time Game Engines
Paper • 2408.14837 • Published • 121 -
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Paper • 2408.11039 • Published • 58 -
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
Paper • 2409.01199 • Published • 14
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28 -
black-forest-labs/FLUX.1-dev
Text-to-Image • Updated • 1.17M • • 7.8k -
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text • Updated • 1.56M • • 1.02k -
zer0int/CLIP-GmP-ViT-L-14
Zero-Shot Image Classification • Updated • 4.6k • 364
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 2 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper • 2403.03206 • Published • 60 -
Denoising Diffusion Probabilistic Models
Paper • 2006.11239 • Published • 3 -
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Paper • 2307.01952 • Published • 84