SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 26 days ago • 35
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 26 days ago • 35
CaptionEmporium/flickr-megalith-10m-internvl2-multi-caption Viewer • Updated Aug 28, 2024 • 8.51M • 370 • 9
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer Paper • 2401.10208 • Published Jan 18, 2024 • 1
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Paper • 2306.05423 • Published Jun 8, 2023