IDOL: Instant Photorealistic 3D Human Creation from a Single Image Paper • 2412.14963 • Published 18 days ago • 6
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 14 days ago • 24
WavePulse: Real-time Content Analytics of Radio Livestreams Paper • 2412.17998 • Published 14 days ago • 10
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Paper • 2412.21037 • Published 7 days ago • 22
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration Paper • 2501.01320 • Published 4 days ago • 9
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Paper • 2501.01427 • Published 4 days ago • 42
LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Paper • 2412.09856 • Published 25 days ago • 9
Motion Control for Enhanced Complex Action Video Generation Paper • 2411.08328 • Published Nov 13, 2024 • 5
Number it: Temporal Grounding Videos like Flipping Manga Paper • 2411.10332 • Published Nov 15, 2024 • 13
TEXGen: a Generative Diffusion Model for Mesh Textures Paper • 2411.14740 • Published Nov 22, 2024 • 15
SketchAgent: Language-Driven Sequential Sketch Generation Paper • 2411.17673 • Published Nov 26, 2024 • 18
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters Paper • 2411.18197 • Published Nov 27, 2024 • 14
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 17
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 50
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers Paper • 2411.18673 • Published Nov 27, 2024 • 8
Mimir: Improving Video Diffusion Models for Precise Text Understanding Paper • 2412.03085 • Published Dec 4, 2024 • 12
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 9
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published Dec 2, 2024 • 50
MV-Adapter: Multi-view Consistent Image Generation Made Easy Paper • 2412.03632 • Published Dec 4, 2024 • 23