VidTwin: Video VAE with Decoupled Structure and Dynamics Paper • 2412.17726 • Published 15 days ago • 8
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Paper • 2412.07760 • Published 28 days ago • 50
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation Paper • 2406.09961 • Published Jun 14, 2024 • 55
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training Paper • 2403.00758 • Published Mar 1, 2024 • 2
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers Paper • 2309.08532 • Published Sep 15, 2023 • 53