Diffusion Priors for Dynamic View Synthesis from Monocular Videos • Paper • 2401.05583 • Published Jan 10, 2024 • 9
TRIPS: Trilinear Point Splatting for Real-Time Radiance Field Rendering • Paper • 2401.06003 • Published Jan 11, 2024 • 23
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models • Paper • 2401.06066 • Published Jan 11, 2024 • 45
PALP: Prompt Aligned Personalization of Text-to-Image Models • Paper • 2401.06105 • Published Jan 11, 2024 • 48
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? • Paper • 2501.05510 • Published 9 days ago • 35
VideoRAG: Retrieval-Augmented Generation over Video Corpus • Paper • 2501.05874 • Published 9 days ago • 61
Diffusion Adversarial Post-Training for One-Step Video Generation • Paper • 2501.08316 • Published 4 days ago • 29
PokerBench: Training Large Language Models to become Professional Poker Players • Paper • 2501.08328 • Published 4 days ago • 13
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data • Paper • 2501.08167 • Published 4 days ago • 6
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models • Paper • 2501.06751 • Published 7 days ago • 31
MangaNinja: Line Art Colorization with Precise Reference Following • Paper • 2501.08332 • Published 4 days ago • 50
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published 4 days ago • 258