-
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Paper • 2501.00958 • Published • 87 -
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Paper • 2501.01257 • Published • 42 -
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Paper • 2501.01423 • Published • 33 -
REDUCIO! Generating 1024times1024 Video within 16 Seconds using Extremely Compressed Motion Latents
Paper • 2411.13552 • Published
Raffaele Salvi
Rufy992
·
AI & ML interests
Interest for research
Recent Activity
updated
a collection
about 12 hours ago
Articoli PHD
updated
a collection
about 14 hours ago
Articoli PHD
updated
a collection
about 14 hours ago
Articoli PHD
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet