ProgCo: Program Helps Self-Correction of Large Language Models Paper • 2501.01264 • Published 4 days ago • 23
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 4 days ago • 41
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 5 days ago • 82
On the Compositional Generalization of Multimodal LLMs for Medical Imaging Paper • 2412.20070 • Published 10 days ago • 40
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 12 days ago • 86