Dynamic Scaling of Unit Tests for Code Reward Modeling Paper • 2501.01054 • Published 5 days ago • 15
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 4 days ago • 41
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published 22 days ago • 49
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 12 days ago • 86
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Paper • 2412.14922 • Published 18 days ago • 84
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper • 2411.07618 • Published Nov 12, 2024 • 15
m3hrdadfi/wav2vec2-base-100k-gtzan-music-genres Automatic Speech Recognition • Updated Jul 6, 2021 • 141 • 20