Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs • Paper • 2412.21187 • Published 8 days ago • 27
PixMo • Collection • A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 1 day ago • 53
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search • Paper • 2412.18319 • Published 15 days ago • 34
Offline Reinforcement Learning for LLM Multi-Step Reasoning • Paper • 2412.16145 • Published 18 days ago • 38
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 • Text Generation • Updated Oct 10, 2024 • 6.18k • 18
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1 • Sentence Similarity • Updated 5 days ago • 20.8k • 24