2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 5 days ago • 82
view article Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods • 1 day ago • 10
Mirrored mergekit-ready models Collection Mirrored models tweaked to be more friendly for mergekit. No pickles allowed. • 7 items • Updated 1 day ago • 1
view article Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 4 days ago • 30
view article Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 2 days ago • 2
Deepthink and Reasoning Collection Best for Deepthink and Reasoning • 12 items • Updated 2 days ago • 12
x1 Series Collection A family of x1 models which have a built-in reasoning abilities. • 1 item • Updated 3 days ago • 1
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 3 days ago • 10
ANIMA: Biomimicry Models Collection Different chat versions of the ANIMA model • 5 items • Updated Mar 12, 2024 • 1
Nexus Internal Knowledge Map Collection Various versions of the same models that have been trained on the Internal Knowledge Map dataset using different methods and frameworks • 9 items • Updated Mar 31, 2024 • 1
sd-seer Collection Experimental models to convert natural language stable diffusion prompts to tags • 2 items • Updated Oct 20, 2024 • 1
bellman Collection The currently active versions of bellman, the Swedish finetune. This will be updated to only contain the recommended versions. • 4 items • Updated Oct 21, 2024 • 1
OdysseyXL Collection Odyssey Labs' collection of OdysseyXL, A diffusion model designed for hyperrealism. • 6 items • Updated 4 days ago • 2