Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Paper • 2501.01904 • Published 3 days ago • 12
Paper - Multimodal Collection Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding • 95 items • Updated about 12 hours ago • 1
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Paper • 2501.01957 • Published 3 days ago • 19
Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods • 1 day ago • 10
Post 1581
Reasoning SmolLM2 🚀🎯
Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details.
🔥 Blog: https://huggingface.co/blog/prithivMLmods/smollm2-ft
🔼 Models:
+ SmolLM2-CoT-360M: prithivMLmods/SmolLM2-CoT-360M
+ Reasoning-SmolLM2-135M: prithivMLmods/Reasoning-SmolLM2-135M
+ SmolLM2-CoT-360M-GGUF: prithivMLmods/SmolLM2-CoT-360M-GGUF
🤠 Other Details:
+ Demo: prithivMLmods/SmolLM2-CoT-360M
+ Fine-tune nB: prithivMLmods/SmolLM2-CoT-360M
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models Paper • 2306.07691 • Published Jun 13, 2023 • 5
Scaling Test-Time Compute with Open Models Collection Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated about 18 hours ago • 19