Anthonny Olime

Aviv-anthonnyolime

AI & ML interests

None yet

Aviv-anthonnyolime's activity

upvoted a paper about 12 hours ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 3 days ago • 12

updated a collection about 12 hours ago

Paper - Multimodal

Papers related to multimodal models - research for a: Modular, Multimodal, Multi-Stream, Mixture of Experts, Universal Transformer, Matryoshka embedding • 95 items • Updated about 12 hours ago • 1

upvoted a paper about 12 hours ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 3 days ago • 19

liked 2 models about 13 hours ago

declare-lab/TangoFlux

Text-to-Audio • Updated 4 days ago • 922 • 42

prithivMLmods/Reasoning-SmolLM2-135M

Text Generation • Updated about 17 hours ago • 5

upvoted an article about 13 hours ago

Fine-tune SmolLM's on custom synthetic data

Article • 1 day ago • 10

reacted to prithivMLmods's post with 🚀 about 13 hours ago

Post

1581

Reasoning SmolLM2 🚀

🎯Fine-tuning SmolLM2 on a lightweight synthetic reasoning dataset for reasoning-specific tasks. Future updates will focus on lightweight, blazing-fast reasoning models. Until then, check out the blog for fine-tuning details.

🔥Blog : https://huggingface.co/blog/prithivMLmods/smollm2-ft

🔼 Models :
+ SmolLM2-CoT-360M : prithivMLmods/SmolLM2-CoT-360M
+ Reasoning-SmolLM2-135M : prithivMLmods/Reasoning-SmolLM2-135M
+ SmolLM2-CoT-360M-GGUF : prithivMLmods/SmolLM2-CoT-360M-GGUF

🤠 Other Details :
+ Demo : prithivMLmods/SmolLM2-CoT-360M
+ Fine-tune notebook : prithivMLmods/SmolLM2-CoT-360M

updated a collection 3 days ago

Dataset

9 items • Updated 3 days ago

liked a dataset 3 days ago

DAMO-NLP-SG/multimodal_textbook

Updated 1 day ago • 487 • 20

updated 2 collections 4 days ago

Audio model

5 items • Updated 4 days ago

Paper - Multimodal

Papers related to multimodal models - research for a: Modular, Multimodal, Multi-Stream, Mixture of Experts, Universal Transformer, Matryoshka embedding • 95 items • Updated about 12 hours ago • 1

upvoted a paper 4 days ago

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Paper • 2306.07691 • Published Jun 13, 2023 • 5

liked 2 models 4 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated about 2 hours ago • 986 • 242

facebook/SONAR

Updated Feb 14, 2024 • 40

upvoted a collection 5 days ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated about 18 hours ago • 19

liked a dataset 6 days ago

laion/LAION-Audio-300M

Viewer • Updated about 2 hours ago • 156M • 306 • 12

liked 3 models 12 days ago

deepseek-ai/DeepSeek-V3

Updated 8 days ago • 71.7k • 1.32k

deepseek-ai/DeepSeek-V3-Base

Updated 8 days ago • 8.36k • 1.16k

ibm-granite/granite-3.1-8b-base

Text Generation • Updated 18 days ago • 5.51k • 14