Muhtasham Oblokulov's picture

Muhtasham Oblokulov PRO

muhtasham

·

https://www.linkedin.com/in/muhtasham/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

liked a Space about 5 hours ago

kkr5155/maya_demo

upvoted a collection about 19 hours ago

View all activity

Organizations

muhtasham's activity

upvoted a paper about 5 hours ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published 28 days ago • 26

upvoted a collection about 19 hours ago

marc

5 items • Updated Nov 11, 2024 • 2

upvoted a collection 4 days ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated 3 days ago • 17

upvoted a collection 17 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 18 days ago • 75

upvoted a collection 18 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 18 days ago • 116

upvoted an article 28 days ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

By

•

29 days ago

• 20

upvoted a collection 29 days ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 15 days ago • 208

upvoted 2 collections about 1 month ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated about 1 hour ago • 69

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated 1 day ago • 25

upvoted 2 collections 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 15 days ago • 197

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted a paper 2 months ago

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Paper • 2408.11049 • Published Aug 20, 2024 • 12

upvoted an article 3 months ago

Article

How to build a custom text classifier without days of human labeling

By

•

Oct 17, 2024

• 55

upvoted 3 collections 5 months ago

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25, 2024 • 15

Tower

Model weights and SFT data for Tower. • 11 items • Updated Nov 15, 2024 • 26

Code Evaluation

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted an article 5 months ago

Article

Mixture of Depth is Vibe

By

•

Apr 22, 2024

• 44

upvoted a collection 5 months ago

Llama-3.1 Quantization

Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 42

upvoted an article 5 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29, 2024

• 27

upvoted a collection 6 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 638