Marc Sun

marcsun13

AI & ML interests

LLM, Quantization, Training, Inference

Organizations

Hugging Face, Hugging Face Internal Testing Organization, HuggingFaceM4, Hugging Face OSS Metrics, accelerate, Hugging Face TB Research, Quanto library, LocalLLaMA, MLX Community, Hugging Face 1Bit LLMs, Paris AI Running Club, LLHF, SLLHF, Hugging Quants, Hugging Face Party @ PyTorch Conference, qrias, DDUF

marcsun13's activity

upvoted an article 3 months ago

Fixing Gradient Accumulation

46
upvoted 3 articles 4 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

215

SmolLM - blazingly fast and remarkably powerful

296
upvoted an article 5 months ago

XetHub is joining Hugging Face!

81
upvoted an article 7 months ago

Benchmarking Text Generation Inference

29
upvoted an article 7 months ago

License to Call: Introducing Transformers Agents 2.0

123
upvoted an article 9 months ago

Welcome Llama 3 - Meta's new open LLM

281
upvoted 8 articles 9 months ago

Vision Language Models Explained

239

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

171

Overview of natively supported quantization schemes in 🤗 Transformers

11

Making LLMs lighter with AutoGPTQ and transformers

37

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

67

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

105

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

70

quanto: a pytorch quantization toolkit

33