Edit Models filters

Inference status

Misc

8-bit precision

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

9,794

Full-text search

Active filters: 8-bit

neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 358 • 2

anokimchen/sd-turbo-openvino-8bit-no-calibration

Text-to-Image • Updated Aug 7, 2024 • 1

anokimchen/sd-turbo-openvino-8bit-GPT4vision-calibrated

Text-to-Image • Updated Aug 7, 2024 • 1

shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8

Updated Aug 7, 2024 • 59 • 1

alpindale/Meta-Llama-3.1-70B-Instruct-GPTQ-INT8

Updated Aug 13, 2024 • 40 • 2

FuturisticVibes/Rocinante-12B-v1.1-8.0bpw-h8-exl2

Updated Aug 23, 2024 • 10 • 1

Statuo/Celeste-v1.9-8bpw-EXL2

Text Generation • Updated Aug 17, 2024 • 21 • 1

MaziyarPanahi/SmolLM-1.7B-Instruct-v0.2-GGUF

Text Generation • Updated Aug 18, 2024 • 249 • 7

MaziyarPanahi/Phi-3.5-mini-instruct-GGUF

Text Generation • Updated Aug 20, 2024 • 2.34M • 6

neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8

Text Generation • Updated Oct 9, 2024 • 20 • 1

KhanhVan/Vistral-7B-Chat-gguf1

Text Generation • Updated Aug 24, 2024 • 51 • 2

Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 6.97k • 20

Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 3.55k • 12

FuturisticVibes/ArliAI-RPMax-12B-v1.1-8.0bpw-h8-exl2

Updated Sep 1, 2024 • 4 • 2

jadechoghari/aya-23-8B-quantized

Text Generation • Updated Sep 1, 2024 • 71 • 3

MaziyarPanahi/Yi-Coder-9B-Chat-GGUF

Text Generation • Updated Sep 4, 2024 • 2.34M • 2

MaziyarPanahi/DeepSeek-V2.5-GGUF

Text Generation • Updated Sep 11, 2024 • 32.2k • 4

HF1BitLLM/Llama3-8B-1.58-100B-tokens

Text Generation • Updated Sep 19, 2024 • 2.32k • 166

MaziyarPanahi/solar-pro-preview-instruct-GGUF

Text Generation • Updated Sep 13, 2024 • 2.33M • 22

Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 24, 2024 • 4.81k • 7

Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 1.31k • 7

Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 2.15k • 2

Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 18, 2024 • 9.24k • 10

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 8.01k • 12

Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 3.21k • 8

Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 7.16k • 15

LoneStriker/Mistral-Small-Instruct-2409-8.0bpw-h8-exl2

Updated Sep 17, 2024 • 30 • 5

DewEfresh/pixtral-12b-8bit

Image-Text-to-Text • Updated Sep 18, 2024 • 69 • 12

MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF

Text Generation • Updated Sep 18, 2024 • 2.33M • 2

MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF

Text Generation • Updated Sep 18, 2024 • 2.34M • 8