Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

8-bit precision

Mixture of Experts

Misc with no match

4-bit precision

text-embeddings-inference

Carbon Emissions

Models

327

Full-text search

Active filters: fp8

deepseek-ai/DeepSeek-V3

Updated 4 days ago • 45.5k • 958

deepseek-ai/DeepSeek-V3-Base

Updated 4 days ago • 6.64k • 1.09k

neuralmagic/Meta-Llama-3-70B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 83.6k • 11

neuralmagic/Qwen2-72B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 677 • 11

neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8

Text Generation • Updated Aug 12, 2024 • 344 • 2

nm-testing/gemma-2-27b-it-FP8

Text Generation • Updated Jun 27, 2024 • 488 • 3

neuralmagic/gemma-2-9b-it-FP8

Text Generation • Updated Jul 18, 2024 • 543 • 5

neuralmagic/Mistral-Nemo-Instruct-2407-FP8

Text Generation • Updated Jul 19, 2024 • 33.1k • 17

neuralmagic/DeepSeek-Coder-V2-Instruct-FP8

Text Generation • Updated Jul 22, 2024 • 3.08k • 7

neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 534k • 37

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 640 • 5

neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 66.7k • 37

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 3.62k • 31

neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 188 • 14

mgoin/Nemotron-4-340B-Instruct-hf-FP8

Text Generation • Updated Aug 8, 2024 • 135 • 3

neuralmagic/Meta-Llama-3.1-70B-FP8

Text Generation • Updated Oct 9, 2024 • 233 • 1

John6666/flux-dev8-anime-nsfw-fp8-flux

Text-to-Image • Updated Sep 1, 2024 • 438 • 2

amd/Mixtral-8x22B-Instruct-v0.1-FP8-KV

Updated 14 days ago • 2.05k • 1

John6666/lyh-anime-flux-v2a1-fp8-flux

Text-to-Image • Updated Sep 12, 2024 • 714 • 3

John6666/sapianf-nude-men-women-for-flux-v20fp16-fp8-flux

Text-to-Image • Updated Sep 12, 2024 • 1.51k • 3

neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic

Text Generation • Updated Oct 2, 2024 • 69.2k • 17

neuralmagic/Llama-3.2-3B-Instruct-FP8-dynamic

Text Generation • Updated Oct 9, 2024 • 1.03k • 2

neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic

Text Generation • Updated Oct 2, 2024 • 4.08k • 8

neuralmagic/Phi-3.5-mini-instruct-FP8-KV

Text Generation • Updated Oct 1, 2024 • 302 • 2

neuralmagic/pixtral-12b-FP8-dynamic

Text Generation • Updated Nov 1, 2024 • 2.96k • 7

neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic

Text Generation • Updated Oct 17, 2024 • 2.15k • 14

Infermatic/magnum-v4-72b-FP8-Dynamic

Text Generation • Updated Oct 21, 2024 • 1.2k • 1

mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC

Text Generation • Updated Nov 6, 2024 • 141 • 4

tencent-community/Hunyuan-A52B-Instruct-FP8

Text Generation • Updated Nov 5, 2024 • 31 • 1

FriendliAI/Meta-Llama-3-8B-Instruct-fp8

Text Generation • Updated Nov 3, 2024 • 36 • 2