-
-
-
-
-
-
Inference status
Active filters:
fp8
deepseek-ai/DeepSeek-V3
Updated
•
45.5k
•
958
deepseek-ai/DeepSeek-V3-Base
Updated
•
6.64k
•
1.09k
neuralmagic/Meta-Llama-3-70B-Instruct-FP8
Text Generation
•
Updated
•
83.6k
•
11
neuralmagic/Qwen2-72B-Instruct-FP8
Text Generation
•
Updated
•
677
•
11
neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8
Text Generation
•
Updated
•
344
•
2
nm-testing/gemma-2-27b-it-FP8
Text Generation
•
Updated
•
488
•
3
neuralmagic/gemma-2-9b-it-FP8
Text Generation
•
Updated
•
543
•
5
neuralmagic/Mistral-Nemo-Instruct-2407-FP8
Text Generation
•
Updated
•
33.1k
•
17
neuralmagic/DeepSeek-Coder-V2-Instruct-FP8
Text Generation
•
Updated
•
3.08k
•
7
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
534k
•
37
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
640
•
5
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
66.7k
•
37
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8
Text Generation
•
Updated
•
3.62k
•
31
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
188
•
14
mgoin/Nemotron-4-340B-Instruct-hf-FP8
Text Generation
•
Updated
•
135
•
3
neuralmagic/Meta-Llama-3.1-70B-FP8
Text Generation
•
Updated
•
233
•
1
John6666/flux-dev8-anime-nsfw-fp8-flux
Text-to-Image
•
Updated
•
438
•
2
amd/Mixtral-8x22B-Instruct-v0.1-FP8-KV
Updated
•
2.05k
•
1
John6666/lyh-anime-flux-v2a1-fp8-flux
Text-to-Image
•
Updated
•
714
•
3
John6666/sapianf-nude-men-women-for-flux-v20fp16-fp8-flux
Text-to-Image
•
Updated
•
1.51k
•
3
neuralmagic/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
Text Generation
•
Updated
•
69.2k
•
17
neuralmagic/Llama-3.2-3B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
1.03k
•
2
neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic
Text Generation
•
Updated
•
4.08k
•
8
neuralmagic/Phi-3.5-mini-instruct-FP8-KV
Text Generation
•
Updated
•
302
•
2
neuralmagic/pixtral-12b-FP8-dynamic
Text Generation
•
Updated
•
2.96k
•
7
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
•
Updated
•
2.15k
•
14
Infermatic/magnum-v4-72b-FP8-Dynamic
Text Generation
•
Updated
•
1.2k
•
1
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC
Text Generation
•
Updated
•
141
•
4
tencent-community/Hunyuan-A52B-Instruct-FP8
Text Generation
•
Updated
•
31
•
1
FriendliAI/Meta-Llama-3-8B-Instruct-fp8
Text Generation
•
Updated
•
36
•
2