-
-
-
-
-
-
Inference status
Active filters:
fp8
neuralmagic/Llama-3.2-1B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
2.16k
•
2
SicariusSicariiStuff/Dusk_Rainbow_FP8
Updated
•
10
soprasteria/Mixtral-8x7B-Instruct-v0.1-FP8
Updated
•
134
CalamitousFelicitousness/SorcererLM-8x22b-FP8-Dynamic
obamaTeo/llama-finetune-8bit-wiki-284-ver2
Updated
fxmarty/quark-legacy-fp8
Updated
•
250
amd/jais-13b-chat-FP8
predibase/Qwen2.5-14B-FP8
CalamitousFelicitousness/banana-2-b-72b-FP8-Dynamic
taozi555/Llama-Guard-3-8B-FP8
ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Updated
Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic
Updated
•
27
predibase/Qwen2.5-32B-Instruct-FP8
Updated
•
253
Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic
Text Generation
•
Updated
•
248
predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
Updated
Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic
Updated
Infermatic/Chronos-Platinum-72B-FP8-Dynamic
Infermatic/Nautilus-70B-v0.1-FP8-Dynamic
Updated
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
8
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC
Text Generation
•
Updated
•
120
•
3
yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
7
Dev0502/Qwen2.5-14B-Instruct-abliterated-v2-FP8
andecy64/Nxcode-CQ-7B-orpo-FP8
SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark
yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8
Updated
•
28
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark
liuxl12/Qwen2.5-32B-Instruct-FP8
Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8
Updated
•
3.86k