-
-
-
-
-
-
Inference status
Active filters:
vllm
mistralai/Pixtral-12B-2409
Image-Text-to-Text
•
Updated
•
566
mistralai/Mistral-Large-Instruct-2411
Updated
•
3.78M
•
183
mistralai/Ministral-8B-Instruct-2410
Updated
•
3.8M
•
382
mistralai/Mistral-Small-Instruct-2409
Updated
•
3.76M
•
366
QuantFactory/Ministral-8B-Instruct-2410-GGUF
Updated
•
611
•
2
mistralai/Pixtral-Large-Instruct-2411
Image-Text-to-Text
•
Updated
•
378
neuralmagic/Meta-Llama-3-70B-Instruct-FP8
Text Generation
•
Updated
•
66.8k
•
11
neuralmagic/Qwen2-72B-Instruct-FP8
Text Generation
•
Updated
•
634
•
11
neuralmagic/Mixtral-8x22B-Instruct-v0.1-FP8
Text Generation
•
Updated
•
372
•
2
neuralmagic/gemma-2-9b-it-FP8
Text Generation
•
Updated
•
522
•
5
neuralmagic/Mistral-Nemo-Instruct-2407-FP8
Text Generation
•
Updated
•
32.6k
•
17
neuralmagic/DeepSeek-Coder-V2-Instruct-FP8
Text Generation
•
Updated
•
1.63k
•
7
neuralmagic/Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
•
Updated
•
491k
•
37
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
689
•
5
neuralmagic/Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
•
Updated
•
60.5k
•
37
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8
Text Generation
•
Updated
•
3.25k
•
31
neuralmagic/Meta-Llama-3.1-405B-Instruct-FP8-dynamic
Text Generation
•
Updated
•
185
•
14
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
8.15k
•
9
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
5.94k
•
13
mistralai/Mistral-Large-Instruct-2407
Updated
•
3.76M
•
814
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16
Text Generation
•
Updated
•
255
•
4
mgoin/Nemotron-4-340B-Instruct-hf
Text Generation
•
Updated
•
56
•
4
mgoin/Nemotron-4-340B-Instruct-hf-FP8
Text Generation
•
Updated
•
150
•
3
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
5.79k
•
18
neuralmagic/Meta-Llama-3.1-70B-FP8
Text Generation
•
Updated
•
259
•
1
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
•
Updated
•
358
•
2
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
112k
•
28
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Text Generation
•
Updated
•
2.36k
•
12
neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
20
•
1
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
•
383