Active filters: 8-bit
neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8 • Text Generation • 358 • 2
anokimchen/sd-turbo-openvino-8bit-no-calibration • Text-to-Image • 1
anokimchen/sd-turbo-openvino-8bit-GPT4vision-calibrated • Text-to-Image • 1
shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8 • 59 • 1
alpindale/Meta-Llama-3.1-70B-Instruct-GPTQ-INT8 • 40 • 2
FuturisticVibes/Rocinante-12B-v1.1-8.0bpw-h8-exl2 • 10 • 1
Statuo/Celeste-v1.9-8bpw-EXL2 • Text Generation • 21 • 1
MaziyarPanahi/SmolLM-1.7B-Instruct-v0.2-GGUF • Text Generation • 249 • 7
MaziyarPanahi/Phi-3.5-mini-instruct-GGUF • Text Generation • 2.34M • 6
neuralmagic/SmolLM-1.7B-Instruct-quantized.w8a8 • Text Generation • 20 • 1
KhanhVan/Vistral-7B-Chat-gguf1 • Text Generation • 51 • 2
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 6.97k • 20
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 3.55k • 12
FuturisticVibes/ArliAI-RPMax-12B-v1.1-8.0bpw-h8-exl2 • 4 • 2
jadechoghari/aya-23-8B-quantized • Text Generation • 71 • 3
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF • Text Generation • 2.34M • 2
MaziyarPanahi/DeepSeek-V2.5-GGUF • Text Generation • 32.2k • 4
HF1BitLLM/Llama3-8B-1.58-100B-tokens • Text Generation • 2.32k • 166
MaziyarPanahi/solar-pro-preview-instruct-GGUF • Text Generation • 2.33M • 22
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 4.81k • 7
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8 • Text Generation • 1.31k • 7
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8 • Text Generation • 2.15k • 2
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8 • Text Generation • 9.24k • 10
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8 • Text Generation • 8.01k • 12
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int8 • Text Generation • 3.21k • 8
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8 • Text Generation • 7.16k • 15
LoneStriker/Mistral-Small-Instruct-2409-8.0bpw-h8-exl2 • 30 • 5
DewEfresh/pixtral-12b-8bit • Image-Text-to-Text • 69 • 12
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF • Text Generation • 2.33M • 2
MaziyarPanahi/Qwen2.5-7B-Instruct-GGUF • Text Generation • 2.34M • 8