-
-
-
-
-
-
Inference status
Active filters:
GPTQ
DanielAWrightGabrielAI/pygmalion-7b-4bit-128g-cuda-2048Token
Text Generation
•
Updated
•
51
•
15
rirv938/GPTQ-LLaMa-65B-4bit-triton
Text Generation
•
Updated
•
8
mlabonne/gpt2-GPTQ-4bit
Text Generation
•
Updated
•
20
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
•
Updated
•
10
daedalus314/Griffin-3B-GPTQ
Text Generation
•
Updated
•
16
Sanrove/gpt2-GPTQ-4b
Text Generation
•
Updated
•
14
daedalus314/Marx-3B-V2-GPTQ
Text Generation
•
Updated
•
10
TKDKid1000/pythia-2.8b-deduped-GPTQ
Text Generation
•
Updated
•
9
Trelis/Yi-34B-200K-Llamafied-chat-SFT-function-calling-v2-GPTQ
Text Generation
•
Updated
Inferless/deciLM-7B-GPTQ
Text Generation
•
Updated
•
11
•
1
Inferless/SOLAR-10.7B-Instruct-v1.0-GPTQ
Text Generation
•
Updated
•
12
•
2
Inferless/Mixtral-8x7B-v0.1-int8-GPTQ
Text Generation
•
Updated
•
12
•
2
Masterjp123/SnowyRP-FinalV1-L2-13B-GPTQ
Text Generation
•
Updated
•
16
•
3
bigquant/Senku-70B-GPTQ-4bit
Text Generation
•
Updated
•
8
•
1
twhoool02/Llama-2-7b-hf-AutoGPTQ
Text Generation
•
Updated
•
24
Dmitriy007/rugpt2_gen_news-gptq-4bit
Text Generation
•
Updated
•
13
SwastikM/Llama-2-7B-Chat-text2code
Text Generation
•
Updated
•
21
•
4
adriabama06/Llama-3.2-1B-Instruct-GPTQ-8bit-128g
Text Generation
•
Updated
•
594
•
1
NightForger/saiga_nemo_12b-GPTQ
Text Generation
•
Updated
•
141
NaomiBTW/L3-8B-Lunaris-v1-GPTQ
Text Generation
•
Updated
GusPuffy/Llama-3.1-70B-ArliAI-RPMax-v1.3-GPTQ