Edit Models filters

Inference status

Misc

AutoTrain Compatible

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

Carbon Emissions

Misc with no match

text-embeddings-inference

Mixture of Experts

Models

21

Full-text search

Active filters: GPTQ

DanielAWrightGabrielAI/pygmalion-7b-4bit-128g-cuda-2048Token

Text Generation • Updated May 18, 2023 • 51 • 15

rirv938/GPTQ-LLaMa-65B-4bit-triton

Text Generation • Updated May 25, 2023 • 8

mlabonne/gpt2-GPTQ-4bit

Text Generation • Updated Jul 8, 2023 • 20

CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA

Text Generation • Updated Jul 20, 2023 • 10

daedalus314/Griffin-3B-GPTQ

Text Generation • Updated Sep 8, 2023 • 16

Sanrove/gpt2-GPTQ-4b

Text Generation • Updated Sep 24, 2023 • 14

daedalus314/Marx-3B-V2-GPTQ

Text Generation • Updated Oct 12, 2023 • 10

TKDKid1000/pythia-2.8b-deduped-GPTQ

Text Generation • Updated Oct 25, 2023 • 9

Trelis/Yi-34B-200K-Llamafied-chat-SFT-function-calling-v2-GPTQ

Text Generation • Updated Nov 20, 2023

Inferless/deciLM-7B-GPTQ

Text Generation • Updated Jan 4, 2024 • 11 • 1

Inferless/SOLAR-10.7B-Instruct-v1.0-GPTQ

Text Generation • Updated Jan 4, 2024 • 12 • 2

Inferless/Mixtral-8x7B-v0.1-int8-GPTQ

Text Generation • Updated Jan 25, 2024 • 12 • 2

Masterjp123/SnowyRP-FinalV1-L2-13B-GPTQ

Text Generation • Updated Apr 4, 2024 • 16 • 3

bigquant/Senku-70B-GPTQ-4bit

Text Generation • Updated Feb 26, 2024 • 8 • 1

twhoool02/Llama-2-7b-hf-AutoGPTQ

Text Generation • Updated Apr 3, 2024 • 24

Dmitriy007/rugpt2_gen_news-gptq-4bit

Text Generation • Updated Feb 28, 2024 • 13

SwastikM/Llama-2-7B-Chat-text2code

Text Generation • Updated May 19, 2024 • 21 • 4

adriabama06/Llama-3.2-1B-Instruct-GPTQ-8bit-128g

Text Generation • Updated 7 days ago • 594 • 1

NightForger/saiga_nemo_12b-GPTQ

Text Generation • Updated Nov 4, 2024 • 141

NaomiBTW/L3-8B-Lunaris-v1-GPTQ

Text Generation • Updated Nov 11, 2024

GusPuffy/Llama-3.1-70B-ArliAI-RPMax-v1.3-GPTQ

Updated Dec 3, 2024 • 19