Models (319)

Active filters: rlhf

sileod/deberta-v3-large-tasksource-nli

Zero-Shot Classification • Updated Feb 17, 2024 • 683 • 34

sileod/deberta-v3-base-tasksource-nli

Zero-Shot Classification • Updated Aug 13, 2024 • 17.2k • 120

PKU-Alignment/beaver-7b-v1.0

Reinforcement Learning • Updated May 9, 2024 • 26 • 10

PKU-Alignment/beaver-7b-v1.0-cost

Reinforcement Learning • Updated Apr 20, 2024 • 462 • 9

nvidia/NV-Llama2-70B-RLHF-Chat

Text Generation • Updated Mar 9, 2024 • 4

joey00072/ToxicHermes-2.5-Mistral-7B

Text Generation • Updated Dec 16, 2023 • 45 • 18

argilla/distilabeled-OpenHermes-2.5-Mistral-7B

Text Generation • Updated Jan 17, 2024 • 38 • 30

argilla/distilabeled-Marcoro14-7B-slerp-full

Text Generation • Updated Mar 4, 2024 • 729 • 2

mlabonne/NeuralBeagle14-7B

Text Generation • Updated Mar 4, 2024 • 123 • 158

mlabonne/NeuralBeagle14-7B-GGUF

Updated Jan 28, 2024 • 189 • 46

mlx-community/NeuralBeagle14-7B-4bit-mlx

Updated Jan 17, 2024 • 12 • 4

TheBloke/NeuralBeagle14-7B-GGUF

Updated Jan 17, 2024 • 467 • 26

argilla/CapybaraHermes-2.5-Mistral-7B

Updated Mar 4, 2024 • 37 • 67

tasksource/deberta-small-long-nli

Zero-Shot Classification • Updated Aug 28, 2024 • 25k • 39

TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated Jan 31, 2024 • 11.7k • 102

TheBloke/CapybaraHermes-2.5-Mistral-7B-AWQ

Updated Jan 31, 2024 • 292 • 21

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ

Updated Jan 31, 2024 • 2.55k • 56

mlabonne/AlphaMonarch-7B

Text Generation • Updated Mar 28, 2024 • 12.9k • 148

ContextualAI/Contextual_KTO_Mistral_PairRM

Text Generation • Updated Apr 26, 2024 • 217 • 31

mlabonne/OrpoLlama-3-8B

Text Generation • Updated Jun 15, 2024 • 75 • 54

mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated Nov 15, 2024 • 6 • 1

mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF

Updated Nov 15, 2024 • 420 • 1

mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF

Updated 19 days ago • 141 • 1

mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF

Updated 19 days ago • 221 • 1

stanfordnlp/SteamSHP-flan-t5-xl

Text2Text Generation • Updated Oct 10, 2023 • 61 • 43

stanfordnlp/SteamSHP-flan-t5-large

Text2Text Generation • Updated Oct 10, 2023 • 30 • 33

trl-lib/llama-7b-se-peft

Updated Apr 6, 2023 • 4

sileod/deberta-v3-large-tasksource-rlhf-reward-model

Text Classification • Updated Mar 28, 2023 • 61 • 11

trl-lib/llama-7b-se-rl-peft

Updated Apr 14, 2023 • 103

trl-lib/llama-7b-se-rm-peft

Updated Apr 6, 2023 • 8