-
-
-
-
-
-
Inference status
Active filters:
rlhf
sileod/deberta-v3-large-tasksource-nli
Zero-Shot Classification
•
Updated
•
683
•
34
sileod/deberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
17.2k
•
120
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
•
26
•
10
PKU-Alignment/beaver-7b-v1.0-cost
Reinforcement Learning
•
Updated
•
462
•
9
nvidia/NV-Llama2-70B-RLHF-Chat
Text Generation
•
Updated
•
4
joey00072/ToxicHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
45
•
18
argilla/distilabeled-OpenHermes-2.5-Mistral-7B
Text Generation
•
Updated
•
38
•
30
argilla/distilabeled-Marcoro14-7B-slerp-full
Text Generation
•
Updated
•
729
•
2
mlabonne/NeuralBeagle14-7B
Text Generation
•
Updated
•
123
•
158
mlabonne/NeuralBeagle14-7B-GGUF
Updated
•
189
•
46
mlx-community/NeuralBeagle14-7B-4bit-mlx
Updated
•
12
•
4
TheBloke/NeuralBeagle14-7B-GGUF
Updated
•
467
•
26
argilla/CapybaraHermes-2.5-Mistral-7B
Updated
•
37
•
67
tasksource/deberta-small-long-nli
Zero-Shot Classification
•
Updated
•
25k
•
39
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
Updated
•
11.7k
•
102
TheBloke/CapybaraHermes-2.5-Mistral-7B-AWQ
Updated
•
292
•
21
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
2.55k
•
56
mlabonne/AlphaMonarch-7B
Text Generation
•
Updated
•
12.9k
•
148
ContextualAI/Contextual_KTO_Mistral_PairRM
Text Generation
•
Updated
•
217
•
31
mlabonne/OrpoLlama-3-8B
Text Generation
•
Updated
•
75
•
54
mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF
Updated
•
6
•
1
mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF
Updated
•
420
•
1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-GGUF
Updated
•
141
•
1
mradermacher/distilabeled-Marcoro14-7B-slerp-full-i1-GGUF
Updated
•
221
•
1
stanfordnlp/SteamSHP-flan-t5-xl
Text2Text Generation
•
Updated
•
61
•
43
stanfordnlp/SteamSHP-flan-t5-large
Text2Text Generation
•
Updated
•
30
•
33
trl-lib/llama-7b-se-peft
sileod/deberta-v3-large-tasksource-rlhf-reward-model
Text Classification
•
Updated
•
61
•
11
trl-lib/llama-7b-se-rl-peft
Updated
•
103
trl-lib/llama-7b-se-rm-peft