Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
reward model
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
custom_code
Carbon Emissions
8-bit precision
Eval Results
Mixture of Experts
Misc with no match
Merge
text-embeddings-inference
Apply filters
Models
99
Full-text search
Edit filters
Sort: Trending
Active filters:
reward model
Clear all
Qwen/Qwen2.5-Math-RM-72B
Text Classification
•
Updated
Oct 31, 2024
•
10.5k
•
68
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
Updated
Mar 20, 2024
•
16k
•
557
berkeley-nest/Starling-RM-7B-alpha
Updated
Jul 30, 2024
•
28
•
102
ManniX-ITA/Starling-LM-7B-beta-LaserRMT-v1
Text Generation
•
Updated
Apr 12, 2024
•
19
•
2
codeIA/GuIA-v2
Text Generation
•
Updated
Apr 22, 2024
•
16
•
1
jieliu/Storm-7B
Text Generation
•
Updated
Jun 18, 2024
•
47
•
41
nvidia/Llama3-70B-SteerLM-RM
Updated
Jun 19, 2024
•
12
•
42
nvidia/Nemotron-4-340B-Reward
Updated
Jun 19, 2024
•
18
•
115
mradermacher/Storm-7B-i1-GGUF
Updated
Aug 2, 2024
•
142
•
1
internlm/internlm2-1_8b-reward
Text Classification
•
Updated
Jul 15, 2024
•
2.39k
•
10
internlm/internlm2-20b-reward
Text Classification
•
Updated
Oct 9, 2024
•
481
•
22
nvidia/Llama-3.1-Nemotron-70B-Reward
Updated
Oct 15, 2024
•
23
•
69
nvidia/Llama-3.1-Nemotron-70B-Reward-HF
Updated
Oct 15, 2024
•
9.98k
•
77
second-state/Llama-3.1-Nemotron-70B-Reward-HF-GGUF
Text Generation
•
Updated
Oct 19, 2024
•
201
•
1
yale-nlp/MDCureRM
Updated
Nov 22, 2024
•
127
•
3
mradermacher/Starling-LM-7B-alpha-GGUF
Updated
Nov 4, 2024
•
72
•
1
mradermacher/Starling-LM-7B-beta-GGUF
Updated
17 days ago
•
169
•
1
mradermacher/Starling-LM-7B-beta-i1-GGUF
Updated
17 days ago
•
800
•
1
mradermacher/Starling-LM-7B-beta-LaserRMT-v1-GGUF
Updated
5 days ago
•
191
•
1
mradermacher/GuIA-v2-GGUF
Updated
4 days ago
•
187
•
1
nicholasKluge/RewardModelPT
Text Classification
•
Updated
Jun 18, 2024
•
51
nicholasKluge/RewardModel
Text Classification
•
Updated
Jun 18, 2024
•
14
Ablustrund/moss-rlhf-reward-model-7B-zh
Updated
Jul 13, 2023
•
1
•
23
fnlp/moss-rlhf-reward-model-7B-en
Updated
Jul 13, 2023
•
9
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
11
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
11
•
1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
10
•
2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
•
Updated
Nov 27, 2023
•
14
•
1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2
Text Generation
•
Updated
Nov 27, 2023
•
11
•
2
TheBloke/Starling-LM-7B-alpha-GGUF
Updated
Nov 28, 2023
•
593
•
94
Previous
1
2
3
4
Next