Pinkstack commited on
Commit
9ce02d4
·
verified ·
1 Parent(s): 8133173

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md CHANGED
@@ -136,6 +136,19 @@ Phi-4 that has been tuned to be more advanced at reasoning.
136
 
137
  Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
138
 
 
 
 
 
 
 
 
 
 
 
 
 
 
139
  the model uses this prompt format: (modified phi-4 prompt)
140
  ```
141
  {{ if .System }}<|system|>
 
136
 
137
  Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
138
 
139
+ Beats qwen/qwq at MATH & MuSR & GPQA (MuSR being a reasoning benchmark)
140
+ Evaluation:
141
+
142
+
143
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/csbdGKzGcDVMPRqMCoH8D.png)
144
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/HR9WtjBhE4h6wrq88FLAf.png)
145
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/GLt4ct4yAVMvYEpoYO5o6.png)
146
+
147
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/CP9UF9kdBT_SW8Q79PSui.png)
148
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/doEIqDrM639hRPSg_J6AF.png)
149
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/yl5Et2TkCoYuIrNpDhZu9.png)
150
+
151
+
152
  the model uses this prompt format: (modified phi-4 prompt)
153
  ```
154
  {{ if .System }}<|system|>