Update README.md
Browse files
README.md
CHANGED
@@ -136,6 +136,19 @@ Phi-4 that has been tuned to be more advanced at reasoning.
|
|
136 |
|
137 |
Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
|
138 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
139 |
the model uses this prompt format: (modified phi-4 prompt)
|
140 |
```
|
141 |
{{ if .System }}<|system|>
|
|
|
136 |
|
137 |
Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
|
138 |
|
139 |
+
Beats qwen/qwq at MATH & MuSR & GPQA (MuSR being a reasoning benchmark)
|
140 |
+
Evaluation:
|
141 |
+
|
142 |
+
|
143 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/csbdGKzGcDVMPRqMCoH8D.png)
|
144 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/HR9WtjBhE4h6wrq88FLAf.png)
|
145 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/GLt4ct4yAVMvYEpoYO5o6.png)
|
146 |
+
|
147 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/CP9UF9kdBT_SW8Q79PSui.png)
|
148 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/doEIqDrM639hRPSg_J6AF.png)
|
149 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/yl5Et2TkCoYuIrNpDhZu9.png)
|
150 |
+
|
151 |
+
|
152 |
the model uses this prompt format: (modified phi-4 prompt)
|
153 |
```
|
154 |
{{ if .System }}<|system|>
|