Pinkstack
/

SuperThoughts-CoT-14B-16k-o1-QwQ

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on about 23 hours ago

Commit

9ce02d4

·

verified ·

1 Parent(s): 8133173

Update README.md

Files changed (1) hide show

README.md +13 -0

README.md CHANGED Viewed

@@ -136,6 +136,19 @@ Phi-4 that has been tuned to be more advanced at reasoning.
 Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
 the model uses this prompt format: (modified phi-4 prompt)
 ```
 {{ if .System }}<|system|>

 Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
+Beats qwen/qwq at MATH & MuSR & GPQA (MuSR being a reasoning benchmark)
+Evaluation:
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/csbdGKzGcDVMPRqMCoH8D.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/HR9WtjBhE4h6wrq88FLAf.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/GLt4ct4yAVMvYEpoYO5o6.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/CP9UF9kdBT_SW8Q79PSui.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/doEIqDrM639hRPSg_J6AF.png)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/yl5Et2TkCoYuIrNpDhZu9.png)
 the model uses this prompt format: (modified phi-4 prompt)
 ```
 {{ if .System }}<|system|>