Pinkstack
/

SuperThoughts-CoT-14B-16k-o1-QwQ

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on 3 days ago

Commit

3110d47

·

verified ·

1 Parent(s): 3b9a783

Update README.md

Files changed (1) hide show

README.md +15 -3

README.md CHANGED Viewed

@@ -18,18 +18,30 @@ inference:
 widget:
 - messages:
   - role: user
-    content: How many R's in strawberry? Think step by step.
 library_name: transformers
 ---
 gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf
 [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)
-Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉
 Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
-NOTE: more information soon, gguf
 # Uploaded  model

 widget:
 - messages:
   - role: user
+    content: How many R's in strawberry? Think step by step.
 library_name: transformers
+datasets:
+- amphora/QwQ-LongCoT-130K
+base_model:
+- microsoft/phi-4
 ---
 gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf
 [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)
+Phi-4 that has been tuned to be more advanced at reasoning.
 Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
+Please use this prompt format to enable advanced reasoning:
+```
+{{ if .System }}<|system|>
+{{ .System }}<|im_end|>
+{{ end }}{{ if .Prompt }}<|user|>
+{{ .Prompt }}<|im_end|>
+{{ end }}<|assistant|>{{ .CoT }}<|CoT|>
+{{ .Response }}<|FinalAnswer|><|im_end|>
+```
 # Uploaded  model