Pinkstack
/

SuperThoughts-CoT-14B-16k-o1-QwQ

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on 4 days ago

Commit

25ddc0a

·

verified ·

1 Parent(s): b6bca54

Update README.md

Files changed (1) hide show

README.md +11 -6

README.md CHANGED Viewed

@@ -4,20 +4,25 @@ tags:
 - text-generation-inference
 - transformers
 - unsloth
-- llama
 - trl
 - sft
-license: apache-2.0
 language:
 - en
 ---
 # Uploaded  model
 - **Developed by:** Pinkstack
-- **License:** apache-2.0
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 - text-generation-inference
 - transformers
 - unsloth
+- Phi-3
 - trl
 - sft
+- qwq
+- reasoning
+license: mit
 language:
 - en
+pipeline_tag: text-generation
 ---
+Phi-4 that has been tuned to be more advanced at reasoning. Parm2 magic 😉
+Unlike other Parm models we had to optimize out fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
 # Uploaded  model
 - **Developed by:** Pinkstack
+- **License:** MIT
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
+This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.