Pinkstack commited on
Commit
25ddc0a
·
verified ·
1 Parent(s): b6bca54

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -6
README.md CHANGED
@@ -4,20 +4,25 @@ tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
- - llama
8
  - trl
9
  - sft
10
- license: apache-2.0
 
 
11
  language:
12
  - en
 
13
  ---
14
 
 
 
 
 
15
  # Uploaded model
16
 
17
  - **Developed by:** Pinkstack
18
- - **License:** apache-2.0
19
  - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
20
 
21
- This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
22
-
23
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
+ - Phi-3
8
  - trl
9
  - sft
10
+ - qwq
11
+ - reasoning
12
+ license: mit
13
  language:
14
  - en
15
+ pipeline_tag: text-generation
16
  ---
17
 
18
+ Phi-4 that has been tuned to be more advanced at reasoning. Parm2 magic 😉
19
+
20
+ Unlike other Parm models we had to optimize out fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**
21
+
22
  # Uploaded model
23
 
24
  - **Developed by:** Pinkstack
25
+ - **License:** MIT
26
  - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
27
 
28
+ This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.