File size: 1,333 Bytes
6fa50a2 9f57bed 6fa50a2 27acaa5 1aaa580 27acaa5 6fa50a2 e1f0a01 6fa50a2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 |
---
datasets:
- Open-Orca/SlimOrca-Dedup
- jondurbin/airoboros-3.2
license: gemma
---
# sappha-2b-v3
a slightly less experimental qlora instruct finetune of the gemma-2b base model. trained with unsloth.
## benchmarks
| | gemma-2b-it | sappha-2b-v3 | dolphin-2.8-gemma-2b |
| ---------------------- | ----------- | ------------ | -------------------- |
| MMLU (five-shot) | 36.98 | **38.02** | 37.89 |
| HellaSwag (zero-shot) | 49.22 | **51.70** | 47.79 |
| PIQA (one-shot) | 75.08 | **75.46** | 71.16 |
| TruthfulQA (zero-shot) | **37.51** | 31.65 | 37.15 |
## prompt format
basic chatml:
```
<|im_start|>system
You are a useful and helpful AI assistant.<|im_end|>
<|im_start|>user
what are LLMs?<|im_end|>
<|im_start|>assistant
LLMs, or Large Language Models, are advanced artificial intelligence systems that can perform tasks similar to human language. They are trained on vast amounts of data and can understand and respond to human queries. LLMs are often used in various applications, such as language translation, text generation, and question answering.<|im_end|>
```
## quants
gguf: https://huggingface.co/Fizzarolli/sappha-2b-v3-GGUF
## what happened to v2?
it was a private failure :) |