This is a merge of pre-trained language models created using mergekit.
The excess lm_head.weight tensor has been trimmed from the weights published at lemon07r/Gemma-2-Ataraxy-v4c-9B.
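As a rough illustration of what such trimming could look like, the sketch below drops lm_head.weight from a name-to-tensor mapping, but only when it duplicates the input embedding. The helper names and the `model.embed_tokens.weight` key are assumptions made for this example (Gemma 2 configurations typically tie the output head to the embedding); this is not the script that was actually used to prepare this mirror.

```python
from typing import Any, Callable, Dict

LM_HEAD = "lm_head.weight"
EMBED = "model.embed_tokens.weight"  # assumed embedding key, for illustration only


def trim_lm_head(
    tensors: Dict[str, Any],
    same: Callable[[Any, Any], bool] = lambda a, b: a == b,
) -> Dict[str, Any]:
    """Return a copy of `tensors` without a redundant lm_head.weight.

    The head is dropped only if it matches the embedding weights, so an
    untied head is never discarded by accident.
    """
    if LM_HEAD not in tensors or EMBED not in tensors:
        return dict(tensors)
    if not same(tensors[LM_HEAD], tensors[EMBED]):
        return dict(tensors)
    return {name: t for name, t in tensors.items() if name != LM_HEAD}


def trim_safetensors_file(src: str, dst: str) -> None:
    """Hypothetical wrapper for a single shard (needs torch and safetensors)."""
    import torch
    from safetensors.torch import load_file, save_file

    tensors = load_file(src)
    trimmed = trim_lm_head(tensors, same=torch.equal)
    save_file(trimmed, dst, metadata={"format": "pt"})
```

For a sharded checkpoint the head and the embedding may live in different files, and the `model.safetensors.index.json` weight map would also need its lm_head.weight entry removed; the sketch leaves both of those steps out.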
This model was merged using the SLERP merge method.
The following models were included in the merge:
- zelk12/recoilme-gemma-2-Ataraxy-9B-v0.1-t0.25
- lemon07r/Gemma-2-Ataraxy-v3b-9B
The following YAML configuration was used to produce this model:
```yaml
base_model: zelk12/recoilme-gemma-2-Ataraxy-9B-v0.1-t0.25
dtype: bfloat16
merge_method: slerp
parameters:
  t: 0.25
slices:
- sources:
  - layer_range: [0, 42]
    model: zelk12/recoilme-gemma-2-Ataraxy-9B-v0.1-t0.25
  - layer_range: [0, 42]
    model: lemon07r/Gemma-2-Ataraxy-v3b-9B
```
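To give a feel for what t: 0.25 means, here is a minimal pure-Python sketch of spherical linear interpolation between two weight vectors. It is an illustration under stated assumptions, not mergekit's actual implementation: the function name, the flat-list representation, and the 0.9995 fallback threshold are choices made for this example. With t = 0 the result is the first vector (the base model's weights) and with t = 1 it is the second, so t = 0.25 stays closer to the base model.

```python
import math
from typing import List, Sequence


def slerp(t: float, v0: Sequence[float], v1: Sequence[float], eps: float = 1e-8) -> List[float]:
    """Spherical linear interpolation from v0 (t=0) to v1 (t=1)."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two directions, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    # Nearly colinear vectors: fall back to plain linear interpolation.
    if abs(dot) > 0.9995:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    theta = math.acos(dot)
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


# Two orthogonal unit vectors: the result sits a quarter of the way
# along the arc from the first to the second, roughly [0.924, 0.383].
print(slerp(0.25, [1.0, 0.0], [0.0, 1.0]))
```

Unlike plain linear interpolation, the interpolated point in the example keeps unit length, which is the usual motivation for interpolating along the arc rather than the chord.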
Detailed results can be found here
| Metric | Value |
|---|---:|
| Avg. | 32.63 |
| IFEval (0-Shot) | 69.45 |
| BBH (3-Shot) | 44.13 |
| MATH Lvl 5 (4-Shot) | 17.98 |
| GPQA (0-shot) | 11.19 |
| MuSR (0-shot) | 15.30 |
| MMLU-PRO (5-shot) | 37.72 |
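The Avg. row appears to be the unweighted mean of the six benchmark scores. A quick check, with the values copied from the table and the variable names chosen for this example:

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 69.45,
    "BBH (3-Shot)": 44.13,
    "MATH Lvl 5 (4-Shot)": 17.98,
    "GPQA (0-shot)": 11.19,
    "MuSR (0-shot)": 15.30,
    "MMLU-PRO (5-shot)": 37.72,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 32.63
```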