Gravel 4B

Continued-pretraining of qingy2024/Qwen2.5-4B on 143M tokens from HuggingFaceTB/finemath 4-plus

Downloads last month
3
Safetensors
Model size
3.86B params
Tensor type
BF16
·
Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for qingy2024/Gravel-3.8B-Base

Base model

Qwen/Qwen2.5-3B
Finetuned
(1)
this model

Dataset used to train qingy2024/Gravel-3.8B-Base