Gravel 4B

Continued-pretraining of qingy2024/Qwen2.5-4B on 143M tokens from HuggingFaceTB/finemath 4-plus

Safetensors

Model size

3.86B params

Tensor type

BF16

Inference Examples

Unable to determine this model's library. Check the docs .

Model tree for qingy2024/Gravel-3.8B-Base

Base model

Qwen/Qwen2.5-3B

Finetuned

Finetuned

(1)

this model