yulan-team/YuLan-Mini
Text Generation
A highly capable 2.4B-parameter lightweight LLM pre-trained on only 1T tokens of data, with all training details released (a loading sketch follows the notes below).
Note: The model & optimizer states of the last curriculum phase before learning rate annealing.
Note: The model & optimizer states of the 20th curriculum phase.
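The checkpoint listed above is tagged for text generation; below is a minimal sketch of how it could be loaded and queried with the Hugging Face transformers library. The repository id yulan-team/YuLan-Mini comes from the listing itself, while the dtype, device placement, and generation settings are assumptions rather than the team's documented usage.

```python
# Minimal sketch: load the checkpoint with Hugging Face transformers.
# Assumes the model at "yulan-team/YuLan-Mini" works with the standard
# AutoModelForCausalLM / AutoTokenizer classes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "yulan-team/YuLan-Mini"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: the 2.4B model runs comfortably in bf16
    device_map="auto",           # requires the accelerate package
)

# Simple usage matching the "Text Generation" pipeline tag.
prompt = "YuLan-Mini is a lightweight language model that"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The intermediate checkpoints described in the notes (the pre-annealing and 20th curriculum-phase states) would be loaded the same way from their own repositories or revisions, which are not named in this listing.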