ORPO: Monolithic Preference Optimization without Reference Model Paper โข 2403.07691 โข Published Mar 12, 2024 โข 64
view article Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 โข 108
Gemma release Collection Groups the Gemma models released by the Google team. โข 40 items โข Updated 25 days ago โข 328
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper โข 2307.02486 โข Published Jul 5, 2023 โข 80