view post Post 1207 News! ChemVLM Codes Opensource Now! https://github.com/AI4Chem/ChemVlm See translation
view post Post 2560 LLaMA-O1-PRM and LLaMA-O1-Reinforcement will release in this weekend.We have implemented a novel Reinforcement finetune(RFT) pipeline that taught models learning reasoning and reward labeling without human annotation. See translation
LLaMA-O1-1129 Datasets, Models, Codes and Papers SimpleBerry/LLaMA-O1-Supervised-1129 Text Generation • Updated Dec 3, 2024 • 521 • 18 SimpleBerry/LLaMA-O1-Base-1127 Text Generation • Updated Dec 3, 2024 • 76 • 17 SimpleBerry/OpenLongCoT-Pretrain-1202 Viewer • Updated Dec 2, 2024 • 135k • 88 • 2 SimpleBerry/OpenLongCoT-SFT Viewer • Updated Dec 2, 2024 • 332k • 161 • 15
Multi-Corpus Datasets For ChemLLM YeungNLP/firefly-train-1.1M Viewer • Updated Apr 10, 2023 • 1.65M • 1.08k • 294 stingning/ultrachat Viewer • Updated Feb 22, 2024 • 774k • 1.27k • 429 Open-Orca/OpenOrca Viewer • Updated Oct 21, 2023 • 2.91M • 8.1k • 1.35k Vezora/Tested-143k-Python-Alpaca Viewer • Updated Mar 23, 2024 • 143k • 294 • 39