BlakeRain
's Collections
General
updated
Unicron: Economizing Self-Healing LLM Training at Scale
Paper
•
2401.00134
•
Published
•
9
Astraios: Parameter-Efficient Instruction Tuning Code Large Language
Models
Paper
•
2401.00788
•
Published
•
21
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
Paper
•
2401.04398
•
Published
•
21
The Impact of Reasoning Step Length on Large Language Models
Paper
•
2401.04925
•
Published
•
16
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
•
2401.05033
•
Published
•
16
PIXART-δ: Fast and Controllable Image Generation with Latent
Consistency Models
Paper
•
2401.05252
•
Published
•
47
AToM: Amortized Text-to-Mesh using 2D Diffusion
Paper
•
2402.00867
•
Published
•
10
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Paper
•
2403.04132
•
Published
•
38
Teaching Large Language Models to Reason with Reinforcement Learning
Paper
•
2403.04642
•
Published
•
46
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and
Local Refinements
Paper
•
2402.10963
•
Published
•
10
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
•
2401.08967
•
Published
•
29
Octopus v2: On-device language model for super agent
Paper
•
2404.01744
•
Published
•
57