Inference-Time Policy Steering through Human Interactions Paper • 2411.16627 • Published Nov 25, 2024 • 1
Grounding Language Plans in Demonstrations Through Counterfactual Perturbations Paper • 2403.17124 • Published Mar 25, 2024
Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation Paper • 2305.07804 • Published May 12, 2023 • 2