SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published 15 days ago • 18
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 10 days ago • 46