Multimodal Preference Data Synthetic Alignment with Reward Model Paper • 2412.17417 • Published 26 days ago • 1