Preference Optimization with Multi-Sample Comparisons

Written on October 16, 2024