Preference Optimization with Multi-Sample Comparisons
Written on October 16, 2024