Yousef Mroueh

IBM
Scientific, Seminar
Kantorovich Initiative Seminar: Yousef Mroueh
December 13, 2024
University of British Columbia
Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for...