Projects with this topic
Sort by:
-
https://github.com/princeton-nlp/SimPO SimPO: Simple Preference Optimization with a Reference-Free Reward
Updated
https://github.com/princeton-nlp/SimPO SimPO: Simple Preference Optimization with a Reference-Free Reward