rlhf

Projects with this topic

mirrored_repos / MachineLearning / princeton-nlp / SimPO

https://github.com/princeton-nlp/SimPO SimPO: Simple Preference Optimization with a Reference-Free Reward

alignment Large Langua... rlhf preference-a...

Updated Feb 16, 2025
