Skip to content

Commit ec61f46

Browse files
Merge pull request #25 from bits-bytes-nn/paper-reviews/2305.18290v3-20251019-151041
Paper Review: Direct Preference Optimization: Your Language Model is Secretly a Reward Model
2 parents 4e92145 + 465f264 commit ec61f46

1 file changed

Lines changed: 775 additions & 0 deletions

File tree

0 commit comments

Comments
 (0)