Questions about data preparation for DPO #3

Open

opened

on Oct 22, 2024

There is no code provided for the DPO execution process described in the paper after RM scoring.
Can you explain how you built the data for DPO?

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests