simplify reward structure; add reward_exact_answer #3492
Merged
copybara-service[bot] merged 1 commit intomainfrom Mar 24, 2026
Merged
simplify reward structure; add reward_exact_answer #3492copybara-service[bot] merged 1 commit intomainfrom
copybara-service[bot] merged 1 commit intomainfrom