New reference implementation: Misalignment evaluations#108
Open
ethancjackson wants to merge 59 commits into
Open
New reference implementation: Misalignment evaluations#108ethancjackson wants to merge 59 commits into
ethancjackson wants to merge 59 commits into
Commits
Commits on Mar 18, 2026
Commits on Mar 19, 2026
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Mar 23, 2026
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Mar 24, 2026
Commits on Apr 22, 2026
Commits on Apr 28, 2026
Commits on Apr 29, 2026
Commits on May 14, 2026
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
fix(misalignment_qa): fix 10→9 task count in README; export ExamplesInjectMode; remove stale pycache
andcommitted- andcommitted
fix(misalignment_qa): drop temperature for LiteLLM providers to fix Anthropic claude-opus-4 failures
andcommitted- andcommitted
fix(misalignment_qa): consistent temperature=0.2 across all providers, null only for claude-opus-4-7
andcommitted- andcommitted
- committed
- committed
- andcommitted
- andcommitted
- committed
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- committed
Commits on May 20, 2026
- andcommitted
- andcommitted
- committed
- andcommitted
- committed
- committed