Skip to content

Restore Gemini 3.1 Pro data in Paper 1#4

Open
byapparov wants to merge 4 commits into
feature/reorganize-into-paper-foldersfrom
feat/restore-gemini-paper1
Open

Restore Gemini 3.1 Pro data in Paper 1#4
byapparov wants to merge 4 commits into
feature/reorganize-into-paper-foldersfrom
feat/restore-gemini-paper1

Conversation

@byapparov
Copy link
Copy Markdown
Contributor

Summary

  • Reverted main.tex to pre-removal version (commit 1e1f15b) restoring all 5-model / 737-run / 3-family numbers, Gemini table rows, and discussion text
  • Updated compute_stats.py and generate_figures.py to load Gemini data from papers/3-kpi-targets/gemini_scores.csv (with sql-querysql domain normalization)
  • Regenerated all 4 figures and rebuilt PDF — verified 737 runs, FR=33.4%/8.4%, pooled MD=9.8%/PC=7.1%

Test plan

  • compute_stats.py outputs 737 runs matching all original tex numbers
  • Figures show 5 models including Gemini 3.1 Pro
  • PDF renders correctly (9 pages)

🤖 Generated with Claude Code

byapparov and others added 4 commits February 26, 2026 11:53
Revert main.tex to pre-removal version (1e1f15b) with all 5-model/737-run/3-family numbers. Update compute_stats.py and generate_figures.py to load Gemini data from papers/3-kpi-targets/gemini_scores.csv. Regenerate all figures and PDF.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Soften 'identical semantic content' claim to acknowledge pseudocode's
structural priors (typed fields, enums, validation functions). Add
mechanistic hypothesis for Gemini SQL anomaly (over-literal PerFileRules
interpretation). Expand future work with ablation studies and hybrid
format suggestions. Update threats to validity accordingly.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Add 4 new verified citations: Willard & Louf (constrained decoding),
  Liu et al. (Text-to-SQL survey), Lyu et al. (Dockerfile generation,
  ICSE 2026), Madaan et al. (Self-Refine, NeurIPS 2023)
- Integrate citations into Related Work (Code as Prompt, new Constrained
  Decoding paragraph) and Discussion (Relation to constrained decoding)
- Fix Table 2 column overflow by shortening task names and header
- Add metadata reminder comment for ACM rights management
- Add QA.md documenting run independence analysis for reviewer questions

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Use relative paths with a common prefix note instead of full
repository paths to keep \texttt{} blocks within column width.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant