docs: Add DSQL loader operations reference#176
Conversation
90abd0d to
4161f36
Compare
Functional Eval Results (with-skill, live run)Ran evals 10-12 with the skill loaded. All 11/11 expectations pass.
The skill teaches DSQL-loader-specific operational knowledge (partition warming, tmpfs defaults, header flag semantics) that the agent cannot infer from general training data. |
Code reviewDSQL skill PR adds 20-agent fleet ran per dsql-skill-author Workflow 2. Findings post 5-gate validation:
Items considered and dropped (audit trail)
🤖 Generated with Claude Code — 20-agent fleet per dsql-skill-author Workflow 2 §1 roster, all findings 5-gate validated at head SHA If this code review was useful, please react with 👍. Otherwise, react with 👎. |
Add references/data-loading.md covering aurora-dsql-loader operations: - Fresh-vs-warm partition behavior and throughput expectations - Resume/retry mechanics (--manifest-dir, --resume-job-id) - Conflict handling (--on-conflict do-nothing) - CSV/TSV header handling (--header flag, v3.0.0 default) - Schema inference caveats and --dry-run validation - Index count impact on throughput - Diagnostic decision tree for slow loads SKILL.md changes: - Add Workflow 3: Bulk Data Loading with key constraints - Add data loading to overview and Quick Start - Add trigger keywords (aurora-dsql-loader, bulk load, etc.) - Add data-loading.md reference entry with When/Contains - Add cross-reference from connectivity-tools.md Eval coverage: - 3 should-trigger + 2 should-not-trigger entries in trigger_evals.json - 3 functional evals (IDs 10-12) in evals.json with LLM judge grading - data_loading_eval_results.md with expected with-skill vs baseline comparison demonstrating the skill teaches operational knowledge not in general training data (partition warming, tmpfs defaults, header flag semantics)
4161f36 to
d093231
Compare
5687cab to
3e1d908
Compare
- Condense data-loading.md from 242 to 166 lines (remove verbose explanations per reviewer feedback) - Add RFC 2119 directives (MUST/SHOULD) for --manifest-dir, --header, --on-conflict preconditions, and schema inference validation - Add silent-failure callout for schema inference caveats - Fix --on-conflict semantics: triggers on any unique constraint, not just PK; add precondition that source must not have changed for crash recovery - Rename 'Going Deeper' header to 'When to load the full reference' with clear agent trigger condition - Fix 'slower than expected' to 'slow load times' in SKILL.md - Trim redundant description triggers (bulk load DSQL, DSQL data loading) - Fix workflow-number drift: Workflow 6 → 7 in dsql-lint.md, Workflow 8 → 9 in README/evals/runner - Fix README assertion count 43 → 42 - Remove standalone 'Related References' section (cross-ref inlined at top) - Add eval results with baseline vs with-skill comparison
3e1d908 to
86ec0da
Compare
- Partition behavior: replace bullet list with summary paragraph + link to published DSQL primary keys documentation - Common pitfall: rewrite as 'Agent guidance' with explicit action (advise user to check for duplicate PKs, recommend de-duplication) - Index section: remove explanatory preamble, keep only the two actionable bullets
b6a44e2 to
385b469
Compare
- Reconcile SKILL.md against main: keep main's structure (awslabs#176 added Workflow 3 Bulk Data Loading), add PR 168's PostgreSQL Migrations / ORM Guides / OCC Retry references and renumber the new workflows to 10 (Full PG to DSQL Schema Migration) and 11 (ORM Migration). - Restore the correct MCP tool name `dsql_lint` (with underscore) across SKILL.md, dsql-lint.md, development-guide.md, and the new pg-migrations/* files. The PR had renamed it to `dsql-lint` (dash), which does not match the registered tool. - Bring main's data-loading.md and other merge changes into the branch.
Add a comprehensive data-loading reference for the aurora-dsql-loader, covering:
Also adds:
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of the project license.