Dataface Tasks

Add LiveSQLBench adapter and release-tracking workflow

ID: MCP_ANALYST_AGENT-ADD_LIVESQLBENCH_ADAPTER_AND_RELEASE_TRACKING_WORKFLOW
Status: not_started
Priority: p2
Milestone: m2-internal-adoption-design-partners
Owner: data-ai-engineer-architect
Initiative: external-text-to-sql-benchmarks-and-sota-calibration

Problem

Add a second-wave integration for LiveSQLBench with explicit handling for release versions, hidden-vs-open splits, and evolving benchmark context.

Context

  • LiveSQLBench is appealing because it is newer, more dynamic, and more contamination-aware than older static benchmarks.
  • That same dynamism adds versioning and release-management complexity that makes it a poor first integration target.
  • This task exists so the second-wave work is already scoped and ordered instead of becoming a vague "maybe later" benchmark bucket.

Possible Solutions

  1. Recommended: add LiveSQLBench only after the shared contract and first adapters exist, with explicit release/version tracking. Treat it as a release-aware benchmark family where artifact provenance records exactly which release and split were used.

Why this is recommended:

  • fits the strengths of LiveSQLBench
  • avoids mixing dynamic benchmark semantics into the first adapter wave
  • gives a clean path to evolve with new releases
  2. Fold LiveSQLBench into the first implementation wave.

Trade-off: maximizes ambition, but adds too much benchmark-management complexity before the shared contract is proven.

  3. Snapshot one release and treat it as static forever.

Trade-off: simplest operationally, but discards the freshness and contamination-awareness that make LiveSQLBench valuable in the first place.
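As a rough illustration of the recommended release-aware approach, the provenance record attached to each evaluated case could carry the release and split explicitly. This is a minimal sketch, not the actual shared contract; all field names and release identifiers here are hypothetical:

```python
from dataclasses import dataclass, asdict

@dataclass(frozen=True)
class BenchmarkProvenance:
    """Hypothetical provenance record pinning exactly what was evaluated."""
    benchmark: str  # benchmark family, e.g. "livesqlbench"
    release: str    # pinned release identifier, never an implicit "latest"
    split: str      # e.g. "open" vs "hidden"
    case_id: str    # case identifier within that release

# The same case id evaluated against two releases stays distinguishable
# downstream, which is what keeps release-to-release comparisons clean.
run_a = BenchmarkProvenance("livesqlbench", "2024-06", "open", "case-001")
run_b = BenchmarkProvenance("livesqlbench", "2024-09", "open", "case-001")

assert run_a != run_b  # same case, different release => different artifact
print(asdict(run_a)["release"])
```

Making the record frozen and hashable means it can double as a cache/dedup key, so two runs against different releases can never silently collapse into one result.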

Plan

  1. Define how benchmark release/version identifiers appear in normalized case metadata and run provenance.
  2. Decide which LiveSQLBench slice is practical for local development and what should remain deferred.
  3. Add a loader and run mode that pins a specific release instead of silently drifting.
  4. Add dashboard slices that distinguish release-to-release movement from model movement.
  5. Document how new releases should be introduced without contaminating prior comparisons.
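The pinned-release loader in step 3 could be sketched roughly as follows. The release registry, identifiers, and return shape are placeholder assumptions; the point is only that loading refuses to drift to an implicit "latest":

```python
# Assumed release registry; real code would source this from benchmark metadata.
KNOWN_RELEASES = {"2024-06", "2024-09"}

def load_livesqlbench(release: str) -> dict:
    """Load a specific LiveSQLBench release, rejecting implicit drift.

    Hypothetical sketch: a real loader would fetch and normalize cases;
    here we only model the pinning behavior.
    """
    if release == "latest":
        raise ValueError("refusing to drift: pin an explicit release id")
    if release not in KNOWN_RELEASES:
        raise KeyError(f"unknown release {release!r}; register it first")
    return {"benchmark": "livesqlbench", "release": release, "cases": []}

snapshot = load_livesqlbench("2024-06")
print(snapshot["release"])
```

Forcing new releases through an explicit registry step is one way to satisfy step 5: a release can only enter runs after someone deliberately introduces it, so prior comparisons are never contaminated by silent updates.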

Implementation Progress

QA Exploration

  • [x] QA exploration completed (or N/A for non-UI tasks)

N/A: this is an implementation task with no browser flow to exercise.

Review Feedback

  • [ ] Review cleared