Status: accepted
The internal dbt benchmark remains the primary fast loop. External benchmarks are added as calibration against public SOTA, not as a replacement for the internal benchmark.
Status: accepted
Phase 1 focuses on:
BIRD mini-devSpider 2.0-LiteThis gives one broadly recognized hard benchmark and one enterprise-style benchmark before taking on more dynamic benchmark families.
Status: accepted
Runs must carry benchmark name, split, version, dialect, scorer, and environment provenance so leaderboard comparisons are honest and reviewable.
Status: accepted
LiveSQLBench is a second-wave integration. Interactive and hosted benchmark settings are explicitly deferred until query-level integrations are stable.