Context Catalog and Nimble
Purpose
Context architecture/catalog and Nimble integration into inspect, MCP, and generation flows. This workstream defines how data context is structured, stored, and surfaced across the product — the "catalog" of what we know about a user's data and how it's made available to humans and AI. Nimble is the methodology for lightweight dbt model linting and context enrichment. This workstream owns the context schema, the Nimble rule engine, and the integration points where context flows into inspect (profiler output), MCP (agent tools), and dashboard generation (smarter defaults). Adjacent to inspect-profiler (which produces raw context) and mcp-analyst-agent (which consumes context for AI workflows).
Owner
- Data AI Engineer Architect
Initiatives
- Profiling Foundation Layers 1-5 — Completed, M0 — Prototype, 2 / 2 tasks complete (100%)
- Description Enrichment Pipeline — Planned, M1 — 5T Internal Pilot Ready, 2 / 2 tasks complete (100%)
- Grain Inference and Fanout Risk — Ready For Eng, M1 — 5T Internal Pilot Ready, 2 / 2 tasks complete (100%)
- Layer 6 Relationship Mapping — Planned, M1 — 5T Internal Pilot Ready, 1 / 2 tasks complete (50%)
- MCP Catalog and Agent Tools — In Progress, M1 — 5T Internal Pilot Ready, 4 / 4 tasks complete (100%)
- Question-aware schema retrieval and narrowing — Planned, M2 — Internal Adoption + Design Partners, 0 / 4 tasks complete (0%)
- External Context Sources — Planned, MX — Far Future, 0 / 0 tasks complete (0%)
Tasks by Milestone
M0 — Prototype
A runnable prototype path exists for context schema/catalog contracts and Nimble enrichment flows across product surfaces, with concrete artifacts that prove the flow works end-to-end in the current codebase. Core assumptions are documented, known constraints are explicit, and the team can explain what is real versus mocked without ambiguity.
- AI_CONTEXT core MCP tools built MCP Catalog and Agent Tools Completed — Completed core MCP tool set (catalog, execute_query, render_dashboard, list_sources) for AI context consumption and act…
- AI_CONTEXT profiling layers 1-5 foundation built Profiling Foundation Layers 1-5 Completed — Completed baseline profiling foundation (schema, enrichment, stats, samples, semantic/quality inference) used by AI_CON…
- AI_CONTEXT schema context formatter and MCP resources built MCP Catalog and Agent Tools Completed — Completed token-efficient schema context formatter and MCP resources that expose pre-built AI context to agents.
- AI_CONTEXT table description ingestion built Profiling Foundation Layers 1-5 Completed — Completed ingestion of database table descriptions into profiling output as baseline semantic enrichment.
- Prototype gaps and follow-on capture Completed — Document top gaps and risks in cross-surface context contract that must be addressed next.
- Prototype implementation path Completed — Implement a runnable end-to-end prototype path for context schema model.
- Prototype validation and proof Completed — Validate context enrichment rules with concrete proof artifacts and repeatable steps.
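The "token-efficient schema context formatter" above can be sketched roughly as follows: render catalog entries as compact one-line-per-table text instead of verbose JSON, so more schema fits in an agent's context window. The input shape and output format are assumptions, not the shipped formatter.

```python
# Sketch of a token-efficient schema context formatter (assumed input shape).
def format_schema_context(tables: list[dict]) -> str:
    lines = []
    for t in tables:
        # One compact line per table: name(col:type [SEMANTIC], ...) rows=N
        cols = ", ".join(
            f"{c['name']}:{c['dtype']}"
            + (f" [{c['semantic_type']}]" if c.get("semantic_type") else "")
            for c in t["columns"]
        )
        lines.append(f"{t['name']}({cols})  rows={t['row_count']}")
    return "\n".join(lines)

catalog = [{
    "name": "orders",
    "row_count": 12000,
    "columns": [
        {"name": "order_id", "dtype": "BIGINT"},
        {"name": "amount", "dtype": "DECIMAL", "semantic_type": "CURRENCY"},
    ],
}]
print(format_schema_context(catalog))
# orders(order_id:BIGINT, amount:DECIMAL [CURRENCY])  rows=12000
```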
M1 — 5T Internal Pilot Ready
Internal analysts can execute at least one weekly real workflow that depends on context schema/catalog contracts and Nimble enrichment flows across product surfaces in the 5T Analytics environment, without bespoke engineering intervention for every run. Instrumentation and feedback capture are in place so failures, friction points, and adoption gaps are visible and triaged with owners.
- AI_CONTEXT beta health and readiness scorecard Completed — Define and track AI_CONTEXT beta health metrics so M1 go/no-go is based on coverage, quality, and analyst usability sig…
- AI_CONTEXT layer 6 relationship mapping for pilot datasets Layer 6 Relationship Mapping Completed — Implement cross-table relationship mapping in AI_CONTEXT so join graph context is available to agents during M1 workflo…
- AI_CONTEXT metadata contract v1 for pilot MCP Catalog and Agent Tools Completed — Solidify AI_CONTEXT data format into a versioned contract with clear field semantics and compatibility rules for beta u…
- Description priority merge in MCP context output Description Enrichment Pipeline Completed — Implement deterministic description-source merging in MCP context output so AI tools receive stable best-available sema…
- Ingest dbt schema.yml descriptions into AI_CONTEXT Description Enrichment Pipeline Completed — Merge dbt model and column descriptions into AI_CONTEXT so human-authored semantics are available during pilot analysis.
- AI_CONTEXT grain and fanout risk signals (beta subset) Grain Inference and Fanout Risk Completed — Ship grain candidate, join multiplicity, and fanout risk metadata in AI_CONTEXT to reduce unsafe aggregate query genera…
- dft inspect native CSV support via ephemeral DuckDB Completed — dft inspect cannot profile CSV sources today because the inspector only supports SQL databases. It should handle CSVs n…
- dft inspect: build complete self-contained catalog in target/inspect.json Completed — dft inspect should be the single command that builds a complete, self-contained catalog artifact in target/inspect.json…
- Incremental dft inspect with lineage-aware change detection Completed — dft inspect should skip re-profiling tables whose source data and upstream lineage have not changed since the last insp…
- Move playground examples to DuckDB and ship pre-built inspect.json Completed — Playground examples currently use raw CSV files via CsvAdapter with Python stdlib csv.DictReader - no SQL, no joins, no…
- search_dashboards MCP tool for pilot context workflows MCP Catalog and Agent Tools Completed — Add search_dashboards MCP tool so pilots can discover relevant existing dashboards and reuse validated query patterns.
- Research deterministic column fanout risk signals and AI context surfacing Grain Inference and Fanout Risk Completed — Synthesize how fanout risk maps to columns versus edges, the deterministic profiling and dbt signals available, and options to surface…
M2 — Internal Adoption + Design Partners
Context schema/catalog contracts and Nimble enrichment flows across product surfaces are hardened enough for regular use by multiple internal teams and initial design partners, with a predictable response loop for issues and requests. Quality expectations are documented, and prioritized improvements from real usage are actively incorporated into delivery.
- Adoption hardening for internal teams — Harden context schema model for repeated use across multiple internal teams and first design partners.
- Build question-aware schema search and isolation CLI over inspect.json and dbt metadata Question-aware schema retrieval and narrowing — Build a local file-and-CLI retrieval layer that composes inspect.json, dbt schema metadata, and lightweight docs into a…
- Compare text-to-SQL evals with question-aware retrieval vs full-context prompting Question-aware schema retrieval and narrowing Waiting on build-question-aware-schema-search-and-isolation-cli-over-inspect-json-and-dbt-metadata, wire-question-scoped-context-bundles-into-text-to-sql-eval-backends — After the retrieval CLI and bundle integration land, run paired local evals comparing question-aware retrieval-and-isol…
- Design-partner feedback loop operations — Operationalize rapid feedback-to-fix loop for context enrichment rules with explicit decision logs.
- feat: chart decisions Phase 4 — SQLGlot column lineage Layer 6 Relationship Mapping — Implement SQLGlot column lineage integration to enrich chart decisions with column-level dependency context.
- Quality standards and guardrails — Define and enforce quality standards for cross-surface context contract to keep output consistent as contributors expan…
- Stabilize context catalog schema v1 — Finalize context schema contracts and integration points across inspect, generation, and agent flows.
- Static semantic type propagation through SQL queries via SQLGlot — Use SQLGlot Expression.meta to propagate profiler-detected semantic types like CURRENCY, EMAIL, CREATED_AT through SQL…
- Wire question-scoped context bundles into text-to-SQL eval backends Question-aware schema retrieval and narrowing — Teach the shared SQL generation and eval backend layer to consume question-scoped context bundles from the retrieval CL…
- Iterate on question-aware retrieval with interface and result experiments Question-aware schema retrieval and narrowing Waiting on compare-text-to-sql-evals-with-question-aware-retrieval-vs-full-context-prompting — After the initial A/B eval comparing retrieval versus full-context prompting, run a small set of follow-up experiments…
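The core of question-aware narrowing can be sketched as scoring each catalog table by term overlap with the question and keeping the top-k, so the SQL generator sees only relevant schema instead of the full catalog. This is a deliberately naive bag-of-words baseline under an assumed catalog shape, not the planned retrieval CLI:

```python
import re

def narrow_tables(question: str, tables: list[dict], k: int = 2) -> list[str]:
    """Rank tables by keyword overlap with the question; keep the top-k."""
    q_terms = set(re.findall(r"[a-z]+", question.lower()))
    def score(t: dict) -> int:
        terms = set(re.findall(r"[a-z]+", t["name"].lower()))
        for col in t["columns"]:
            terms |= set(re.findall(r"[a-z]+", col.lower()))
        return len(q_terms & terms)
    ranked = sorted(tables, key=score, reverse=True)
    return [t["name"] for t in ranked[:k] if score(t) > 0]

catalog = [
    {"name": "orders", "columns": ["order_id", "customer_id", "amount"]},
    {"name": "customers", "columns": ["customer_id", "email"]},
    {"name": "web_events", "columns": ["event_id", "page"]},
]
print(narrow_tables("total order amount per customer", catalog))
# ['orders', 'customers']
```

The planned A/B evals would compare bundles built this way against full-context prompting on the same questions.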
Launch
Launch scope for context schema/catalog contracts and Nimble enrichment flows across product surfaces is complete, externally explainable, and supportable: user-facing behavior is stable, documentation is publishable, and operational ownership is explicit. Remaining gaps are non-blocking, risk-assessed, and tracked as post-launch follow-up rather than unresolved launch debt.
- Launch docs and external readiness — Publish external-facing documentation and examples for context enrichment rules that are executable by new users.
- Launch operations and reliability readiness — Finalize operational readiness for cross-surface context contract: telemetry, alerting, support ownership, and incident…
- Public launch scope completion — Complete launch-critical scope for context schema model with production-safe behavior and rollback clarity.
- Add join-path grounding to question-scoped context bundles Waiting on build-question-aware-schema-search-and-isolation-cli-over-inspect-json-and-dbt-metadata — Teach bundles to surface likely join paths and key relationships between retained tables so the SQL generator sees how…
- Build lightweight value hints retrieval from inspect artifacts Waiting on build-question-aware-schema-search-and-isolation-cli-over-inspect-json-and-dbt-metadata — Expose cheap static filter-disambiguation hints such as enum-like values, date ranges, and high-signal categorical member…
- QUERY_VALIDATOR foundation and first integrations — Build the first query validator path using SQLGlot plus schema, profile, grain, and relationship context for query review…
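The join-path grounding task above reduces to graph search: given Layer 6 relationship edges, find how two retained tables connect so the bundle can tell the SQL generator which joins to use. A sketch using breadth-first search over an assumed (table_a, key_a, table_b, key_b) edge format:

```python
from collections import deque

def find_join_path(edges, start, goal):
    """BFS over relationship edges; returns a list of join conditions or None."""
    graph = {}
    for a, ka, b, kb in edges:
        # Edges are undirected for path-finding purposes.
        graph.setdefault(a, []).append((b, f"{a}.{ka} = {b}.{kb}"))
        graph.setdefault(b, []).append((a, f"{a}.{ka} = {b}.{kb}"))
    queue, seen = deque([(start, [])]), {start}
    while queue:
        table, path = queue.popleft()
        if table == goal:
            return path
        for nxt, join in graph.get(table, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [join]))
    return None  # no known relationship path

edges = [
    ("orders", "customer_id", "customers", "customer_id"),
    ("orders", "order_id", "order_items", "order_id"),
]
print(find_join_path(edges, "customers", "order_items"))
```

BFS keeps the path shortest, which matters because every extra hop is another fanout risk the validator would need to check.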
v1.0 — Post-Launch Stabilization
Post-launch stabilization is complete for context schema/catalog contracts and Nimble enrichment flows across product surfaces: recurring incidents are reduced, support burden is lower, and quality gates are enforced consistently before release. The team has a repeatable operating model for maintenance, regression prevention, and measured reliability improvements.
- Regression prevention and quality gates — Add or enforce regression gates around context enrichment rules so release quality is sustained automatically.
- Sustainable operating model — Document and adopt sustainable operating model for cross-surface context contract across support, triage, and release c…
- v1.0 stability and defect burn-down — Run stability program for context schema model with recurring defect burn-down and reliability trend tracking.
- Evaluate question-scoped bundle compression strategies Waiting on compare-text-to-sql-evals-with-question-aware-retrieval-vs-full-context-prompting — Compare raw narrowed dumps against more structured or explanation-rich bundle shapes so the team can pick the smallest…
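One way the regression-gate task could work: diff freshly generated enrichment output against a committed golden snapshot and fail the release check on any drift. The flattened key format and sample values below are illustrative assumptions:

```python
def check_against_golden(current: dict, golden: dict) -> list[str]:
    """Return human-readable diffs; an empty list means the gate passes."""
    diffs = []
    for key in sorted(set(current) | set(golden)):
        if current.get(key) != golden.get(key):
            diffs.append(f"{key}: {golden.get(key)!r} -> {current.get(key)!r}")
    return diffs

# Golden snapshot committed to the repo vs. freshly generated enrichment.
golden = {"orders.amount.semantic_type": "CURRENCY", "orders.grain": ["order_id"]}
current = {"orders.amount.semantic_type": "CURRENCY", "orders.grain": ["order_id", "line"]}
print(check_against_golden(current, golden))
```

A gate like this turns silent enrichment drift into an explicit review step before release.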
v1.2 — Depth Expansion
v1.2 delivers meaningful depth improvements in context schema/catalog contracts and Nimble enrichment flows across product surfaces based on observed usage and retention signals, not just roadmap intent. Enhancements improve real customer outcomes, and release readiness is demonstrated through metrics, regression coverage, and clear migration guidance where relevant.
- Quality and performance improvements — Ship measurable quality/performance improvements in context enrichment rules tied to user-facing outcomes.
- v1.2 depth expansion — Deliver depth expansion in context schema model prioritized by observed usage and retention outcomes.
- v1.2 release and migration readiness — Prepare v1.2 release/migration readiness for cross-surface context contract, including communication and upgrade guidan…
- Evaluate Snowflake semantic views and lineage as context sources — Investigate whether Snowflake semantic views, Cortex Analyst instructions, and GET_LINEAGE metadata should be ingested…
MX — Far Future
Long-horizon opportunities for context schema/catalog contracts and Nimble enrichment flows across product surfaces are captured as concrete hypotheses with user impact, prerequisites, and evaluation criteria. Ideas are ranked by strategic value and feasibility so future investment decisions can be made quickly with less rediscovery.
- Experiment design for future bets — Design validation experiments for cross-surface context contract so future bets can be tested before major investment.
- Future opportunity research — Capture long-horizon opportunities for context schema model with user impact and strategic fit.
- Prerequisite and dependency mapping — Map enabling prerequisites and dependencies for context enrichment rules to reduce future startup cost.