Data Copilot: MCP Tool & Data Standardization¶
This document outlines the standardization of tool and data access for the Data Copilot using the Model Context Protocol (MCP). By adopting MCP, we ensure that specialized agents in the Text-to-SQL pipeline can interact with diverse data sources (SQL, docs, APIs) through a unified, machine-parseable interface.
Goal¶
Design a "free/cheap-first" standardization layer that allows the Data Copilot to scale across new domains without hard-coding database connectors or API clients.
MCP Integration Matrix¶
| Tool Type | MCP Capability | Cost Profile | Implementation |
|---|---|---|---|
| SQL Database | Query/Resource | Free (OSS) | mcp-server-sqlite or mcp-server-postgres |
| Documentation | Resource (Text) | Free (Local) | mcp-server-files over Markdown docs |
| KPI Glossary | Resource (JSON) | Free (Local) | Custom JSON Resource server |
| External APIs | Tool (REST) | Low (API Keys) | fetch or specialized MCP servers (e.g., Jira, Slack) |
| Metadata | Resource (Schema) | Free (Local) | SQL introspection MCP server |
Resource vs Tool Patterns¶
When designing MCP servers for Data Copilots, follow these patterns:
- Resources: Use for static or slowly changing context (e.g., Schema, KPI definitions, archived SOPs). The agent "reads" these to build its world model.
- Tools: Use for actions or live data retrieval (e.g., execute_query, fetch_current_weather, create_jira_ticket). The agent "calls" these to interact with the world.
Concrete MCP Integration Examples¶
1. SQL Query Server¶
- Role: Provides the
execute_querytool to the SQL Generator agent. - Example:
mcp-server-sqlite --db inventory.db. - Standardization: Ensures all SQL generators receive schema context in the same format.
2. Documentation RAG (Docs Retrieval)¶
- Role: Allows the Intent Agent to look up domain definitions in the "Knowledge Base".
- Example: MCP server exposing
docs/services/as searchable resources. - Benefit: No need for a heavy Vector DB for small, high-density SOP documents.
3. KPI & Metric Glossary¶
- Role: Centralized source of truth for metric definitions (e.g., "What is Net Margin?").
- Example: A JSON-file resource server mapped to
data/kpi_glossary.json. - Standardization: Prevents LLMs from hallucinating calculation logic.
4. Home Assistant API Tool¶
- Role: Enables the Copilot to query live state (e.g., current power usage) for hybrid diagnostic queries.
- Example: MCP server wrapping Home Assistant REST API.
5. Metadata/Schema Inspector¶
- Role: Provides the Table and Column Prune agents with up-to-date schema information.
- Benefit: Decouples the agent from the specific DB dialect (SQLite vs Postgres).
Minimal MCP Server Set for Small Teams¶
For a home-office or small team setup, start with these three: 1. Filesystem MCP: Exposes Markdown docs and JSON configurations. 2. SQL MCP: Specific to your primary database (e.g., SQLite). 3. Fetch MCP: For lightweight web requests/API integrations.
Migration Path from Hard-coded Connectors¶
- Phase 1 (Shadow): Deploy MCP servers alongside existing Python connectors.
- Phase 2 (Abstraction): Update Pydantic models in the Data Copilot pipeline to accept MCP resource URIs instead of raw connection strings.
- Phase 3 (Cutover): Replace direct
sqlite3orrequestscalls withmcp_client.call_tool().
Security & Auth Boundaries¶
- Least Privilege: SQL MCP servers should use read-only credentials with
LIMITenforcement. - Auditability: All MCP tool calls are logged by the agent orchestrator (n8n or LangGraph).
- Authentication: MCP servers should be restricted to the local network/Tailscale mesh with token-based access.
- Network Isolation: For high-security home labs, run MCP servers in a dedicated "Automation" VLAN or a Tailscale "tag" group that only allows connections from the agent orchestrator node.
Related tools / concepts¶
- Data Copilot Text-to-SQL Architecture
- Data Copilot Agentic RAG
- Data Copilot SQL Validation
- Answer Synthesis Schema
Sources / References¶
Related tools / concepts¶
- Data Copilot Architecture
- Data Copilot Agentic RAG
- Tool Calling & Model Context Protocol (MCP)
- Claude Tool Search
Contribution Metadata¶
- Last reviewed: 2026-04-30
- Confidence: high
- Related Issues: #187