Skip to content

DocKit Agentic Data Studio

DocKit's Agentic Data Studio is an AI agent that talks to your databases in natural language. Tell it what you need and it writes queries, inspects schemas, updates documents, deletes records, or creates indexes — all in one conversation.

The Sidebar Assistant gives you the same AI in a chat panel, so you can ask questions and generate queries without switching views.

DocKit Data Studio list indices

What the Agent Can Do

The agent has tools for every supported database:

DatabaseCapabilities
Elasticsearch / OpenSearch / EasySearchSearch, get, index, update, delete documents · List indices, get mappings, create/delete indices · Manage aliases (list, get, create, delete, bulk update) · Add or update field mappings
DynamoDBExecute PartiQL queries (SELECT, INSERT, UPDATE, DELETE) · Describe and list tables
MongoDBFind documents, run aggregation pipelines · Insert, update, delete documents · List databases and collections

Each tool has a risk level (Safe for read-only, Elevated for create/update, or Destructive for delete) and needs the matching permission to run.

Quick Start

Open Settings → LLMs and add an AI provider (OpenAI, Anthropic, DeepSeek, or any of the 12 supported). Then open Data Studio from the Nav bar, click Add Source to attach a database connection, and type your request in plain English.

DocKit Settings LLMs demo

For quick questions without leaving your editor, use the Sidebar Assistant — click the chat icon (💬) in the sidebar.

Safety & Permissions

ModeHow tools runRisk levelRequires confirmation
Ask (default)Prompts Allow/Deny before every non-read toolSafe — read queries, list indices, describe tablesNever
AutoExecutes automatically when session has permissionsElevated — create/update documents, insert itemsOnly in Ask mode
Destructive — delete documents, drop indices, delete tablesAlways

You can also set read/create/update/delete permissions per database source, or inherit from the session level. Confirmation rules let you auto-allow safe or repetitive operations, or auto-deny dangerous tools entirely.

Agent Loop

The agent runs in a loop: it thinks, calls tools, checks the results, and decides what to do next. Three settings control how long it runs: max iterations (default 200) limits the number of tool-call rounds, wall-clock budget (default 30 minutes) caps elapsed time, and token budget (default 1,000,000) limits cumulative input tokens. If any budget runs out, the agent stops with an explanation and offers to continue.

Long conversations get compacted automatically — old messages are summarized into a compact form that keeps the important bits while cutting token usage. Compaction markers show up in the chat timeline so you can see what got rolled up.

The agent shows its current phase as it works:

Preparing → Iterating → Waiting for model → Compacting → Done

Each tool call displays its name, arguments, runtime, and result. Expand any tool call to see the full output.

Source Management

Attach any configured database connection as a Data Studio source. The agent reads schemas, queries data, and writes changes based on the permissions you set. You can also attach local files as read-only sources — CSV, Excel, JSON, and Parquet. The agent can inspect and query file contents but can't modify them.

Session Management

Each Data Studio conversation is a session with its own history, attached sources, and settings. Browse and switch between past sessions from the history panel, start a clean conversation with a new session, detach a source without deleting the underlying connection, or pick a different model per session.

The Sidebar Assistant is a lightweight chat panel accessible from anywhere in the app. It is separate from Data Studio — it works as a single-turn Q&A interface rather than a multi-step agent loop.

Use it to ask questions about your database schema or query results, generate queries using your schema as context, get explanations of query behavior and error messages, or debug without leaving your current view.

FeatureSidebar AssistantData Studio
InteractionSingle Q&A turnsMulti-step agent loop
Tool executionNo — text responses onlyYes — reads schemas, runs queries, modifies data
ContextCurrent conversation onlyFull session with history and compaction
Source attachmentNoYes — attach multiple databases per session
Best forQuick questions, explanationsComplex data operations and analysis

Open it by clicking the chat icon (💬) in the right sidebar, or press the keyboard shortcut. The Assistant shares the same provider and model configuration as Data Studio. You can configure a separate model for it in Settings → AI → Models if you want a lighter model for quick Q&A while keeping a more capable model for Data Studio. Multi-chat sessions are supported with automatic provider isolation, and you can delete individual chat sessions from the history panel.

Providers & Models

DocKit supports 12+ AI provider types — cloud and local: OpenAI, Anthropic, DeepSeek, OpenRouter (200+ models), Google Gemini, Grok, Mistral, Azure OpenAI, Ollama (fully offline), LM Studio (fully offline), custom OpenAI-compatible endpoints, and custom Anthropic-compatible endpoints.

To add a provider, go to Settings → Providers and click Add Provider. Pick a type, give it a display name, enter the API endpoint (pre-filled for standard providers), paste your API key or token, and optionally configure an HTTP proxy URL. Click Test Connection to verify, then Save.

DocKit discovers available models automatically. You can route them to specific features — pick one model for the Sidebar Assistant (quick Q&A) and another for Data Studio (full agent loop). If you don't set a preference, DocKit picks the best available model.

Getting an API key:

If you're behind a firewall, enter a proxy URL for each provider in Settings → Providers. Format: http://host:port. Or choose Use system proxy to inherit OS-level settings.

Privacy & Security

What the agent sends to AI providers:

  • ✅ Your text prompt
  • ✅ Current index, table, and collection names
  • ✅ Schema and mapping (field names and types)
  • ✅ Query results (truncated to about 1,024 characters, enough context for the agent to work with)
  • ❌ Full query results (stored locally in DocKit's database, visible in the UI)
  • ❌ Credentials (API keys, passwords, connection secrets never leave your machine)

How tool results flow to the LLM: When the agent runs a tool like es__search or mongo__find, the full result stays local. Only a short summary (around 1,024 characters) goes back to the LLM as context for its next step. Enough to work with, but not your full dataset.

Credential safety: Connection secrets live on your machine and get resolved in the Rust backend. The agent never sees them — it passes a connection_id (something like "42"), and the backend maps that to the real config before running the tool. Passwords, API keys, and AWS secrets never make it into LLM prompts.

No telemetry: DocKit doesn't phone home. No query data, credentials, or analytics leave your machine. Works fully offline with local providers like Ollama and LM Studio.

Troubleshooting

API Key Issues

If you see Invalid API key or Unauthorized, make sure the key has no extra spaces or line breaks, check it hasn't expired, verify you have enough credits or quota, and try generating a new key.

Connection Failed

For Network error or Timeout, check your internet connection, set up a proxy if you're behind a firewall, verify the API endpoint is reachable, and for local providers like Ollama or LM Studio, make sure the service is running.

Agent Stops Early

If the agent stops with an iteration cap, wall clock, or token budget message, raise the relevant budget in Settings → AI → Chat settings, simplify your request so it needs fewer tool calls, or start a new session to reset the context.

Poor Quality Responses

Try a different model — Claude for reasoning tasks, GPT-4o for general use. Be more specific in your request by including field names and index names. Break complex operations into smaller steps.

Next Steps