Environment Variables¶

Configuration lives in .env (uv path) or data/config.yaml (Docker / web dashboard). Both map to the same environment variables described here.

on/off vs true/false

Most variables use on/off. The GEMINI_* variables use true/false because they are passed directly to the Google SDK.

Mixing them up silently disables the feature — ENABLE_DUCKDUCKGO=true will not report an error but the tool will not be registered.

Each variable in the tables below shows its accepted values.

Required¶

Variable	Description
`DISCORD_BOT_TOKEN`	Your Discord bot token
`FAST_MODEL`	Model in `provider:model_id` format (e.g. `google:gemini-2.5-flash`)
`FAST_API_KEY`	API key for the fast model's provider. For local servers (Ollama, LM Studio) that require no authentication, set this to any non-empty string (e.g. `local`).

uv path only¶

Variable	Description
`CHAT_SYS_PROMPT_PATH`	Path to the system prompt file (e.g. `config/chat_sys_prompt.txt`). Docker users configure the system prompt through the web dashboard instead.

Dual-model routing¶

Variable	Default	Description
`DEEP_MODEL`	(off)	Second model in `provider:model_id` format. Leave blank to disable routing.
`DEEP_API_KEY`	same as `FAST_API_KEY`	API key for the deep model (only needed for a different provider)
`AUTO_ROUTE`	`off`	`on`/`off` — send complex messages to `DEEP_MODEL` automatically
`FALLBACK_ON_ERROR`	`off`	`on`/`off` — fall back to the other model when one returns an error

See Model Providers & Routing for details on how routing works.

Custom endpoints¶

Useful for local inference servers (Ollama, LM Studio, vLLM) or API gateways.

Variable	Description
`FAST_BASE_URL`	Custom endpoint for the fast model
`DEEP_BASE_URL`	Custom endpoint for the deep model

When the bot runs in Docker and the model server runs on the host, use host.docker.internal instead of localhost:

Server	Model string	Base URL
Ollama	`ollama:llama3.2`	`http://host.docker.internal:11434`
LM Studio	`lmstudio:model-name`	`http://host.docker.internal:1234/v1`
vLLM	`vllm:model-name`	`http://host.docker.internal:8000/v1`
Any OpenAI-compatible	`openai-chat:model-name`	`http://host.docker.internal:<port>/v1`

Bot behaviour¶

Variable	Default	Description
`ENABLE_CONTEXTUAL_SYSTEM_PROMPT`	`on`	`on`/`off` — inject user display names and current time into the system prompt
`CONTEXT_TOKEN_BUDGET`	`0`	Max input tokens per request; oldest messages are dropped when exceeded. `0` = no limit.

Web search & browsing¶

Variable	Default	Description
`ENABLE_DUCKDUCKGO`	`off`	`on`/`off` — free DuckDuckGo search, works with any provider
`ENABLE_BRAVE_SEARCH`	`off`	`on`/`off` — web search via the Brave Search API (requires `BRAVE_API_KEY`)
`BRAVE_API_KEY`	—	Brave Search API key; free tier at brave.com/search/api
`ENABLE_WEBSITE_TOOLS`	`off`	`on`/`off` — let the bot fetch and read URLs from the conversation

Workspace¶

Variable	Default	Description
`ENABLE_WORKSPACE`	`off`	`on`/`off` — give the bot read access to a local folder
`WORKSPACE_ROOT`	`workspace/`	Root folder the bot can access
`WORKSPACE_SYS_PROMPT_PATH`	`config/workspace_sys_prompt.txt`	Where to store the generated workspace context

See Tools for more on workspace behaviour.

Custom tools¶

Variable	Default	Description
`ENABLE_CUSTOM_APIS`	`off`	`on`/`off` — enable HTTP API tools defined in `CUSTOM_APIS_JSON`
`CUSTOM_APIS_JSON`	`[]`	JSON array of REST API configs (`name`, `base_url`, `api_key`, `description`). Must be a single line — see Tools
`ENABLE_SQL_DATABASES`	`off`	`on`/`off` — enable SQL tools defined in `SQL_DATABASES_JSON`
`SQL_DATABASES_JSON`	`[]`	JSON array of database configs (`name`, `db_url`, `description`)
`CUSTOM_DIR`	`custom`	Directory scanned for Python extension files (`*.py`) — see Custom Commands & Tools

For writing your own commands and agent tools in Python (no env var needed — just drop files into custom/), see Custom Commands & Tools.

Google / Gemini-specific¶

These apply to google: models only. Ignored for all other providers.

Search & grounding¶

Variable	Default	Description
`GEMINI_SEARCH`	`true`	`true`/`false` — enable Google Search grounding
`GEMINI_GROUNDING_THRESHOLD`	model default	Apply grounding only when confidence is below this value (0.0–1.0)
`GEMINI_URL_CONTEXT`	`false`	`true`/`false` — let the model fetch URLs mentioned in the conversation (Gemini only; not `gemma-*`)

Thinking (Gemini 2.5+ / Gemma 4)¶

Variable	Default	Description
`GEMINI_THINKING_BUDGET`	model default	Token budget for reasoning; `0` disables thinking
`GEMINI_THINKING_LEVEL`	model default	`low` or `high`

Per-model overrides¶

Any GEMINI_* variable can be overridden for just one model using the FAST_ or DEEP_ prefix. CONTEXT_TOKEN_BUDGET (defined under Bot behaviour above) also supports per-model overrides:

Shared default	Fast override	Deep override
`GEMINI_SEARCH`	`FAST_SEARCH`	`DEEP_SEARCH`
`GEMINI_GROUNDING_THRESHOLD`	`FAST_GROUNDING_THRESHOLD`	`DEEP_GROUNDING_THRESHOLD`
`GEMINI_URL_CONTEXT`	`FAST_URL_CONTEXT`	`DEEP_URL_CONTEXT`
`GEMINI_THINKING_BUDGET`	`FAST_THINKING_BUDGET`	`DEEP_THINKING_BUDGET`
`GEMINI_THINKING_LEVEL`	`FAST_THINKING_LEVEL`	`DEEP_THINKING_LEVEL`
`CONTEXT_TOKEN_BUDGET`	`FAST_CONTEXT_TOKEN_BUDGET`	`DEEP_CONTEXT_TOKEN_BUDGET`