AIdex

BRAIN.md — AIdex Project Context

Read this file first, before touching code. It exists so you (or an AI assistant) can get fully oriented and start making changes without re-reading every source file. Keep it updated when you make structural changes — stale context here is worse than no context.

Project: AIdex AI Coding Agent (formerly “Nexus”) — a CLI/TUI/Web AI coding agent, by Zelvior, Apache 2.0 licensed. Current version: 1.5.1 Language: Python, stdlib-first, optional deps only for the enhanced UI. Core philosophy: maximum compatibility (Windows XP 32-bit → Windows 11, Linux, macOS) + zero hard dependencies + graceful degradation everywhere.

1. What this project actually is

A single coherent Python backend (src/core/agent.py + src/core/config.py

src/providers/base.py + src/tools/file_tools.py) exposed through three different front ends that all share the exact same engine:

Rich TUI (src/tui/app.py) — the “full” terminal experience, needs rich + prompt_toolkit, Python 3.7+.
Plain TUI (src/tui/plain.py) — zero-dependency fallback, pure stdlib (input()/print()), Python 2.7+, works on Windows XP / 32-bit.
Web UI (src/web/server.py + src/web/static/) — browser-based, served by a zero-dependency stdlib http.server, no Flask/Django/etc.

Critical rule when adding a feature: if it’s user-facing (a new command, a new tool, a new setting), it generally needs to be wired into all three front ends, or explicitly decided to be web-only / CLI-only with a comment explaining why. They are not automatically in sync — they each have their own command-dispatch code that calls into the shared core.

Entry point: aidex.py (auto-detects which UI to launch; --plain/--full/ --web to force one). nexus.py is a legacy alias that just forwards to aidex.py — keep it working, don’t add new logic to it.

2. Directory map

aidex.py                  Main entry point. Auto-detects UI: web > full > plain.
nexus.py                  Legacy alias → forwards to aidex.py. Don't add logic here.
install.py                Universal installer. Never hard-exits on old Python.
setup.py                  pip/setuptools metadata.
requirements.txt          OPTIONAL deps only (rich, prompt_toolkit). Web UI needs none.
CHANGELOG.md              Keep-a-Changelog format. Add an entry per logical change.
README.md                 User-facing docs. Update when adding features/commands.
LICENSE                   Apache 2.0.

src/
  core/
    agent.py        THE agent. Singleton `agent = Agent()`. chat_stream() is
                     the main loop — native tool calling with multi-turn
                     tool-result loop, regex fallback for non-native models.
                     Also: _path_within() helper and the per-session
                     confine_to_workspace/excluded_tools gating (§7).
    config.py       Singleton `config = Config()`. All settings, API keys,
                     provider registry (PROVIDERS + IMAGE_PROVIDERS dicts),
                     live-model wrapper, image-generation wrapper.
    models.py       Live model fetching (OpenRouter/Groq/Anthropic/Ollama/
                     Gemini/Pollinations/custom), disk caching with TTL,
                     fallback chain: live → cache → stale-cache → static
                     built-in list. Never raises to the caller in practice
                     (config.get_live_models wraps it).
    imagegen.py     Image generation — free Pollinations by default (no
                     key), or a custom OpenAI-images-JSON-style endpoint.
                     See §6.
    ralph.py        Autonomous task-loop orchestrator, modeled on the real
                     ralph-tui project's task schema (structured IDs like
                     US-001/T-001, description, acceptance_criteria[],
                     priority P0-P3, task_type story/task). RalphState
                     (persisted JSON task list, priority-ordered selection)
                     + RalphRunner (select → prompt → execute → detect →
                     repeat). UI-agnostic — drives via callbacks, not
                     printing. Pure stdlib, Python 2.7-compatible (no
                     f-strings) so it runs identically under the plain TUI
                     on XP/32-bit. Old task files (just id/title/status)
                     still load fine — new fields default sensibly. See
                     the dedicated Ralph section below.
  providers/
    base.py         AIProvider (base, stdlib urllib only) →
                     OpenAICompatProvider (OpenRouter/Groq/OpenAI/Ollama/
                     Gemini/Pollinations/custom — all just OpenAI-compatible
                     endpoints, no new provider class needed) and
                     AnthropicProvider (native Anthropic API). Native
                     function-calling support, streaming + non-streaming.
  tools/
    file_tools.py   ALL 34 tools live here as plain functions, each
                     returning ToolResult(success, output, error). Also
                     TOOL_DEFINITIONS (compact param-spec format used to
                     generate help text) and the schema builders
                     (build_openai_tools_schema / build_anthropic_tools_schema)
                     that convert TOOL_DEFINITIONS into real JSON Schema for
                     native function calling.
  tui/
    app.py          Rich/prompt_toolkit UI. Class AIdexApp. Has its own
                     COMMANDS dict + big if/elif dispatch in _handle_command.
                     Owns a RalphState instance (self.ralph_state).
    plain.py        Stdlib-only mirror of app.py's functionality. Keep
                     feature parity — when you add a command to app.py,
                     add the equivalent here too. Also owns a RalphState.
  web/
    server.py       Handler(BaseHTTPRequestHandler) + run_web_server().
                     REST + SSE bridge to the same agent/config singletons.
                     Workspace confinement + shell-tool gating live here
                     (see Security section below — this is the ONE place
                     with a different trust boundary than the CLI). Module-
                     level `_ralph_state`/`_ralph_runner`/`_ralph_lock`
                     singletons (NOT per-Handler-instance — every HTTP
                     request gets a fresh Handler object, so anything that
                     must persist across requests, like an in-progress
                     Ralph run, has to live at module scope).
    static/
      index.html    Single-page app shell, semantic HTML5, 5 panels
                     (chat/files/models/ralph/settings) shown/hidden via CSS.
      style.css     Modern CSS: @layer, color-mix(), CSS custom properties.
                     Dark theme, amber accent (--accent: #f5a623).
      app.js        Vanilla ES module. No build step, no framework. Hand-
                     rolled SSE-over-fetch parser (EventSource doesn't
                     support POST, so this isn't a real EventSource). Same
                     parser pattern reused for both /api/chat and
                     /api/ralph/run.

3. The shared engine — how a chat turn actually works

Agent.chat_stream(user_input, excluded_tools=None) in src/core/agent.py:

Appends user message to self.messages.
Builds system prompt via _build_system() — uses plain .replace(), NOT .format(). This matters: the prompt’s fallback example contains literal {/} JSON, and .format() will crash on that. This was a real bug that broke every single message in the original codebase before this was fixed. Never change _build_system() back to .format().
Gets the provider via _get_provider() (reads config.provider, config.model, the right API key, request_timeout, max_retries).
Checks provider.supports_native_tools (True for OpenAICompatProvider and AnthropicProvider, False for nothing currently — both real providers support it).
Builds the tool schema (build_openai_tools_schema() or build_anthropic_tools_schema() from file_tools.py), filtered by excluded_tools if given.
Runs a loop, up to config.max_tool_turns (default 25 — deliberately generous; this used to be a hardcoded 8 and that was a real, reported bug: any task needing more than 8 tool calls before a final answer aborted with a confusing “possible loop” error even when nothing was actually looping). Genuine stuck-loop protection is separate and much more precise: if the same tool is called with the exact same parameters 3 times in a row (REPEAT_THRESHOLD in agent.py), that’s treated as a real loop and stops immediately with a specific message naming the tool — legitimate multi-step work calls different tools or different params each time even across many turns, so this never fires on real work. Don’t lower max_tool_turns as a substitute for real loop detection, and don’t raise REPEAT_THRESHOLD much above 3 — 3 identical calls in a row is already a very confident loop signal.
- Streams from the provider. Provider yields typed dict chunks: {"type": "text", "text": ...} or {"type": "tool_calls", "tool_calls": [...]}.
- If no tool calls came back natively, falls back to legacy regex parsing of <tool_call>{...}</tool_call> text blocks (for models that ignore the tools= param and just answer in text).
- If there ARE tool calls: executes each via _execute_tool(), appends the assistant’s tool-call message AND the tool results back into self.messages in the correct provider-specific shape (OpenAI: role: "tool" messages with tool_call_id; Anthropic: content blocks with type: "tool_result"), then loops again so the model can react to the result.
- If no tool calls at all: appends final assistant text, yields ("done", ""), returns.
Yields a stream of ("text"|"tool_call"|"tool_result"|"error"|"done", content) tuples — this is the stable external contract all three UIs consume identically (TUIs print it, the web server turns it into SSE events).

normalize_message_content(msg) (module-level function in agent.py) — use this anywhere you need to display a message from agent.messages/history. Message content can now be None (OpenAI tool-call message), a list of content blocks (Anthropic), or a plain string. Never assume it’s a string.

4. Tools — two read paths, don’t mix them up

There are two different “read a file” functions and two different “list a directory” functions, and mixing them up caused a real, already-fixed bug (corrupted files when saved through the web editor):

read_file(path, workspace) / list_directory(path, workspace) — AI/terminal-display formatted. Adds [File: x] (N lines) headers and ` 1 | ` line-number prefixes, and list_directory returns a 2-level recursive emoji tree. Never feed this into something that gets saved back as raw file content.
read_file_raw(path, workspace) / list_directory_flat(path, workspace) — raw/structured, for UI consumption. read_file_raw returns exact file bytes-as-text, no formatting. list_directory_flat returns (ToolResult, [{"name", "is_dir", "size"?}, ...]) — single level, no tree, made for clickable navigation. The web UI uses these exclusively.

All 33 tools are registered in TOOL_DEFINITIONS (file_tools.py) with a compact param spec: {"name": "str - description, default: X"}. The default: substring in a param’s description is what marks it optional when building JSON Schema (_parse_param_spec in file_tools.py) — keep that convention if you add params, or the schema’s required list will be wrong.

Tool list (33): read_file, write_file, edit_file, patch_lines, append_file, delete_file, move_file, copy_file, create_directory, list_directory, search_files, grep_file, run_command, run_python, git_status, git_diff, git_log, git_add, git_commit, git_init, git_branch, git_checkout, analyze_project, get_file_info, read_file_lines, head_file, tail_file, count_lines, disk_usage, env_info, find_replace_in_files, which, list_models.

run_command and run_python are the only tools that touch safe_mode (blocks a hardcoded list of dangerous command substrings when True, default True). run_python writes code to a temp file and shells out to {sys.executable} tmpfile.py — it respects safe_mode via the same path as run_command (this was a real bug, fixed: it used to ignore the config’s actual safe_mode value).

5. Live model system (`src/core/models.py`)

config.get_live_models(provider=None, force_refresh=False, timeout=None) → (List[ModelInfo], source_str). source_str is one of: "live" (fresh API hit), "cache" (fresh disk cache, < 6h old), "stale-cache" (cache expired AND live fetch failed, served anyway), "static-fallback" (no cache, no network — falls back to the hardcoded PROVIDERS[x]["free_models"]/["paid_models"] lists in config.py).

Cache lives at <config_dir>/models_cache/models_<provider>.json, TTL 6h (CACHE_TTL_SECONDS in models.py).

Per-provider fetch logic in fetch_live_models():

OpenRouter: hits GET {base_url}/models, parses pricing.prompt/ completion (USD per token → converted to per-1M-token for display), :free suffix or zero pricing = free, supported_parameters containing "tools" = native-tool-call capable.
Groq/OpenAI/Ollama: same endpoint shape, minimal schema (just id, sometimes context_window), no pricing info available.
Anthropic: GET {base_url}/models with x-api-key header (not Bearer), returns id+display_name, no pricing.

ModelInfo has .price_label() ("FREE" / "$X.XX/$Y.YY per 1M" / "?") and .context_label() ("128K" / "?") — use these for display, don’t reformat manually.

filter_models() / sort_models() / cache_age_label() in models.py are shared helpers used identically by app.py, plain.py, and server.py’s /api/models — if you change filter/sort behavior, change it once here.

6. Config (`src/core/config.py`)

Singleton config. Config dir: ~/.config/aidex/ (Linux/XDG), %APPDATA%\aidex\ (Windows), ~/Library/Application Support/aidex/ (macOS). Auto-migrates from the old nexus-agent config dir on first load if no aidex config exists yet — don’t remove _LEGACY_CONFIG_FILE handling in load().

Default provider is pollinations, not OpenRouter — this is the zero-config story: a fresh install can chat AND generate images with no key, no signup, immediately. Don’t “fix” this back to a key-requiring default without good reason; it’s intentional.

Key methods: config.get(key, default), config.set(key, value) (saves to disk immediately), config.get_api_key(provider=None) (returns the literal stored key, or the sentinel "not-needed" for key-optional providers — don’t truthy-check this directly, see below), config.needs_api_key(provider=None) / config.has_usable_key(provider=None) (use THESE for any “is this provider ready to use?” check — has_usable_key is what every UI status display should call), config.as_dict() (public, use this instead of touching _data directly from outside the class), config.get_provider_info(provider=None), config.all_models(provider) (static fallback list only), config.get_live_models(...) (the real one, see §5), config.models_cache_age(provider=None).

A real bug that shipped and got fixed: several places (agent.py’s _get_provider, both TUIs’ status displays, the web server’s /api/status) used to do if config.get_api_key() to mean “is a key configured” — broke as soon as a key-optional provider’s sentinel value made that always truthy. Fixed by adding needs_api_key/has_usable_key and switching every call site. If you add a new “is provider ready” check anywhere, use has_usable_key(), never raw get_api_key() truthiness.

DEFAULT_CONFIG keys you’ll actually touch: provider, model, *_api_key (one per chat provider, see PROVIDERS below), ollama_base_url, custom_base_url/custom_chat_model, workspace, safe_mode, stream, max_tokens, temperature, request_timeout, max_retries, web_allow_shell_tools (web-UI-specific, see §7, default False), image_provider/image_model/image_api_key/image_base_url (image generation, configured independently of chat — see below).

PROVIDERS dict (chat) — 17 entries: openrouter, groq, cerebras, mistral, cohere, github, nvidia, huggingface, zhipu, llm7, kluster, anthropic, openai, ollama, pollinations, gemini, custom. Each has name, base_url, key_field, free_models/ paid_models (static fallback lists — these go stale, that’s expected, that’s why §5 exists), notes (one-line human-friendly description shown in every UI’s provider picker), signup_url (where to get a key — also shown in every UI), and optionally requires_key: False (currently ollama, pollinations, llm7 — work with zero configuration). ollama’s and custom’s base_url are overridden per-instance from config.get("ollama_base_url") / config.get("custom_base_url") in get_provider_info() since those are user-supplied endpoints. All 13 of the newer providers are plain OpenAI-compatible endpoints — no new provider class was needed in base.py, they just work via the existing OpenAICompatProvider + create_provider()’s generic else branch.

Verify before trusting a base_url from memory or a search result. Two of the providers added in this pass had base URLs that were confidently wrong on the first pass and only caught by cross-checking multiple independent current sources: GitHub Models’ old models.inference.ai.azure.com endpoint was fully decommissioned (October 2025) — the correct one is models.github.ai/inference with org/model-name-style IDs; Hugging Face’s older api-inference.huggingface.co is likewise stale — the correct router is router.huggingface.co/v1. If you add another provider, verify its current base URL against its own current docs, not a single search snippet — provider endpoints get migrated/deprecated more often than this codebase gets touched.

The UI-friendly grouping used by all three front ends’ provider pickers — “no signup needed” / “free tier, quick signup” / “paid / custom” — is computed the same way everywhere: not requires_key first, then requires_key and free_models truthy, then everything else. If you add a provider, get requires_key and free_models right and the grouping takes care of itself; don’t hardcode a provider into a UI-specific list.

Image generation (`src/core/imagegen.py`)

Configured independently from the chat provider — you can chat via OpenRouter and still generate images via the free Pollinations default without switching anything. IMAGE_PROVIDERS dict (separate registry from chat PROVIDERS): pollinations (free, no key, GET-prompt-in-URL shape — image.pollinations.ai/prompt/{prompt}?model=&width=&height=&seed=, response body IS the raw image bytes) and custom (any other image API; generate_image() in imagegen.py picks the OpenAI-images-JSON shape instead of the GET shape if the configured base URL ends in /v1 or /openai).

Entry point: config.generate_image(prompt, width, height, seed) → returns an ImageResult (.data raw bytes, .content_type, .suggested_filename()). The generate_image tool in file_tools.py wraps this and saves to a file — it’s a normal tool, registered in TOOL_DEFINITIONS like everything else, so it’s automatically available to native function-calling with zero extra schema code, and reachable via /image <prompt> in both TUIs and POST /api/image on the web (which returns the image as base64 JSON, with an optional save_as to also write it to a workspace-confined path).

7. Web UI security model — read before touching server.py

The web UI is a different trust boundary than the CLI/TUI. A developer running the terminal app already has shell access to their own machine, so the CLI tools are intentionally unrestricted. A browser-reachable HTTP endpoint is not the same thing, even on localhost. Two real already-fixed vulnerabilities inform the current design — do not regress them:

Workspace confinement — two layers:
- Direct endpoints: _is_within_workspace() in server.py. Every filesystem-touching web endpoint (/api/fs/list, /api/fs/read, /api/fs/write, /api/image via save_as, and path-like params in /api/tool) checks the resolved path stays inside config.workspace. Absolute paths and ../ traversal outside the workspace get 403.
- Chat/tool-execution path: Agent._execute_tool() has a second, independent check via self._confine_to_workspace (set when chat_stream(..., confine_to_workspace=True) is called) using the _path_within() helper in agent.py. This exists because the AI can call ANY tool with ANY params during a chat turn — a per-endpoint check alone doesn’t help when the model itself decides to pass an absolute output_path to generate_image (or any future file-writing tool) mid-conversation. The web server’s _api_chat_stream() always passes confine_to_workspace=True; agent.run_tool() (used by the CLI/TUI directly) does NOT set this, preserving the CLI’s intentionally permissive behavior.
- The underlying _resolve() in file_tools.py does NOT enforce either of these on its own (permissive by design for the CLI) — confinement is the web layer’s job, at both the endpoint level AND the tool-execution level. If you add a new file-writing tool, check whether its path-like params are named path/file/directory/ output_path (those are what both checks look for) — if you use a different param name, add it to both checks.
Shell-tool gating: run_command/run_python are excluded from the tool schema offered to the model during web chat by default (_SHELL_TOOLS set in server.py, checked against config.get("web_allow_shell_tools", False)), AND defense-in-depth blocked again inside Agent._execute_tool() itself via self._excluded_tools — so even a non-native model trying the legacy regex tool-call fallback can’t sneak through. If you add a new dangerous tool, add it to _SHELL_TOOLS too.

If you add a new web endpoint OR a new file-writing tool: route it through the relevant check(s) above. Don’t assume “it’s just localhost” — two separate variants of this mistake already shipped in this project (one in a direct endpoint, one via a tool’s parameter reachable only through chat), both caught in testing and fixed. The lesson: checking the direct endpoint is not enough if the same write path is also reachable by the AI deciding to call a tool with attacker-influenced parameters.

Separately real bug, also fixed: the file editor used to load read_file()’s display-formatted output into the editable textarea, then save that formatted text straight back to disk on Save, corrupting every file edited through the web UI. Fixed by adding read_file_raw()/ list_directory_flat() (§4) specifically for UI use. If you add another UI surface that reads-then-writes a file, use the raw variants.

Ralph (autonomous loop) gets the same gating, deliberately on by default with no opt-out in the UI: _api_ralph_run_stream() always constructs its RalphRunner with confine_to_workspace=True and the same _SHELL_TOOLS exclusion as /api/chat. An unattended multi-task loop calling tools with no human watching each individual step is, if anything, a stronger case for these protections than a single chat turn a person is actively reading — don’t relax this for Ralph specifically just because it “feels like a power-user feature.”

8. Web API reference (`src/web/server.py`)

All endpoints relative to wherever run_web_server() binds (default 127.0.0.1, auto-picks a free port from 8420 up, or pass --port).

Method	Path	Notes
GET	`/`	index.html
GET	`/static/<file>`	css/js, path-traversal-safe via `_safe_static_path`
GET	`/api/status`	provider/model/workspace/safe_mode/has_key
GET	`/api/models?provider=&refresh=1`	live model list, see §5
POST	`/api/models/switch`	`{model}`
POST	`/api/provider/switch`	`{provider}` — auto-picks a free live model
GET/POST	`/api/config`	GET masks secrets via `_mask_key`; POST only allows a fixed safe field set + provider key fields
GET	`/api/history`	uses `normalize_message_content`
POST	`/api/history/clear`
GET	`/api/tools`	raw `TOOL_DEFINITIONS`
POST	`/api/tool`	`{tool, params}` — direct tool execution, gated per §7
GET	`/api/fs/list?path=`	`list_directory_flat`, confined per §7
GET	`/api/fs/read?path=`	`read_file_raw`, confined per §7
POST	`/api/fs/write`	`{path, content}`, confined per §7
GET	`/api/providers`	full chat-provider metadata (name, notes, signup_url, requires_key) — drives the web Settings panel’s grouped key list
GET	`/api/image-providers`	available image backends, key-requirement flags. Cloudflare needs a second field (`cloudflare_account_id`) the others don’t — handled as a special case in both the wizard and the web Settings panel, not a generic pattern
POST	`/api/image`	`{prompt, width?, height?, seed?, save_as?}` — generates an image, returns base64 inline; `save_as` confined per §7
GET	`/api/ralph`	task list + counts + `running`/`paused` state
POST	`/api/ralph/add`	`{title}`
POST	`/api/ralph/clear`	409 if a run is in progress
POST	`/api/ralph/stop`	cooperative — current task finishes, then the loop exits
POST	`/api/ralph/run`	SSE, `{max_iterations?}` → events: `task_start`, `text`, `tool_call`, `tool_result`, `error`, `task_done`, `finished`. 409 if already running (module-level `_ralph_runner` lock, see directory map)
POST	`/api/chat`	SSE, `{message}` → events: `text`, `tool_call`, `tool_result`, `error`, `done`

Frontend (app.js) parses both SSE endpoints manually via fetch().body.getReader() + a \n\n-delimited buffer parser — NOT the browser’s native EventSource, because EventSource can’t do POST bodies. If you change the SSE event format on the server, update the matching parser in app.js (sendChat() for chat, the ralphRunBtn click handler for Ralph).

Every POST handler MUST drain the request body, even if it doesn’t use it. This server runs HTTP/1.1 with persistent connections. If a handler sends its response without calling self._read_json_body() first, the unread bytes of that request’s body sit in the socket’s read buffer and get prepended onto the next request line the browser sends on the same reused connection — corrupting it into a 501 Unsupported method. This is a real bug that shipped (/api/ralph/clear, /api/ralph/stop, and the inline /api/history/clear branch all skipped it) and was only caught by testing in an actual browser with real connection reuse — curl each request in isolation, or even curl --next, did not reproduce it; only a genuinely reused http.client/browser connection did. If you add a new POST handler, call self._read_json_body() as the very first thing, even if the result is unused — don’t assume “this endpoint takes no input” means it’s safe to skip.

A related but distinct issue: any handler that streams an unbounded response (SSE) without Content-Length or chunked encoding must send Connection: close and set self.close_connection = True — claiming keep-alive on a response whose end the client can only detect by the connection closing is contradictory framing. Both /api/chat and /api/ralph/run do this correctly now; if you add a third SSE endpoint, copy that pattern, not a _send_json-style response’s headers.

9. Compatibility matrix (don’t break these)

Plain TUI (plain.py): must stay importable and runnable on Python 2.7+. No f-strings, no walrus operator, no pathlib reliance for the core loop, no type hints that aren’t string-quoted/from __future__. It’s verified via ast.parse + checking for JoinedStr/NamedExpr nodes — don’t reintroduce those.
Web UI: needs Python 3.6+ (uses f-strings in server.py) for the backend; ThreadingHTTPServer needs 3.7+ but server.py has a manual socketserver.ThreadingMixIn fallback for 3.6. aidex.py --web checks the version and prints a clear error + suggests --plain below 3.6, rather than a cryptic traceback.
Rich TUI: needs Python 3.7+ (that’s what rich/prompt_toolkit themselves require). aidex.py’s auto-detect tries this first, falls back to plain on import failure OR on any runtime exception from AIdexApp().run() (wrapped in try/except, never just dies).
aidex.py is always the real entry point. nexus.py just imports and calls aidex.main(). Never duplicate launch logic into nexus.py.

10. Known design decisions (so you don’t “fix” them by accident)

_execute_tool dispatch in agent.py is a big dict of lambdas keyed by tool name, built fresh on every call (cheap, fine, not a bug).
The legacy <tool_call>{json}</tool_call> text-parsing fallback in agent.py is intentional — some free/small models on OpenRouter ignore the tools= parameter entirely and just emit text. Don’t remove it thinking native tool calling makes it obsolete.
test_connection() on providers tries the free /models listing endpoint first before falling back to a real (paid) chat completion — this is deliberate cost-avoidance, not an oversight.
Static free_models/paid_models lists in PROVIDERS (config.py) are known to go stale — that’s expected, they’re only the last-resort fallback when live fetch AND cache both fail. Don’t “fix” them by trying to keep them current; fix the live-fetch path instead if it’s broken.
The web UI defaults web_allow_shell_tools to False and there’s currently no Settings-panel toggle for it in the UI — that’s deliberate, so enabling shell exec from the browser requires editing the config file directly, not one accidental click.
Default chat provider is pollinations, not OpenRouter — this is the zero-config story (§6). Don’t “fix” this back to a key-requiring default.
max_tool_turns defaults to 25, deliberately generous, NOT a tight loop-prevention budget — that job belongs to the separate identical-call-repeated-3x detection (§3). A previous hardcoded value of 8 was a real bug (legitimate multi-step tasks aborted with a “possible loop” error). Don’t lower this thinking it adds safety; it doesn’t catch loops any better than the repeat-detection already does, it just breaks longer legitimate tasks again.
Image generation (config.image_provider) is configured completely separately from chat (config.provider) — that’s intentional, not an oversight to “simplify.” A person should be able to use a paid/precise chat model while still using the free Pollinations image default, or vice versa.
RalphRunner is UI-agnostic by design (callbacks only, never prints or touches console/sys.stdout itself) specifically so the identical orchestration logic drives the plain TUI, the Rich TUI, and the web SSE endpoint without three different implementations to keep in sync. If you need Ralph to behave differently in one UI, do it in that UI’s callback functions, not by adding UI-specific branches inside ralph.py.

11. Where to look for X

“Why didn’t my tool call work” → agent.py chat_stream(), check provider.supports_native_tools and the schema builders in file_tools.py.
“Model list is empty/wrong” → models.py fetch_live_models() per provider, or config.get_live_models()’s fallback chain.
“Web UI file browser/editor acting weird” → check you’re using the _raw/_flat variants (§4), and the workspace-confinement check (§7).
“New slash command” → add to both app.py’s COMMANDS dict + dispatch, and plain.py’s equivalent. Update README’s command table.
“New chat provider” → add an entry to PROVIDERS in config.py with name/base_url/key_field/notes/signup_url and (if free) free_models; verify the base_url against the provider’s current docs, not memory or one search result (§6 has two real examples of this going wrong). No other code changes needed for an OpenAI-compatible provider — base.py’s factory and models.py’s fetch both already fall through to the generic branch for anything that isn’t openrouter/anthropic. Add a default API-key config field to DEFAULT_CONFIG too.
“New image provider” → add an entry to IMAGE_PROVIDERS in config.py; if it needs more than one credential field (like Cloudflare’s account ID + token), that’s handled as an explicit special case in both the TUI wizards and the web Settings panel’s renderImageProviderExtra() — there’s no generic multi-field pattern, so a third such provider would need its own special case too.
“New tool” → add the function to file_tools.py, add to TOOL_DEFINITIONS, add a dispatch entry in agent.py’s _execute_tool. If it’s filesystem/shell-related, decide if it needs to go in _SHELL_TOOLS (server.py) for web gating, and whether its path-like params need names matching the confinement check’s list (path/file/directory/output_path) or whether you need to add a new name to that list (§7).
“New web POST endpoint” → call self._read_json_body() first, always, even if you don’t use the result (§8 — this exact omission shipped twice). If it streams an unbounded response, send Connection: close + self.close_connection = True, don’t claim keep-alive (§8).
“Image generation not working” → imagegen.py’s generate_image() dispatch (Pollinations GET-shape vs. custom OpenAI-images-JSON-shape), or config.generate_image()’s wiring of image_provider/image_model.
“Ralph task stuck / loop won’t continue” → check the task file (<workspace>/ralph_tasks.json) directly — status should be pending/in_progress/done/failed/skipped; a crash mid-task leaves it in_progress forever since nothing resets it back to pending automatically (by design — a human should look at why it got stuck rather than silently retrying).
“Ralph picked the wrong task” → RalphState.next_pending() selects by priority_rank() (P0 before P1 before P2 before P3), ties broken by list order — it is NOT FIFO by add time once priorities differ.
“Ralph dashboard layout/fields” → app.py’s _ralph_task_list_panel()/ _ralph_detail_panel() (Rich Layout split-pane, closely modeled on the real ralph-tui screenshot), plain.py’s show_ralph_status()/ show_ralph_task_detail() (ASCII/Unicode-adaptive equivalent), and the web’s renderRalphTasks()/renderRalphDetails() in app.js. All three read the same RalphTask fields (description, acceptance_criteria, priority, task_type) — if you add a new task field, wire it into all three, not just one.
“Something crashes on old Windows/XP/32-bit” → check it’s reachable via plain.py, not just app.py; check no f-strings/walrus snuck in (ralph.py must stay clean too — it’s imported by plain.py).

This site is open source. Improve this page.