odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-25 06:05:34 -04:00

Author	SHA1	Message	Date
pewdiepie-archdaemon	9adb940ef9	Agent stream: 10s heartbeat keepalive on the SSE subscribe so long-running thinking models dont drop the connection	2026-06-19 00:34:30 +00:00
pewdiepie-archdaemon	2fbfd22946	Agent loop: compact one-line tool-usage hints for local/small models so the system prompt doesnt eat the context budget	2026-06-19 00:34:24 +00:00
pewdiepie-archdaemon	a10bfc466b	Model endpoints: per-category probe timeouts (15s local / 3s ollama / 2s api) so slow first-token launches arent killed	2026-06-19 00:34:19 +00:00
pewdiepie-archdaemon	18f29bdfd8	Email send: normalize address fields to strip trailing commas + stray whitespace before MIME encoding	2026-06-19 00:34:13 +00:00
pewdiepie-archdaemon	63d9b12b22	Cookbook Running: short-circuit polls for Ollama sidecar tasks so status stays running Three different background loops (_reconnectTask reachability poll, _checkServeReachability, _pollBackgroundStatus) each independently flipped Ollama sidecar tasks between running and stopped because the `docker exec ollama-rocm ollama show <tag>` cmd exits cleanly after its verification print, which the loops misread as the serve dying. Added _isOllamaSidecarTask(task) and an early-bail in each of the three loops so the task stays pinned to running once the show-cmd exits 0. Also the tmux-graceful-kill path prepends a `docker exec ollama-rocm ollama stop <tag>` before tearing down the tmux session, so the Ollama-side model load gets unloaded too (was leaving the model resident in the daemon after Stop).	2026-06-19 00:33:48 +00:00
pewdiepie-archdaemon	ee6fd8ffe8	Cookbook UI: backend-aware env vars, always-show MoE/EP/Reasoning toggles, GPU default, Firefox-mobile expand Frontend half of the backend-detection + per-OS install command work, plus a pile of mobile/UX fixes: Backend awareness: - _gpuEnvPrefix() picks CUDA_VISIBLE_DEVICES / HIP_VISIBLE_DEVICES / nothing based on detected hwfit backend + scanned-host match (so a stale ajax scan does not leak CUDA env vars into a kierkegaard Vulkan launch). Replaces 6 hardcoded CUDA_VISIBLE_DEVICES sites. - GGML_CUDA_ENABLE_UNIFIED_MEMORY only emitted when backend is actually CUDA (was leaking onto Vulkan/ROCm via saved presets). Per-target install command: - Dep rows render a single mono command box + Copy button when the server resolved pkg.install_cmd_for_target. Reused in the build-deps install failure toast so the toast and the row show the same line. - Diagnosis patterns split cmake/g++/git out of the generic llama-cpp-python catch-all so a missing-cmake failure surfaces a cmake-specific message + per-distro Copy buttons. Form toggles always visible: - Reasoning Parser, Expert Parallel, MoE Env Vars no longer gated on model-family detection. Detection still hints (parser tag shown when matched); toggle works with sensible defaults otherwise. MiniMax M- series added to MoE family detector so the auto-fill is right. Mobile + GPU default: - Launch tab cached-list flex collapsed to 0px on mobile because the desktop `flex: 1 1 0` had no parent height to grow into. Override to `flex: 0 0 auto` in the cookbook mobile @media block. - doclib-card expand on mobile (Firefox no :has() support) pins explicit px heights so the launch form actually appears. - llama_mode defaults to gpu when hwfit detected cuda/rocm/vulkan/ metal on the current target, instead of always cpu (which was forcing -ngl 0 on first-open and burning 35GB models on CPU).	2026-06-19 00:33:37 +00:00
pewdiepie-archdaemon	f01465e87f	Cookbook Dependencies: per-OS+backend install command + install-system-deps endpoint When a llama.cpp launch needs cmake/build-essential/git the user used to get a four-distro dump ("apt: x / pacman: y / dnf: z / brew: w") and had to pick the right one. Now: - shell_routes /api/cookbook/packages probes /etc/os-release on the target in the same SSH round-trip as the existing system-prereq check, classifies into debian / arch / fedora / alpine / suse / macos, and builds a single install_cmd_for_target string from the (os_family, backend) matrix. CUDA hosts get nvidia-cuda-toolkit; ROCm gets rocm-dev / rocm-hip-sdk; Vulkan gets libvulkan-dev / vulkan-headers; etc. - llama_cpp catalog entry gets system_prereqs: [cmake, g++, git]. When any of those are missing on the target, the row picks up pkg.build_deps_missing + pkg.install_cmd_for_target for the frontend to render. - New POST /api/cookbook/install-system-deps endpoint runs the right package manager via passwordless sudo on the target. Allowlisted to {cmake, build-essential, g++, gcc, git, tmux, make}; sudo -n only so it can never hang waiting for a password (returns a clear "passwordless sudo unavailable" error via stderr instead).	2026-06-19 00:33:19 +00:00
pewdiepie-archdaemon	1324e1b0d5	Cookbook backend detection: report Vulkan on AMD hosts without ROCm; gate CUDA build on actual NVIDIA hardware Three classes of incorrect detection fixed: (1) AMD GPU + no ROCm installed (e.g. Strix Halo) was reported as backend=rocm everywhere, so launch commands emitted HIP_VISIBLE_DEVICES (silent no-op on Vulkan) and the from-source build path failed. Both _probe_amd_sysfs (routes/cookbook_routes) and _detect_amd (services/hwfit/hardware) now probe rocminfo / hipconfig / vulkaninfo at detection time and report vulkan when only Vulkan is present. (2) Build helper was picking the CUDA branch on AMD hosts whenever a stray pip-installed nvcc was on PATH (vLLM wheels carry one without libcudart). Added _odysseus_has_nvidia_hw() that checks nvidia-smi / /dev/nvidia* / lspci, and gates both the nvcc PATH augmentation and the CUDA elif branch on real hardware. (3) Build chain reordered to ROCm/HIP > CUDA > Vulkan > CPU. Vulkan tier added between CUDA and CPU as a portable fallback for hosts with a GPU but no native toolchain (the common Strix Halo case). Same _append_llama_cpp_linux_accel_build_lines also auto-attempts sudo -n apt/pacman/dnf install of cmake/build-essential/git when they are missing, surfacing a clear no-passwordless-sudo warning otherwise.	2026-06-19 00:33:07 +00:00
pewdiepie-archdaemon	b3e186746a	Docker compose: mount docker.sock + install Docker CLI so Cookbook can reach sibling containers Cookbook now needs to docker-exec into ollama-rocm (and any other sibling container holding a model server) from inside its own container, so: - Dockerfile installs the Docker CLI from the static binary tarball (the Debian docker.io package ships dockerd but not the client on slim) - docker-compose.yml bind-mounts /var/run/docker.sock and adds group_add for the host docker group (default GID 963) - entrypoint.sh detects the socket GID, creates a local group with that GID, and runs usermod -aG before gosu-dropping to the app user so the supplementary group propagates through (gosu strips by default)	2026-06-19 00:32:47 +00:00
Michael	39a802bea2	fix(tools): prune skipped dirs before descending in glob tool (#4538 ) * fix(tools): prune skipped dirs before descending in glob tool GlobTool used pathlib.Path.rglob which descends into every directory (including node_modules, .git, dist, etc.) and filters AFTER the walk. On repos with large junk directories this causes the glob tool to hang for minutes. Replace rglob with os.walk that prunes _CODENAV_SKIP_DIRS before descending — matching the approach GrepTool already uses. Also add a fast path for literal patterns (no wildcards → direct path lookup). Fixes #4493 * fix(tools): use regex glob matching to fix * semantics and literal fallback Replace fnmatch with _glob_to_regex so that * stays within a single path segment (matching pathlib/rglob semantics) and */ spans zero or more directories. Literal patterns now fall through to os.walk when the direct path lookup misses, so e.g. 'foo.py' still finds files at any depth. Add tests for: - bare literal matching in subdirectories - multi-segment single-star patterns (sub/.txt) - * not crossing / boundaries - ** matching at arbitrary depth Closes #4493 --------- Co-authored-by: michaelxer <michaelxer@users.noreply.github.com>	2026-06-18 22:02:29 +02:00
RaresKeY	1cc8a373b0	fix(cookbook): validate agent SSH targets (#4429 )	2026-06-18 21:41:33 +02:00
Wei Hong	a52ac6822b	fix(cookbook): pull llama.cpp from the ggml-org GHCR namespace (#4457 ) (#4490 ) The Dependencies tab's llama.cpp docker recipe surfaced \`docker pull ghcr.io/ggerganov/llama.cpp:server-cuda\`. The upstream repo moved from github.com/ggerganov/llama.cpp to github.com/ggml-org/llama.cpp and the old GHCR namespace no longer publishes images, so copying the recipe failed with: failed to resolve reference "ghcr.io/ggerganov/llama.cpp:server-cuda": not found Point the recipe at \`ghcr.io/ggml-org/llama.cpp:server-cuda\`, which is already the namespace routes/cookbook_routes.py uses for the source clone. Adds a regression test in the same shape as test_cookbook_diagnosis_js.py asserting the new namespace and forbidding the dead one. No CSS/HTML/SVG/style changes — the file is a pure data module (no DOM access) consumed by other renderers; only the displayed command text changes.	2026-06-18 21:29:47 +02:00
Wei Hong	7475779b7c	fix(chat): track chat hot-path background tasks for strong references (#4443 ) (#4444 ) Two background tasks scheduled on every chat completion in routes/chat_helpers.py — the memory/skill extraction dispatch and the session auto-namer — are created via bare asyncio.create_task(...). asyncio only holds a weak reference to the outer task, so the GC can collect it mid-execution and the work silently never runs. Add a module-private _BG_TASKS set and a _spawn_bg() helper that mirrors WebhookManager._spawn_tracked (the pattern #3964 / #4336 established for the webhook emitters two lines apart in the same function). Route both call sites through it so the lifecycle owner is explicit. Adds an AST-level guard test that fails on any bare asyncio.create_task(...) statement in routes/chat_helpers.py to prevent a regression — same shape as test_webhook_emitters_use_manager.py from #4336. The same bare pattern exists in routes/email_routes.py and routes/cookbook_routes.py; left out of this PR per CONTRIBUTING.md's "one fix per PR" and tracked in #4443's "Additional Information" for a follow-up.	2026-06-18 21:26:11 +02:00
Christian Eriksson	e7ffc69729	fix(cookbook): scope the "Kill vLLM" diagnosis to actual vLLM tracebacks (#4517 ) The diagnosis panel offered a "Kill vLLM processes" (pkill -f vllm) recovery for ANY Python traceback — including pip build failures and other tracebacks that have nothing to do with vLLM. That advice is useless for a build failure and harmful if an unrelated vLLM server happens to be running. ERROR_PATTERNS in static/js/cookbook-diagnosis.js had one catch-all traceback matcher that always attached the vLLM-kill fix. Split it into three (all keeping the existing healthy-server suppression): - pip build failure (Failed to build / metadata-generation-failed / subprocess-exited-with-error / Could not build wheels) -> "a dependency failed to build" message, no kill. - vLLM-specific traceback (tail mentions vllm) -> keeps the kill, now scoped. - any other traceback -> neutral "check the captured output" message, no kill. How to test: - node --check static/js/cookbook-diagnosis.js - Trigger a wheel-build failure (old package on a newer Python) or a non-vLLM traceback and open the diagnosis. Before: generic traceback message + "Kill vLLM processes" button. After: a build-failure / neutral message with no kill; only a real vLLM traceback still offers it. Fixes #4516 Co-authored-by: Claude	2026-06-18 21:18:14 +02:00
Karl Jussila	396e26b4bf	fix(auth): tie remember-me cookie lifetime to TOKEN_TTL (#4472 ) The persistent login cookie's max_age hardcoded 60 * 60 * 24 * 7, an independent copy of the session token lifetime that core/auth.py already defines once as TOKEN_TTL (and reports to the frontend via /api/auth/policy as session_days). If TOKEN_TTL changes, the cookie silently drifts: the browser keeps a cookie for a token whose lifetime no longer matches. Import TOKEN_TTL and use it for the cookie max_age so the session lifetime has a single source of truth. No behaviour change at the current value. Fixes #4471	2026-06-18 21:15:48 +02:00
nubs	0bfc7750a2	fix(llm): route gpt-oss harmony commentary channel without leaking markers/tool-args (#4523 ) The harmony stream router only recognized the analysis and final channels, so gpt-oss's standard `commentary` channel (tool-call preambles / function-arg bodies) was unhandled: the literal `<\|channel\|>commentary` marker, the `to=functions.*` recipient, and the commentary body all leaked into the visible answer. Add commentary to the marker regex + the suffix-hold table, and route its body to thinking (only `final` is user-facing). Adds a regression test (split-chunk + recipient + body), verified to fail without the fix.	2026-06-18 21:12:25 +02:00
Rolly Calma	790ef81b06	fix: use aware UTC in health timestamp (#4503 )	2026-06-18 20:58:25 +02:00
Victor	804691501f	test: stop test_skill_index_prompt_injection leaking a stub prefs_routes (#4387 ) _patch_prefs installs a fake routes.prefs_routes with a bare sys.modules[...] = assignment that is never undone. The stub is an empty ModuleType without _save_for_user, so a later test whose code path runs `from routes.prefs_routes import _save_for_user` (e.g. test_backup_import_skills) fails with ImportError under an unfavorable test order. Install the stub with monkeypatch.setitem instead (the helper already takes monkeypatch and uses it for DATA_DIR) so it is reverted at teardown. Repro: pytest tests/test_skill_index_prompt_injection.py tests/test_backup_import_skills.py (1 failed before, 5 passed after).	2026-06-18 20:54:15 +02:00
dependabot[bot]	8e6a2e89f8	chore(deps): bump actions/checkout in the actions group (#4559 ) Bumps the actions group with 1 update: [actions/checkout](https://github.com/actions/checkout). Updates `actions/checkout` from 6.0.3 to 7.0.0 - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/df4cb1c069e1874edd31b4311f1884172cec0e10...9c091bb21b7c1c1d1991bb908d89e4e9dddfe3e0) --- updated-dependencies: - dependency-name: actions/checkout dependency-version: 7.0.0 dependency-type: direct:production update-type: version-update:semver-major dependency-group: actions ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-06-18 20:49:58 +02:00
dependabot[bot]	dbcc7874bf	chore(deps): bump the npm group with 2 updates (#4558 ) Bumps the npm group with 2 updates: [@anthropic-ai/sdk](https://github.com/anthropics/anthropic-sdk-typescript) and [@antithesishq/bombadil](https://github.com/antithesishq/bombadil). Updates `@anthropic-ai/sdk` from 0.104.1 to 0.105.0 - [Release notes](https://github.com/anthropics/anthropic-sdk-typescript/releases) - [Changelog](https://github.com/anthropics/anthropic-sdk-typescript/blob/main/CHANGELOG.md) - [Commits](https://github.com/anthropics/anthropic-sdk-typescript/compare/sdk-v0.104.1...sdk-v0.105.0) Updates `@antithesishq/bombadil` from 0.5.0 to 0.6.1 - [Release notes](https://github.com/antithesishq/bombadil/releases) - [Changelog](https://github.com/antithesishq/bombadil/blob/main/CHANGELOG.md) - [Commits](https://github.com/antithesishq/bombadil/compare/v0.5.0...v0.6.1) --- updated-dependencies: - dependency-name: "@anthropic-ai/sdk" dependency-version: 0.105.0 dependency-type: direct:production update-type: version-update:semver-minor dependency-group: npm - dependency-name: "@antithesishq/bombadil" dependency-version: 0.6.1 dependency-type: direct:development update-type: version-update:semver-minor dependency-group: npm ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-06-18 20:42:49 +02:00
RaresKeY	16e660ad09	fix(hwfit): normalize CPU arch for fallback estimates (#4441 )	2026-06-18 20:26:22 +02:00
Mazen Tamer Salah	b51d83b16d	fix(agent): index api_call so RAG tool selection can retrieve it (#3923 ) * fix(agent): index api_call so RAG tool selection can retrieve it api_call exists in FUNCTION_TOOL_SCHEMAS and the agent's system prompt advertises configured API integrations, but the tool had no entry in BUILTIN_TOOL_DESCRIPTIONS. RAG tool selection embeds those descriptions and retrieves the top-K per message, so a tool without one can never be selected: the agent claims it can call Home Assistant/Miniflux/Gitea/etc. and then never receives the api_call schema (unless the Personal Assistant ASSISTANT_ALWAYS_AVAILABLE path applies). Add a retrieval-rich description for api_call, plus an ast-based parity test asserting every FUNCTION_TOOL_SCHEMAS tool has an index description so the next added tool cannot silently drift the same way. Fixes #3794 * fix(agent): route API-integration intent to api_call at selection time Addresses review (RaresKeY) on #3923: indexing api_call in the ToolIndex description was necessary but not sufficient — the #3794 repro ('Use the api_call tool to call Home Assistant GET /api/states') matched no domain in _classify_agent_request, classified as low-signal, so the agent loop skipped retrieval entirely and the schema filter sent only ALWAYS_AVAILABLE (manage_memory/ask_user/update_plan). api_call never reached the model. - _classify_agent_request: detect API-integration intent (api_call, integration(s), Home Assistant/Miniflux/Gitea/Linkding/Jellyfin) -> new 'integrations' domain, so the turn is no longer low-signal. - _DOMAIN_TOOL_MAP['integrations'] = {api_call}: deterministically seeds api_call into relevant tools after retrieval, independent of embeddings. - _DOMAIN_RULES['integrations']: rule pack (required — _domain_rules_for_tools indexes _DOMAIN_RULES[domain] directly). - tool_index _KEYWORD_HINTS: parity hint for the retrieval / keyword-fallback paths. - Regression drives the real classifier -> domain-map -> FUNCTION_TOOL_SCHEMAS filter chain and asserts api_call is advertised for the #3794 prompt. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-18 08:43:25 +00:00
Shreyas S Joshi	f70db19cc6	fix(document): allow render-pdf to be framed and 503 cleanly on missing PyMuPDF (#2103 ) * fix(document): allow render-pdf to be framed and 503 cleanly on missing PyMuPDF Fixes #2101. Two related bugs in the PDF-form library preview flow: 1. SecurityHeadersMiddleware was sending X-Frame-Options: DENY and frame-ancestors 'none' on /api/document/{doc_id}/render-pdf, but static/js/documentLibrary.js embeds the response in an <iframe> for the library card preview. The browser blocked the load with ERR_BLOCKED_BY_RESPONSE, leaving the user with a blank panel. Extend the existing is_tool_render exemption to also cover /api/document/.../render-pdf. Per-document owner checks still run in the route handler, so the exemption is scoped the same way as the tool-render exemption it mirrors. /api/document/.../export-pdf is left untouched — it's a download (Content-Disposition: attachment), not an iframe embed. 2. routes/document_routes.py:render_pdf called fill_fields, which raises RuntimeError via _require_fitz() when the optional PyMuPDF dependency isn't installed. That RuntimeError bubbled out as a generic 500 with a cryptic 'PDF render failed' detail. Reuse the existing _load_pdf_viewer_fitz() helper to fail fast with a 503 and a user-actionable install hint (mentions requirements-optional.txt and AGPL-3.0), matching the convention used by the other PDF endpoints. Tests cover both fixes: - middleware headers on /api/document/.../render-pdf (iframeable, but X-Content-Type-Options and Referrer-Policy are still set) - middleware headers on /api/document/.../export-pdf (must stay strict) - middleware path matching precision (similar-but-different paths stay strict) - middleware headers on /api/tools/.../render (no regression) - middleware headers on /api/chat (no regression) - render-pdf returns 503 with install hint when PyMuPDF is missing - 503 is raised before any file I/O (fail-fast ordering) * chore: address maintainer feedback on PDF previews same-origin framing and comment trimming * chore: make render-pdf regression tests order-independent	2026-06-18 06:25:26 +00:00
Kenny Van de Maele	56ba144875	refactor(tools): move model-interaction tools to the agent_tools registry (#4445 ) Moves chat_with_model, ask_teacher and list_models out of ai_interaction.py into src/agent_tools/model_interaction_tools.py (the do_ prefix dropped) and registers them in TOOL_HANDLERS, so dispatch flows through the registry instead of the dispatch_ai_tool elif in tool_execution.py. The implementations are relocated, not wrapped. ai_interaction.py keeps only the shared helpers they reuse (_resolve_model, AI_CHAT_TIMEOUT), still used by the not-yet-migrated session/pipeline tools. dispatch_ai_tool loses its three now-unused branches. Also removes the dead do_second_opinion: it was already off the live tool surface (no tag/schema/parsing/dispatch; tool_index.py notes it was removed), so the function and its stale frontend catalog entries (admin.js, assistant.js) are deleted. Tests: owner-scope test points at the new list_models location and drops the moved tools from the dispatch_ai_tool parametrize; a new test_model_interaction_registry covers registration, owner threading, and registry dispatch.	2026-06-18 05:56:37 +00:00
pewdiepie-archdaemon	d70c00e8d2	Merge branch 'main' of https://github.com/pewdiepie-archdaemon/odysseus	2026-06-17 12:28:24 +00:00
Matyas Gosztonyi	97a7f59fe7	fix(ui): share one z-order stack across Notes and modals (#3798 ) * fix(notes): bring pane above active windows * fix(notes): align tool window z-order handoff --------- Co-authored-by: Matyas Fenyves <16389204+uhhgoat@users.noreply.github.com>	2026-06-17 12:15:48 +02:00
Afonso Coutinho	24ace44888	fix: canvasCoords crashes on empty touch list (mobile race) (#2045 )	2026-06-17 10:25:39 +02:00
Kenny Van de Maele	93569b141b	fix(security): allowlist manage_mcp 'add' to close the agent-path RCE (#4433 ) * fix(security): allowlist manage_mcp 'add' to close the agent-path RCE do_manage_mcp('add') passed model- and prompt-injection-controlled command, args, and env straight to a stdio subprocess spawn with no validation, and it persisted an enabled server row before connecting (so a payload also survived to re-execute on restart). A string smuggled into a skill description, memory entry, fetched page, or email body could register a server running arbitrary code as the app UID, e.g. command='sh' args=['-c','...']. Add _validate_mcp_command, applied on the agent path before any DB write or spawn: - Hard-deny interpreters, runtimes, package runners, shells, and exec-wrappers (even if an operator lists one in ODYSSEUS_MCP_ALLOWED_COMMANDS). - Require a bare basename (no path components, no shell metacharacters) that is present in the operator allowlist (empty by default). - Reject code-exec argv flags by prefix so glued forms are caught too (-c/-e/-m/--eval/--exec/--print/--module/--command/--require), remote-URL args, and env keys that inject code into the child (LD_PRELOAD, NODE_OPTIONS, PYTHONPATH, DYLD_, PATH, ...). A rejected registration returns an error, writes no row, and makes no connection. The trusted admin route is unchanged. Mirrors the policy intent of _validate_serve_cmd but inverted for the model-reachable surface. Supersedes #438; incorporates the bypass forms found in its review (interpreter script paths, -m pip, glued -c/-e, --eval=, eval subcommands, package runners, remote URLs) and adds integration coverage on the real do_manage_mcp path. Closes #2891 fix(security): deny versioned/alias runtimes in manage_mcp allowlist Addresses RaresKeY's review on #4433. The hard-deny matched command names exactly, so versioned or alias runtime forms (python3.11, node18, pip3, ruby3.2, java, javac, bunx, tsx, ts-node, pypy3, ...) slipped past and, if an operator allowlisted one, re-opened the prompt-injection-controlled MCP registration path. - Canonicalize a trailing version suffix before the deny check so versioned forms collapse to the family (python3.11 -> python, node18 -> node, pip3 -> pip); both the raw basename and the canonical form are denied. - Broaden the denied-family set (java/javac/jshell/jbang/kotlin/dotnet/mono/ swift/osascript/tsx/ts-node/bunx/pypy/jruby/raku/luajit/wish/expect/iex). Deny runs before the operator allowlist, so an alias cannot be allowlisted back in. Canonicalization only feeds the deny check, so a legit name that ends in a digit still reaches the normal allowlist check rather than being mis-denied. Adds validator + integration regressions for versioned/alias runtimes asserting no DB row and no connection, including the allowlisted-anyway case.	2026-06-16 14:34:53 +00:00
Catalin Iliescu	9a00401507	fix(hwfit): use CPU fallback for cpu_only speed estimates (#4397 ) * fix(hwfit): use CPU fallback for cpu_only speed estimates * fix(hwfit): preserve ARM fallback for cpu_only estimates --------- Co-authored-by: Cata <cata@bigjohn.local>	2026-06-16 14:18:31 +00:00
Aura Rays Lab	76562ae31d	Change host from 0.0.0.0 to 127.0.0.1 in CONTRIBUTING.md (#4422 ) Updated the host address in the run command for clarity.	2026-06-16 13:40:47 +00:00
Christian Eriksson	497f455da6	fix(cookbook): open() no longer crashes when a task has a diagnosis (#4417 ) _showDiagnosis referenced an undefined `body` (left over from the refactor that moved the diagnosis text into the toolbar), throwing a ReferenceError whenever a failed task rendered fix buttons. Because open() wraps its render in try/finally with no catch, the throw escaped before the modal was un-hidden, so the whole Cookbook silently failed to open. - cookbook-diagnosis.js: append the fixes row to `diag` (the in-scope container) instead of the removed `body` element. - cookbook.js: guard the render passes in open() so one broken task card can't leave the entire panel stuck hidden. Fixes #4406	2026-06-16 13:35:51 +00:00
Ashvin	dd20c2bc75	fix(tasks): offer shell/file tools to scheduled task agents by default (#4398 ) The scheduled-task runner built the agent's tool set from RAG retrieval plus ASSISTANT_ALWAYS_AVAILABLE. Neither includes bash/python (nor the file tools), and no keyword hint force-includes them, so a task only saw the shell when the tool-embedding index happened to surface it. On hosts where that index is empty or degraded (e.g. a fresh Docker deploy), retrieval returns nothing and the task agent never receives bash/python — telling the user the shell is disabled even for an admin owner. Offer the shell/file group to task agents by default, mirroring the chat agent where these are on unless a privilege or global setting turns them off. The existing blocked_tools_for_owner() gate in stream_agent_loop still strips the whole group for non-admin multi-user owners and only admits it for admins and single-user (AUTH_ENABLED=false) deployments, so this changes what is offered, not who is allowed. A crew that defines an explicit enabled_tools allowlist still has its restriction honored. Also merge the operator's global disabled_tools setting into the scheduler's disabled set before composing relevant_tools and before entering the agent loop, matching what chat already does. Without it, the global tool-disable contract did not reach unattended scheduled tasks: an admin or AUTH_ENABLED=false task could still see and call shell/file tools the operator had turned off globally, since the prompt/schema/execution gates only enforce the disabled tools passed in.	2026-06-16 13:27:30 +00:00
Afonso Coutinho	a36b423a4e	Fix odysseus-calendar list dropping in-progress / multi-day events (#2065 ) cmd_list filtered on the event START falling inside the window (dtstart >= start AND dtstart < end). The canonical web route (routes/calendar_routes.py) and the recurrence contract test use OVERLAP semantics for non-recurring events: dtstart < end AND dtend > start. So an event that began before the window but is still ongoing inside it — e.g. a 09:00-17:00 conference listed at 14:00, or any multi-day event spanning the window — was silently dropped by the CLI even though the web UI shows it. Use overlap, matching the route. dtend is NOT NULL in the schema, so no null-end regression.	2026-06-16 14:04:56 +02:00
Rudy Wolf	4e477741e7	harden(agent-loop): wrap non-native tool results as untrusted data (#1629 ) The non-native (prompted) tool-call path fed tool output back to the model as a plain "[Tool execution results]" user message, bypassing the untrusted_context_message wrapper that THREAT_MODEL.md requires for tool output. That path is what models without native tool-calling (many smaller local models) use, so prompt-injection inside a tool result (fetched page, file read, MCP/email output) could be read as instructions there. Wrap it via untrusted_context_message("tool execution results", ...), the same hardening already applied to skills (#788) and escalation traces (#275). Also update _recent_context_for_retrieval, which used the old "[Tool execution results]" prefix as a sentinel to keep tool envelopes out of the retrieval query, to recognise the wrapped envelope via metadata.trusted. The native path keeps returning tool-role messages (a user-role wrapper would break the native tool-call contract); it is covered by UNTRUSTED_CONTEXT_POLICY. Adds tests/test_tool_output_prompt_injection.py. Fixes #1627. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 13:35:07 +02:00
Kenny Van de Maele	a2261c38c1	refactor(auth): centralize the internal-tool pseudo-username into a constant (#4333 ) The in-process tool loopback stamps current_user = "internal-tool" and require_admin grants admin to that sentinel; it is also a reserved username. That security-sensitive string was hand-typed in ~7 places (stamp, admin gate, RESERVED_USERNAMES, and standalone admin-equivalent checks in note/research/ shell/task routes), where a typo silently breaks an auth gate. Add INTERNAL_TOOL_USER in core/middleware.py next to INTERNAL_TOOL_TOKEN/ INTERNAL_TOOL_HEADER and use it at every such site. A typo is now an ImportError, not a silent mismatch. auth.py importing middleware is acyclic (middleware imports no app modules). Behaviour is unchanged. The multi-sentinel sets bundling internal-tool with api/demo/system (assistant_routes, task_scheduler, research_routes) are a separate reserved-set dedup, left for a follow-up. Closes #4332	2026-06-16 13:13:00 +02:00
Alexandre Teixeira	bf56010aad	test: split provider classification tests (#4392 )	2026-06-16 09:54:07 +00:00
Karl Jussila	ee72d71872	fix(auth): centralize password and username validation constants (#4120 ) Added PASSWORD_MIN_LENGTH and RESERVED_USERNAMES to src/constants.py as the single source of truth. Previously PASSWORD_MIN_LENGTH was hardcoded as 8 in four route handlers and all three JS validation paths; RESERVED_USERNAMES was an inline frozenset duplicated in core/auth.py, routes/assistant_routes.py, routes/research_routes.py, and src/task_scheduler.py. Added GET /api/auth/policy (unauthenticated) so the frontend reads the real values from the server instead of hardcoding them in JS. Added missing empty-username guard to /setup and admin POST /users. Both returned a misleading 500/409 on whitespace-only input. /signup already had the check; this makes all three consistent.	2026-06-16 09:52:15 +02:00
RaresKeY	2b519bf355	fix(routes): normalize session owner fallback helpers (#4313 ) * fix(memory): normalize import session fallback * fix(chat): use token owner for compaction scope * fix(background): honor session endpoint fallback	2026-06-16 06:07:42 +01:00
Kfir Sadeh	d795d9a923	feat(launcher): add portable windows launcher (#976 ) * feat(windows): add standalone portable executable, splash screen, and system tray * test: fix test_get_wsl_windows_user_profile_falls_back_to_users_dir on Windows * Refactor launcher: isolate desktop logic into launcher.py, clean app.py/requirements, update build scripts, and add tests * chore: clean launcher test whitespace --------- Co-authored-by: Alexandre Teixeira <alexandremagteixeira@gmail.com>	2026-06-16 04:58:16 +01:00
Tal.Yuan	648db61b45	docs(architecture): add Phase 0 runtime inventory document (#4148 ) * docs(architecture): add Phase 0 runtime inventory document Per #4082 requirements, this no-code planning document maps: - Largest runtime modules (Python + frontend) - Import dependency graph and cross-layer violations - Route ownership grouped by feature domain - Tool registry boundaries and split candidates - Risk-ranked candidate slices with recommended first 3 PRs - Safety guardrails and validation commands for follow-up work Closes #4082 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * docs(architecture): correct inventory metrics per review feedback Address @alteixeira20 review on #4148 (CHANGES_REQUESTED): - src/ flat .py: ~60 -> 91; routes/: 52 -> 54 - core/database.py importers: 49 -> 94; src/agent_loop.py: -> 21 - src/ -> routes/ import lines: ~20 -> 38 - src/ subdirs: 3 -> 2 (agent_tools/, search/); drop non-existent agent/ - move main.py and src/agent/ out of current-structure into new section 10 'Future Direction (NOT current state)' - route grouping: frame as one domain per PR, not a broad reorganization (helper imports / registration / test path risk) * docs(architecture): round-2 fixes — move to specs/, correct counts, frame as candidate Per @alteixeira20 + @RaresKeY review on #4148: - Move docs/architecture-runtime-inventory.md -> specs/ (docs/ is GitHub Pages public content, per @RaresKeY) - src/ -> routes/ import lines: 38 -> 30 (direct grep of import lines referencing routes/, matching reviewer's count) - self-caught count drift: tests 552 -> 544; routes->src 349 -> 351; src->core 49 -> 99 - frame section 6 (rankings/package shapes/split order/route grouping) and section 10 (future direction) as candidate proposals pending maintainer agreement, not a committed plan (per @RaresKeY) * docs(architecture): round-3 reviewer fixes — fix tool categorization, counts, appendix Self-review as reviewer found: - §5.2 tool categories were wrong: listed filesystem/shell/email-sending tools that are NOT in tool_implementations.py (they live in src/agent_tools/). Rewrote to the actual 33 do_* functions grouped by domain (system/cookbook/calendar/notes/search/research/contacts/vault/image) - §2.1 builtin_actions.py: 0 -> 2 classes, ~26 -> ~24 functions - §5.1: '33+' -> '33' (exact count) - Appendix A: 'Complete File Listing' -> 'File Listing'; src noted as '61 of 91 shown' (was claiming complete but listed 61) - Last updated date refreshed * docs(architecture): round-4 — verify remaining counts, soften §6.3 framing - task_scheduler ~6 -> 5 funcs; tool_index ~580 -> 542 lines (verified vs dev) - §6.3 'Recommended First 3 Slices' -> 'Candidate' (ownership unsettled, per review) - verified §4 route-domain line counts, §2.2 frontend counts, mcp_servers=4 - full test suite: 3267 passed, 1 skipped, 0 failed * docs(architecture): refresh Phase 0 inventory metrics + document counting method Refresh every count against current dev (`b58af42`) per review on #4148: - src/ flat .py: 91 -> 95; tests/test_.py: 544 -> 583 - core.database importers: 94 -> 102; src.agent_loop importers: 21 -> 22 - src/ -> routes/ lines: 30 -> 31; routes/ -> src/: 351 -> 374; src/ -> core/: 99 -> 106 - Last updated: dev@9d7a3d6 -> dev@b58af42 Add a "How the metrics are computed" note under section 3.4 with the exact grep/find command for each count, so the numbers are reproducible and future dev drift is a one-command recheck instead of another review round (per the request to note the counting method). Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> docs(architecture): refresh remaining counts + add snapshot basis note Reviewer self-audit of the previous refresh caught more stale counts after the rebase onto dev@b58af42: - tool_implementations importers: 18 -> 17 (§3.2, §6.2, Appendix B) - core/database classes: 27 -> 28 (§2.1, §6.2) - mcp_servers .py files: 4 -> 5 (§1.1) - routes/ -> core/ import lines: 124 -> 126 (§3.4) Line counts in §2.1/§2.2 also drifted over the rebased range but are left as-is and covered by a new "Snapshot basis" note in the header: line counts are a snapshot that drifts as dev moves (recompute with wc -l), while the importer/file/import-line counts are the authoritative ones refreshed here. This keeps the inventory honest about live metric vs structural snapshot, so dev drift no longer triggers a review round. Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> * docs(architecture): fix missed tool_implementations importer count in §6.3 Follow-up to the previous refresh: §6.3 Slice 1 still read "18 importers" after the 18->17 update elsewhere. Correct to 17 for consistency. Doc-only. Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: yuandonghao <yuandonghao@cohl.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-16 04:57:24 +01:00
RaresKeY	260ce8ba59	fix(email): enforce MCP owner boundaries (#4335 ) * fix(email): enforce MCP owner boundaries * fix(email): fail closed for unowned MCP fallback	2026-06-16 04:31:24 +01:00
RaresKeY	2f9ae43a58	test(email): cover sender signature owner cache writes (#4278 )	2026-06-16 04:21:11 +01:00
RaresKeY	293bbfabf4	test(hwfit): cover SSH target validation regressions (#4279 )	2026-06-16 04:18:21 +01:00
Alexandre Teixeira	0086399656	test: add fire_and_forget to API chat webhook stub (#4383 )	2026-06-16 03:15:14 +00:00
RaresKeY	9d2989f386	test(auth): cover reserved username sentinel gate (#4276 )	2026-06-16 04:09:58 +01:00
RaresKeY	b5edbd3df7	fix(devops): harden docker config defaults (#4349 )	2026-06-16 04:03:43 +01:00
RaresKeY	33fe7276be	fix(endpoints): normalize URL handling (#4338 )	2026-06-16 03:59:18 +01:00
RaresKeY	a031a94a2e	fix(cookbook): harden remote serve host handling (#4345 )	2026-06-16 03:46:32 +01:00
RaresKeY	4d10c16d02	fix(auth): clean up rename and null-owner ownership (#4340 )	2026-06-16 03:33:02 +01:00
RaresKeY	745c10e0d7	fix(gallery): confine gallery image path resolution (#4352 )	2026-06-16 03:28:09 +01:00

1 2 3 4 5 ...

1628 Commits