odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-20 11:45:24 -04:00

Author	SHA1	Message	Date
nopoz	160267417e	fix(personal): scope RAG file delete to the caller's own upload dir (#4602 ) The DELETE /api/personal/file disk-delete containment check used the shared PERSONAL_UPLOADS_DIR root, so one admin could delete another user's personal upload by passing its path (uploads are partitioned per owner under <root>/<owner>/). Confine the check to the caller's own per-owner subdir via _personal_upload_dir_for_owner(owner). RAG removal and listing exclusion are unchanged (they still serve non-upload indexed sources). Adds a regression test for the cross-owner case.	2026-06-20 00:50:15 +02:00
Kenny Van de Maele	ed18192a8e	refactor(tools): move session tools to the agent_tools registry (#4454 ) Moves create_session, list_sessions, send_to_session and manage_session out of ai_interaction.py into src/agent_tools/session_tools.py (the do_ prefix dropped) and registers them in TOOL_HANDLERS, so dispatch flows through the registry instead of the dispatch_ai_tool elif in tool_execution.py. Same pattern as the model-interaction move. The bodies move verbatim; each fetches the runtime-set session manager via a get_session_manager() shim, and reuses _resolve_model / AI_CHAT_TIMEOUT from ai_interaction. manage_session's internal 'list' alias is repointed from the old do_list_sessions to the moved list_sessions. stream_ai_tool (dead, no callers) and do_pipeline stay put. dispatch_ai_tool loses its four now-unused branches. Tests: test_session_tools_registry covers registration, owner threading, the manage_session->list_sessions delegation, graceful no-manager handling, and registry dispatch. Verified end-to-end against a live SessionManager.	2026-06-19 11:55:22 +02:00
nopoz	076e8c93c9	fix(ui): escape model name in model-info popup (DOM-XSS) + two latent sinks (#4605 ) chatRenderer.js built the model-info popup HTML by concatenating the model name (from the LLM response's model/answered_by field) into popup.innerHTML without escaping, so a model advertised as an HTML/script payload executed when the user clicked the role label. Wrap both insertions with the uiModule.esc() helper the same function already uses. Also apply existing escape helpers at two latent sinks flagged by CodeQL, fed only by self-authored/server values today: document-tab title via _esc(), and the calendar event background URL (escape the double quote that would otherwise break out of the style="..." attribute).	2026-06-19 11:03:44 +02:00
Kenny Van de Maele	a226c94df7	chore(deps): remove unused @anthropic-ai/sdk dependency (#4566 ) Never imported anywhere in the codebase (unused since v1.0); it is the only root dependency and nothing depends on it. Removing it also drops 6 transitive packages from the lockfile. Fixes #4565	2026-06-19 09:40:35 +02:00
RaresKeY	057ec0552c	fix(cookbook): stop Windows process trees (#4283 )	2026-06-19 00:28:25 -07:00
Kenny Van de Maele	cdae9879f2	feat(agent): add manage_bg_jobs tool to inspect and kill background bash jobs (#4577 ) Detached bash jobs (#!bg) could be launched and auto-reported on completion, but the agent had no way to act on a running one: no on-demand output read and no kill (it blocked until the 1h max-runtime). bg_jobs had the pieces (_read_output, list_for_session, internal _kill) but none was exposed. Adds: - bg_jobs.kill(job_id): tears down the process tree, marks the job killed, and sets followed_up so the monitor does not also auto-continue a deliberate kill. - manage_bg_jobs registry tool with actions list / output / kill, scoped to the chat that launched the job (cross-session access reads as not found). - Wiring: TOOL_HANDLERS/TAGS, function schema, RAG index + keyword hints, parser name map, dispatch (threads session_id via _direct_fallback). Gated like bash (NON_ADMIN_BLOCKED_TOOLS; plan-mode mutator). - agent_loop: background-job intent regex maps to the files domain (and the tool joins _DOMAIN_TOOL_MAP[files]) so short commands like 'kill that job' are not dropped by the low-signal gate that skips tool retrieval. - bg launch message tells the model to call manage_bg_jobs itself for check/stop rather than printing raw tool syntax to the user. Tests: tests/test_bg_job_tools.py (kill semantics, per-chat scoping, actions, and the intent classifier).	2026-06-19 00:28:22 -07:00
Michael	39a802bea2	fix(tools): prune skipped dirs before descending in glob tool (#4538 ) * fix(tools): prune skipped dirs before descending in glob tool GlobTool used pathlib.Path.rglob which descends into every directory (including node_modules, .git, dist, etc.) and filters AFTER the walk. On repos with large junk directories this causes the glob tool to hang for minutes. Replace rglob with os.walk that prunes _CODENAV_SKIP_DIRS before descending — matching the approach GrepTool already uses. Also add a fast path for literal patterns (no wildcards → direct path lookup). Fixes #4493 * fix(tools): use regex glob matching to fix * semantics and literal fallback Replace fnmatch with _glob_to_regex so that * stays within a single path segment (matching pathlib/rglob semantics) and */ spans zero or more directories. Literal patterns now fall through to os.walk when the direct path lookup misses, so e.g. 'foo.py' still finds files at any depth. Add tests for: - bare literal matching in subdirectories - multi-segment single-star patterns (sub/.txt) - * not crossing / boundaries - ** matching at arbitrary depth Closes #4493 --------- Co-authored-by: michaelxer <michaelxer@users.noreply.github.com>	2026-06-18 22:02:29 +02:00
RaresKeY	1cc8a373b0	fix(cookbook): validate agent SSH targets (#4429 )	2026-06-18 21:41:33 +02:00
Wei Hong	a52ac6822b	fix(cookbook): pull llama.cpp from the ggml-org GHCR namespace (#4457 ) (#4490 ) The Dependencies tab's llama.cpp docker recipe surfaced \`docker pull ghcr.io/ggerganov/llama.cpp:server-cuda\`. The upstream repo moved from github.com/ggerganov/llama.cpp to github.com/ggml-org/llama.cpp and the old GHCR namespace no longer publishes images, so copying the recipe failed with: failed to resolve reference "ghcr.io/ggerganov/llama.cpp:server-cuda": not found Point the recipe at \`ghcr.io/ggml-org/llama.cpp:server-cuda\`, which is already the namespace routes/cookbook_routes.py uses for the source clone. Adds a regression test in the same shape as test_cookbook_diagnosis_js.py asserting the new namespace and forbidding the dead one. No CSS/HTML/SVG/style changes — the file is a pure data module (no DOM access) consumed by other renderers; only the displayed command text changes.	2026-06-18 21:29:47 +02:00
Wei Hong	7475779b7c	fix(chat): track chat hot-path background tasks for strong references (#4443 ) (#4444 ) Two background tasks scheduled on every chat completion in routes/chat_helpers.py — the memory/skill extraction dispatch and the session auto-namer — are created via bare asyncio.create_task(...). asyncio only holds a weak reference to the outer task, so the GC can collect it mid-execution and the work silently never runs. Add a module-private _BG_TASKS set and a _spawn_bg() helper that mirrors WebhookManager._spawn_tracked (the pattern #3964 / #4336 established for the webhook emitters two lines apart in the same function). Route both call sites through it so the lifecycle owner is explicit. Adds an AST-level guard test that fails on any bare asyncio.create_task(...) statement in routes/chat_helpers.py to prevent a regression — same shape as test_webhook_emitters_use_manager.py from #4336. The same bare pattern exists in routes/email_routes.py and routes/cookbook_routes.py; left out of this PR per CONTRIBUTING.md's "one fix per PR" and tracked in #4443's "Additional Information" for a follow-up.	2026-06-18 21:26:11 +02:00
Christian Eriksson	e7ffc69729	fix(cookbook): scope the "Kill vLLM" diagnosis to actual vLLM tracebacks (#4517 ) The diagnosis panel offered a "Kill vLLM processes" (pkill -f vllm) recovery for ANY Python traceback — including pip build failures and other tracebacks that have nothing to do with vLLM. That advice is useless for a build failure and harmful if an unrelated vLLM server happens to be running. ERROR_PATTERNS in static/js/cookbook-diagnosis.js had one catch-all traceback matcher that always attached the vLLM-kill fix. Split it into three (all keeping the existing healthy-server suppression): - pip build failure (Failed to build / metadata-generation-failed / subprocess-exited-with-error / Could not build wheels) -> "a dependency failed to build" message, no kill. - vLLM-specific traceback (tail mentions vllm) -> keeps the kill, now scoped. - any other traceback -> neutral "check the captured output" message, no kill. How to test: - node --check static/js/cookbook-diagnosis.js - Trigger a wheel-build failure (old package on a newer Python) or a non-vLLM traceback and open the diagnosis. Before: generic traceback message + "Kill vLLM processes" button. After: a build-failure / neutral message with no kill; only a real vLLM traceback still offers it. Fixes #4516 Co-authored-by: Claude	2026-06-18 21:18:14 +02:00
Karl Jussila	396e26b4bf	fix(auth): tie remember-me cookie lifetime to TOKEN_TTL (#4472 ) The persistent login cookie's max_age hardcoded 60 * 60 * 24 * 7, an independent copy of the session token lifetime that core/auth.py already defines once as TOKEN_TTL (and reports to the frontend via /api/auth/policy as session_days). If TOKEN_TTL changes, the cookie silently drifts: the browser keeps a cookie for a token whose lifetime no longer matches. Import TOKEN_TTL and use it for the cookie max_age so the session lifetime has a single source of truth. No behaviour change at the current value. Fixes #4471	2026-06-18 21:15:48 +02:00
nubs	0bfc7750a2	fix(llm): route gpt-oss harmony commentary channel without leaking markers/tool-args (#4523 ) The harmony stream router only recognized the analysis and final channels, so gpt-oss's standard `commentary` channel (tool-call preambles / function-arg bodies) was unhandled: the literal `<\|channel\|>commentary` marker, the `to=functions.*` recipient, and the commentary body all leaked into the visible answer. Add commentary to the marker regex + the suffix-hold table, and route its body to thinking (only `final` is user-facing). Adds a regression test (split-chunk + recipient + body), verified to fail without the fix.	2026-06-18 21:12:25 +02:00
Rolly Calma	790ef81b06	fix: use aware UTC in health timestamp (#4503 )	2026-06-18 20:58:25 +02:00
Victor	804691501f	test: stop test_skill_index_prompt_injection leaking a stub prefs_routes (#4387 ) _patch_prefs installs a fake routes.prefs_routes with a bare sys.modules[...] = assignment that is never undone. The stub is an empty ModuleType without _save_for_user, so a later test whose code path runs `from routes.prefs_routes import _save_for_user` (e.g. test_backup_import_skills) fails with ImportError under an unfavorable test order. Install the stub with monkeypatch.setitem instead (the helper already takes monkeypatch and uses it for DATA_DIR) so it is reverted at teardown. Repro: pytest tests/test_skill_index_prompt_injection.py tests/test_backup_import_skills.py (1 failed before, 5 passed after).	2026-06-18 20:54:15 +02:00
dependabot[bot]	8e6a2e89f8	chore(deps): bump actions/checkout in the actions group (#4559 ) Bumps the actions group with 1 update: [actions/checkout](https://github.com/actions/checkout). Updates `actions/checkout` from 6.0.3 to 7.0.0 - [Release notes](https://github.com/actions/checkout/releases) - [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md) - [Commits](https://github.com/actions/checkout/compare/df4cb1c069e1874edd31b4311f1884172cec0e10...9c091bb21b7c1c1d1991bb908d89e4e9dddfe3e0) --- updated-dependencies: - dependency-name: actions/checkout dependency-version: 7.0.0 dependency-type: direct:production update-type: version-update:semver-major dependency-group: actions ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-06-18 20:49:58 +02:00
dependabot[bot]	dbcc7874bf	chore(deps): bump the npm group with 2 updates (#4558 ) Bumps the npm group with 2 updates: [@anthropic-ai/sdk](https://github.com/anthropics/anthropic-sdk-typescript) and [@antithesishq/bombadil](https://github.com/antithesishq/bombadil). Updates `@anthropic-ai/sdk` from 0.104.1 to 0.105.0 - [Release notes](https://github.com/anthropics/anthropic-sdk-typescript/releases) - [Changelog](https://github.com/anthropics/anthropic-sdk-typescript/blob/main/CHANGELOG.md) - [Commits](https://github.com/anthropics/anthropic-sdk-typescript/compare/sdk-v0.104.1...sdk-v0.105.0) Updates `@antithesishq/bombadil` from 0.5.0 to 0.6.1 - [Release notes](https://github.com/antithesishq/bombadil/releases) - [Changelog](https://github.com/antithesishq/bombadil/blob/main/CHANGELOG.md) - [Commits](https://github.com/antithesishq/bombadil/compare/v0.5.0...v0.6.1) --- updated-dependencies: - dependency-name: "@anthropic-ai/sdk" dependency-version: 0.105.0 dependency-type: direct:production update-type: version-update:semver-minor dependency-group: npm - dependency-name: "@antithesishq/bombadil" dependency-version: 0.6.1 dependency-type: direct:development update-type: version-update:semver-minor dependency-group: npm ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2026-06-18 20:42:49 +02:00
RaresKeY	16e660ad09	fix(hwfit): normalize CPU arch for fallback estimates (#4441 )	2026-06-18 20:26:22 +02:00
Mazen Tamer Salah	b51d83b16d	fix(agent): index api_call so RAG tool selection can retrieve it (#3923 ) * fix(agent): index api_call so RAG tool selection can retrieve it api_call exists in FUNCTION_TOOL_SCHEMAS and the agent's system prompt advertises configured API integrations, but the tool had no entry in BUILTIN_TOOL_DESCRIPTIONS. RAG tool selection embeds those descriptions and retrieves the top-K per message, so a tool without one can never be selected: the agent claims it can call Home Assistant/Miniflux/Gitea/etc. and then never receives the api_call schema (unless the Personal Assistant ASSISTANT_ALWAYS_AVAILABLE path applies). Add a retrieval-rich description for api_call, plus an ast-based parity test asserting every FUNCTION_TOOL_SCHEMAS tool has an index description so the next added tool cannot silently drift the same way. Fixes #3794 * fix(agent): route API-integration intent to api_call at selection time Addresses review (RaresKeY) on #3923: indexing api_call in the ToolIndex description was necessary but not sufficient — the #3794 repro ('Use the api_call tool to call Home Assistant GET /api/states') matched no domain in _classify_agent_request, classified as low-signal, so the agent loop skipped retrieval entirely and the schema filter sent only ALWAYS_AVAILABLE (manage_memory/ask_user/update_plan). api_call never reached the model. - _classify_agent_request: detect API-integration intent (api_call, integration(s), Home Assistant/Miniflux/Gitea/Linkding/Jellyfin) -> new 'integrations' domain, so the turn is no longer low-signal. - _DOMAIN_TOOL_MAP['integrations'] = {api_call}: deterministically seeds api_call into relevant tools after retrieval, independent of embeddings. - _DOMAIN_RULES['integrations']: rule pack (required — _domain_rules_for_tools indexes _DOMAIN_RULES[domain] directly). - tool_index _KEYWORD_HINTS: parity hint for the retrieval / keyword-fallback paths. - Regression drives the real classifier -> domain-map -> FUNCTION_TOOL_SCHEMAS filter chain and asserts api_call is advertised for the #3794 prompt. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-18 08:43:25 +00:00
Shreyas S Joshi	f70db19cc6	fix(document): allow render-pdf to be framed and 503 cleanly on missing PyMuPDF (#2103 ) * fix(document): allow render-pdf to be framed and 503 cleanly on missing PyMuPDF Fixes #2101. Two related bugs in the PDF-form library preview flow: 1. SecurityHeadersMiddleware was sending X-Frame-Options: DENY and frame-ancestors 'none' on /api/document/{doc_id}/render-pdf, but static/js/documentLibrary.js embeds the response in an <iframe> for the library card preview. The browser blocked the load with ERR_BLOCKED_BY_RESPONSE, leaving the user with a blank panel. Extend the existing is_tool_render exemption to also cover /api/document/.../render-pdf. Per-document owner checks still run in the route handler, so the exemption is scoped the same way as the tool-render exemption it mirrors. /api/document/.../export-pdf is left untouched — it's a download (Content-Disposition: attachment), not an iframe embed. 2. routes/document_routes.py:render_pdf called fill_fields, which raises RuntimeError via _require_fitz() when the optional PyMuPDF dependency isn't installed. That RuntimeError bubbled out as a generic 500 with a cryptic 'PDF render failed' detail. Reuse the existing _load_pdf_viewer_fitz() helper to fail fast with a 503 and a user-actionable install hint (mentions requirements-optional.txt and AGPL-3.0), matching the convention used by the other PDF endpoints. Tests cover both fixes: - middleware headers on /api/document/.../render-pdf (iframeable, but X-Content-Type-Options and Referrer-Policy are still set) - middleware headers on /api/document/.../export-pdf (must stay strict) - middleware path matching precision (similar-but-different paths stay strict) - middleware headers on /api/tools/.../render (no regression) - middleware headers on /api/chat (no regression) - render-pdf returns 503 with install hint when PyMuPDF is missing - 503 is raised before any file I/O (fail-fast ordering) * chore: address maintainer feedback on PDF previews same-origin framing and comment trimming * chore: make render-pdf regression tests order-independent	2026-06-18 06:25:26 +00:00
Kenny Van de Maele	56ba144875	refactor(tools): move model-interaction tools to the agent_tools registry (#4445 ) Moves chat_with_model, ask_teacher and list_models out of ai_interaction.py into src/agent_tools/model_interaction_tools.py (the do_ prefix dropped) and registers them in TOOL_HANDLERS, so dispatch flows through the registry instead of the dispatch_ai_tool elif in tool_execution.py. The implementations are relocated, not wrapped. ai_interaction.py keeps only the shared helpers they reuse (_resolve_model, AI_CHAT_TIMEOUT), still used by the not-yet-migrated session/pipeline tools. dispatch_ai_tool loses its three now-unused branches. Also removes the dead do_second_opinion: it was already off the live tool surface (no tag/schema/parsing/dispatch; tool_index.py notes it was removed), so the function and its stale frontend catalog entries (admin.js, assistant.js) are deleted. Tests: owner-scope test points at the new list_models location and drops the moved tools from the dispatch_ai_tool parametrize; a new test_model_interaction_registry covers registration, owner threading, and registry dispatch.	2026-06-18 05:56:37 +00:00
Matyas Gosztonyi	97a7f59fe7	fix(ui): share one z-order stack across Notes and modals (#3798 ) * fix(notes): bring pane above active windows * fix(notes): align tool window z-order handoff --------- Co-authored-by: Matyas Fenyves <16389204+uhhgoat@users.noreply.github.com>	2026-06-17 12:15:48 +02:00
Afonso Coutinho	24ace44888	fix: canvasCoords crashes on empty touch list (mobile race) (#2045 )	2026-06-17 10:25:39 +02:00
Kenny Van de Maele	93569b141b	fix(security): allowlist manage_mcp 'add' to close the agent-path RCE (#4433 ) * fix(security): allowlist manage_mcp 'add' to close the agent-path RCE do_manage_mcp('add') passed model- and prompt-injection-controlled command, args, and env straight to a stdio subprocess spawn with no validation, and it persisted an enabled server row before connecting (so a payload also survived to re-execute on restart). A string smuggled into a skill description, memory entry, fetched page, or email body could register a server running arbitrary code as the app UID, e.g. command='sh' args=['-c','...']. Add _validate_mcp_command, applied on the agent path before any DB write or spawn: - Hard-deny interpreters, runtimes, package runners, shells, and exec-wrappers (even if an operator lists one in ODYSSEUS_MCP_ALLOWED_COMMANDS). - Require a bare basename (no path components, no shell metacharacters) that is present in the operator allowlist (empty by default). - Reject code-exec argv flags by prefix so glued forms are caught too (-c/-e/-m/--eval/--exec/--print/--module/--command/--require), remote-URL args, and env keys that inject code into the child (LD_PRELOAD, NODE_OPTIONS, PYTHONPATH, DYLD_, PATH, ...). A rejected registration returns an error, writes no row, and makes no connection. The trusted admin route is unchanged. Mirrors the policy intent of _validate_serve_cmd but inverted for the model-reachable surface. Supersedes #438; incorporates the bypass forms found in its review (interpreter script paths, -m pip, glued -c/-e, --eval=, eval subcommands, package runners, remote URLs) and adds integration coverage on the real do_manage_mcp path. Closes #2891 fix(security): deny versioned/alias runtimes in manage_mcp allowlist Addresses RaresKeY's review on #4433. The hard-deny matched command names exactly, so versioned or alias runtime forms (python3.11, node18, pip3, ruby3.2, java, javac, bunx, tsx, ts-node, pypy3, ...) slipped past and, if an operator allowlisted one, re-opened the prompt-injection-controlled MCP registration path. - Canonicalize a trailing version suffix before the deny check so versioned forms collapse to the family (python3.11 -> python, node18 -> node, pip3 -> pip); both the raw basename and the canonical form are denied. - Broaden the denied-family set (java/javac/jshell/jbang/kotlin/dotnet/mono/ swift/osascript/tsx/ts-node/bunx/pypy/jruby/raku/luajit/wish/expect/iex). Deny runs before the operator allowlist, so an alias cannot be allowlisted back in. Canonicalization only feeds the deny check, so a legit name that ends in a digit still reaches the normal allowlist check rather than being mis-denied. Adds validator + integration regressions for versioned/alias runtimes asserting no DB row and no connection, including the allowlisted-anyway case.	2026-06-16 14:34:53 +00:00
Catalin Iliescu	9a00401507	fix(hwfit): use CPU fallback for cpu_only speed estimates (#4397 ) * fix(hwfit): use CPU fallback for cpu_only speed estimates * fix(hwfit): preserve ARM fallback for cpu_only estimates --------- Co-authored-by: Cata <cata@bigjohn.local>	2026-06-16 14:18:31 +00:00
Aura Rays Lab	76562ae31d	Change host from 0.0.0.0 to 127.0.0.1 in CONTRIBUTING.md (#4422 ) Updated the host address in the run command for clarity.	2026-06-16 13:40:47 +00:00
Christian Eriksson	497f455da6	fix(cookbook): open() no longer crashes when a task has a diagnosis (#4417 ) _showDiagnosis referenced an undefined `body` (left over from the refactor that moved the diagnosis text into the toolbar), throwing a ReferenceError whenever a failed task rendered fix buttons. Because open() wraps its render in try/finally with no catch, the throw escaped before the modal was un-hidden, so the whole Cookbook silently failed to open. - cookbook-diagnosis.js: append the fixes row to `diag` (the in-scope container) instead of the removed `body` element. - cookbook.js: guard the render passes in open() so one broken task card can't leave the entire panel stuck hidden. Fixes #4406	2026-06-16 13:35:51 +00:00
Ashvin	dd20c2bc75	fix(tasks): offer shell/file tools to scheduled task agents by default (#4398 ) The scheduled-task runner built the agent's tool set from RAG retrieval plus ASSISTANT_ALWAYS_AVAILABLE. Neither includes bash/python (nor the file tools), and no keyword hint force-includes them, so a task only saw the shell when the tool-embedding index happened to surface it. On hosts where that index is empty or degraded (e.g. a fresh Docker deploy), retrieval returns nothing and the task agent never receives bash/python — telling the user the shell is disabled even for an admin owner. Offer the shell/file group to task agents by default, mirroring the chat agent where these are on unless a privilege or global setting turns them off. The existing blocked_tools_for_owner() gate in stream_agent_loop still strips the whole group for non-admin multi-user owners and only admits it for admins and single-user (AUTH_ENABLED=false) deployments, so this changes what is offered, not who is allowed. A crew that defines an explicit enabled_tools allowlist still has its restriction honored. Also merge the operator's global disabled_tools setting into the scheduler's disabled set before composing relevant_tools and before entering the agent loop, matching what chat already does. Without it, the global tool-disable contract did not reach unattended scheduled tasks: an admin or AUTH_ENABLED=false task could still see and call shell/file tools the operator had turned off globally, since the prompt/schema/execution gates only enforce the disabled tools passed in.	2026-06-16 13:27:30 +00:00
Afonso Coutinho	a36b423a4e	Fix odysseus-calendar list dropping in-progress / multi-day events (#2065 ) cmd_list filtered on the event START falling inside the window (dtstart >= start AND dtstart < end). The canonical web route (routes/calendar_routes.py) and the recurrence contract test use OVERLAP semantics for non-recurring events: dtstart < end AND dtend > start. So an event that began before the window but is still ongoing inside it — e.g. a 09:00-17:00 conference listed at 14:00, or any multi-day event spanning the window — was silently dropped by the CLI even though the web UI shows it. Use overlap, matching the route. dtend is NOT NULL in the schema, so no null-end regression.	2026-06-16 14:04:56 +02:00
Rudy Wolf	4e477741e7	harden(agent-loop): wrap non-native tool results as untrusted data (#1629 ) The non-native (prompted) tool-call path fed tool output back to the model as a plain "[Tool execution results]" user message, bypassing the untrusted_context_message wrapper that THREAT_MODEL.md requires for tool output. That path is what models without native tool-calling (many smaller local models) use, so prompt-injection inside a tool result (fetched page, file read, MCP/email output) could be read as instructions there. Wrap it via untrusted_context_message("tool execution results", ...), the same hardening already applied to skills (#788) and escalation traces (#275). Also update _recent_context_for_retrieval, which used the old "[Tool execution results]" prefix as a sentinel to keep tool envelopes out of the retrieval query, to recognise the wrapped envelope via metadata.trusted. The native path keeps returning tool-role messages (a user-role wrapper would break the native tool-call contract); it is covered by UNTRUSTED_CONTEXT_POLICY. Adds tests/test_tool_output_prompt_injection.py. Fixes #1627. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 13:35:07 +02:00
Kenny Van de Maele	a2261c38c1	refactor(auth): centralize the internal-tool pseudo-username into a constant (#4333 ) The in-process tool loopback stamps current_user = "internal-tool" and require_admin grants admin to that sentinel; it is also a reserved username. That security-sensitive string was hand-typed in ~7 places (stamp, admin gate, RESERVED_USERNAMES, and standalone admin-equivalent checks in note/research/ shell/task routes), where a typo silently breaks an auth gate. Add INTERNAL_TOOL_USER in core/middleware.py next to INTERNAL_TOOL_TOKEN/ INTERNAL_TOOL_HEADER and use it at every such site. A typo is now an ImportError, not a silent mismatch. auth.py importing middleware is acyclic (middleware imports no app modules). Behaviour is unchanged. The multi-sentinel sets bundling internal-tool with api/demo/system (assistant_routes, task_scheduler, research_routes) are a separate reserved-set dedup, left for a follow-up. Closes #4332	2026-06-16 13:13:00 +02:00
Alexandre Teixeira	bf56010aad	test: split provider classification tests (#4392 )	2026-06-16 09:54:07 +00:00
Karl Jussila	ee72d71872	fix(auth): centralize password and username validation constants (#4120 ) Added PASSWORD_MIN_LENGTH and RESERVED_USERNAMES to src/constants.py as the single source of truth. Previously PASSWORD_MIN_LENGTH was hardcoded as 8 in four route handlers and all three JS validation paths; RESERVED_USERNAMES was an inline frozenset duplicated in core/auth.py, routes/assistant_routes.py, routes/research_routes.py, and src/task_scheduler.py. Added GET /api/auth/policy (unauthenticated) so the frontend reads the real values from the server instead of hardcoding them in JS. Added missing empty-username guard to /setup and admin POST /users. Both returned a misleading 500/409 on whitespace-only input. /signup already had the check; this makes all three consistent.	2026-06-16 09:52:15 +02:00
RaresKeY	2b519bf355	fix(routes): normalize session owner fallback helpers (#4313 ) * fix(memory): normalize import session fallback * fix(chat): use token owner for compaction scope * fix(background): honor session endpoint fallback	2026-06-16 06:07:42 +01:00
Kfir Sadeh	d795d9a923	feat(launcher): add portable windows launcher (#976 ) * feat(windows): add standalone portable executable, splash screen, and system tray * test: fix test_get_wsl_windows_user_profile_falls_back_to_users_dir on Windows * Refactor launcher: isolate desktop logic into launcher.py, clean app.py/requirements, update build scripts, and add tests * chore: clean launcher test whitespace --------- Co-authored-by: Alexandre Teixeira <alexandremagteixeira@gmail.com>	2026-06-16 04:58:16 +01:00
Tal.Yuan	648db61b45	docs(architecture): add Phase 0 runtime inventory document (#4148 ) * docs(architecture): add Phase 0 runtime inventory document Per #4082 requirements, this no-code planning document maps: - Largest runtime modules (Python + frontend) - Import dependency graph and cross-layer violations - Route ownership grouped by feature domain - Tool registry boundaries and split candidates - Risk-ranked candidate slices with recommended first 3 PRs - Safety guardrails and validation commands for follow-up work Closes #4082 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * docs(architecture): correct inventory metrics per review feedback Address @alteixeira20 review on #4148 (CHANGES_REQUESTED): - src/ flat .py: ~60 -> 91; routes/: 52 -> 54 - core/database.py importers: 49 -> 94; src/agent_loop.py: -> 21 - src/ -> routes/ import lines: ~20 -> 38 - src/ subdirs: 3 -> 2 (agent_tools/, search/); drop non-existent agent/ - move main.py and src/agent/ out of current-structure into new section 10 'Future Direction (NOT current state)' - route grouping: frame as one domain per PR, not a broad reorganization (helper imports / registration / test path risk) * docs(architecture): round-2 fixes — move to specs/, correct counts, frame as candidate Per @alteixeira20 + @RaresKeY review on #4148: - Move docs/architecture-runtime-inventory.md -> specs/ (docs/ is GitHub Pages public content, per @RaresKeY) - src/ -> routes/ import lines: 38 -> 30 (direct grep of import lines referencing routes/, matching reviewer's count) - self-caught count drift: tests 552 -> 544; routes->src 349 -> 351; src->core 49 -> 99 - frame section 6 (rankings/package shapes/split order/route grouping) and section 10 (future direction) as candidate proposals pending maintainer agreement, not a committed plan (per @RaresKeY) * docs(architecture): round-3 reviewer fixes — fix tool categorization, counts, appendix Self-review as reviewer found: - §5.2 tool categories were wrong: listed filesystem/shell/email-sending tools that are NOT in tool_implementations.py (they live in src/agent_tools/). Rewrote to the actual 33 do_* functions grouped by domain (system/cookbook/calendar/notes/search/research/contacts/vault/image) - §2.1 builtin_actions.py: 0 -> 2 classes, ~26 -> ~24 functions - §5.1: '33+' -> '33' (exact count) - Appendix A: 'Complete File Listing' -> 'File Listing'; src noted as '61 of 91 shown' (was claiming complete but listed 61) - Last updated date refreshed * docs(architecture): round-4 — verify remaining counts, soften §6.3 framing - task_scheduler ~6 -> 5 funcs; tool_index ~580 -> 542 lines (verified vs dev) - §6.3 'Recommended First 3 Slices' -> 'Candidate' (ownership unsettled, per review) - verified §4 route-domain line counts, §2.2 frontend counts, mcp_servers=4 - full test suite: 3267 passed, 1 skipped, 0 failed * docs(architecture): refresh Phase 0 inventory metrics + document counting method Refresh every count against current dev (`b58af42`) per review on #4148: - src/ flat .py: 91 -> 95; tests/test_.py: 544 -> 583 - core.database importers: 94 -> 102; src.agent_loop importers: 21 -> 22 - src/ -> routes/ lines: 30 -> 31; routes/ -> src/: 351 -> 374; src/ -> core/: 99 -> 106 - Last updated: dev@9d7a3d6 -> dev@b58af42 Add a "How the metrics are computed" note under section 3.4 with the exact grep/find command for each count, so the numbers are reproducible and future dev drift is a one-command recheck instead of another review round (per the request to note the counting method). Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> docs(architecture): refresh remaining counts + add snapshot basis note Reviewer self-audit of the previous refresh caught more stale counts after the rebase onto dev@b58af42: - tool_implementations importers: 18 -> 17 (§3.2, §6.2, Appendix B) - core/database classes: 27 -> 28 (§2.1, §6.2) - mcp_servers .py files: 4 -> 5 (§1.1) - routes/ -> core/ import lines: 124 -> 126 (§3.4) Line counts in §2.1/§2.2 also drifted over the rebased range but are left as-is and covered by a new "Snapshot basis" note in the header: line counts are a snapshot that drifts as dev moves (recompute with wc -l), while the importer/file/import-line counts are the authoritative ones refreshed here. This keeps the inventory honest about live metric vs structural snapshot, so dev drift no longer triggers a review round. Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> * docs(architecture): fix missed tool_implementations importer count in §6.3 Follow-up to the previous refresh: §6.3 Slice 1 still read "18 importers" after the 18->17 update elsewhere. Correct to 17 for consistency. Doc-only. Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: yuandonghao <yuandonghao@cohl.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-16 04:57:24 +01:00
RaresKeY	260ce8ba59	fix(email): enforce MCP owner boundaries (#4335 ) * fix(email): enforce MCP owner boundaries * fix(email): fail closed for unowned MCP fallback	2026-06-16 04:31:24 +01:00
RaresKeY	2f9ae43a58	test(email): cover sender signature owner cache writes (#4278 )	2026-06-16 04:21:11 +01:00
RaresKeY	293bbfabf4	test(hwfit): cover SSH target validation regressions (#4279 )	2026-06-16 04:18:21 +01:00
Alexandre Teixeira	0086399656	test: add fire_and_forget to API chat webhook stub (#4383 )	2026-06-16 03:15:14 +00:00
RaresKeY	9d2989f386	test(auth): cover reserved username sentinel gate (#4276 )	2026-06-16 04:09:58 +01:00
RaresKeY	b5edbd3df7	fix(devops): harden docker config defaults (#4349 )	2026-06-16 04:03:43 +01:00
RaresKeY	33fe7276be	fix(endpoints): normalize URL handling (#4338 )	2026-06-16 03:59:18 +01:00
RaresKeY	a031a94a2e	fix(cookbook): harden remote serve host handling (#4345 )	2026-06-16 03:46:32 +01:00
RaresKeY	4d10c16d02	fix(auth): clean up rename and null-owner ownership (#4340 )	2026-06-16 03:33:02 +01:00
RaresKeY	745c10e0d7	fix(gallery): confine gallery image path resolution (#4352 )	2026-06-16 03:28:09 +01:00
Alexandre Teixeira	6b7a4c1e70	test: add oversized test split plan (#3987 ) * test: add oversized test split plan * test: refresh oversized split plan	2026-06-16 02:28:03 +00:00
RaresKeY	422f23fb12	fix(mcp): scope memory server by owner (#4315 )	2026-06-16 03:18:17 +01:00
TheDragonTail	0f966d6b9f	fix(embeddings): fall back to default cache dir when FASTEMBED_CACHE_PATH is empty (#3434 ) docker-compose.yml injects FASTEMBED_CACHE_PATH=${FASTEMBED_CACHE_PATH:-}, which sets the variable to an empty string when the host has not defined it. FASTEMBED_CACHE_DIR used os.getenv("FASTEMBED_CACHE_PATH", default), and os.getenv only returns the default when the variable is ABSENT -- so the empty value won and FASTEMBED_CACHE_DIR became "". os.makedirs("") then raised [Errno 2] No such file or directory: '', FastEmbed failed to initialise, and every vector feature (RAG, semantic memory, tool index) silently degraded on the default Docker stack. Treat an empty value like an absent one via `os.getenv(...) or default`. Add a regression test covering the empty, unset, and explicit cases. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 03:11:48 +01:00
Afonso Coutinho	7b09491557	fix: check-in calendar digest leaks every user's events (missing owner scope) (#1925 ) * fix: check-in calendar digest leaks every user's events (no owner scope) * Seed dtend on calendar events in digest test so the NOT NULL column is satisfied	2026-06-16 02:42:41 +01:00

1 2 3 4 5 ...

1571 Commits