odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-17 02:05:22 -04:00

Author	SHA1	Message	Date
Christian Eriksson	497f455da6	fix(cookbook): open() no longer crashes when a task has a diagnosis (#4417 ) _showDiagnosis referenced an undefined `body` (left over from the refactor that moved the diagnosis text into the toolbar), throwing a ReferenceError whenever a failed task rendered fix buttons. Because open() wraps its render in try/finally with no catch, the throw escaped before the modal was un-hidden, so the whole Cookbook silently failed to open. - cookbook-diagnosis.js: append the fixes row to `diag` (the in-scope container) instead of the removed `body` element. - cookbook.js: guard the render passes in open() so one broken task card can't leave the entire panel stuck hidden. Fixes #4406	2026-06-16 13:35:51 +00:00
Ashvin	dd20c2bc75	fix(tasks): offer shell/file tools to scheduled task agents by default (#4398 ) The scheduled-task runner built the agent's tool set from RAG retrieval plus ASSISTANT_ALWAYS_AVAILABLE. Neither includes bash/python (nor the file tools), and no keyword hint force-includes them, so a task only saw the shell when the tool-embedding index happened to surface it. On hosts where that index is empty or degraded (e.g. a fresh Docker deploy), retrieval returns nothing and the task agent never receives bash/python — telling the user the shell is disabled even for an admin owner. Offer the shell/file group to task agents by default, mirroring the chat agent where these are on unless a privilege or global setting turns them off. The existing blocked_tools_for_owner() gate in stream_agent_loop still strips the whole group for non-admin multi-user owners and only admits it for admins and single-user (AUTH_ENABLED=false) deployments, so this changes what is offered, not who is allowed. A crew that defines an explicit enabled_tools allowlist still has its restriction honored. Also merge the operator's global disabled_tools setting into the scheduler's disabled set before composing relevant_tools and before entering the agent loop, matching what chat already does. Without it, the global tool-disable contract did not reach unattended scheduled tasks: an admin or AUTH_ENABLED=false task could still see and call shell/file tools the operator had turned off globally, since the prompt/schema/execution gates only enforce the disabled tools passed in.	2026-06-16 13:27:30 +00:00
Afonso Coutinho	a36b423a4e	Fix odysseus-calendar list dropping in-progress / multi-day events (#2065 ) cmd_list filtered on the event START falling inside the window (dtstart >= start AND dtstart < end). The canonical web route (routes/calendar_routes.py) and the recurrence contract test use OVERLAP semantics for non-recurring events: dtstart < end AND dtend > start. So an event that began before the window but is still ongoing inside it — e.g. a 09:00-17:00 conference listed at 14:00, or any multi-day event spanning the window — was silently dropped by the CLI even though the web UI shows it. Use overlap, matching the route. dtend is NOT NULL in the schema, so no null-end regression.	2026-06-16 14:04:56 +02:00
Rudy Wolf	4e477741e7	harden(agent-loop): wrap non-native tool results as untrusted data (#1629 ) The non-native (prompted) tool-call path fed tool output back to the model as a plain "[Tool execution results]" user message, bypassing the untrusted_context_message wrapper that THREAT_MODEL.md requires for tool output. That path is what models without native tool-calling (many smaller local models) use, so prompt-injection inside a tool result (fetched page, file read, MCP/email output) could be read as instructions there. Wrap it via untrusted_context_message("tool execution results", ...), the same hardening already applied to skills (#788) and escalation traces (#275). Also update _recent_context_for_retrieval, which used the old "[Tool execution results]" prefix as a sentinel to keep tool envelopes out of the retrieval query, to recognise the wrapped envelope via metadata.trusted. The native path keeps returning tool-role messages (a user-role wrapper would break the native tool-call contract); it is covered by UNTRUSTED_CONTEXT_POLICY. Adds tests/test_tool_output_prompt_injection.py. Fixes #1627. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 13:35:07 +02:00
Kenny Van de Maele	a2261c38c1	refactor(auth): centralize the internal-tool pseudo-username into a constant (#4333 ) The in-process tool loopback stamps current_user = "internal-tool" and require_admin grants admin to that sentinel; it is also a reserved username. That security-sensitive string was hand-typed in ~7 places (stamp, admin gate, RESERVED_USERNAMES, and standalone admin-equivalent checks in note/research/ shell/task routes), where a typo silently breaks an auth gate. Add INTERNAL_TOOL_USER in core/middleware.py next to INTERNAL_TOOL_TOKEN/ INTERNAL_TOOL_HEADER and use it at every such site. A typo is now an ImportError, not a silent mismatch. auth.py importing middleware is acyclic (middleware imports no app modules). Behaviour is unchanged. The multi-sentinel sets bundling internal-tool with api/demo/system (assistant_routes, task_scheduler, research_routes) are a separate reserved-set dedup, left for a follow-up. Closes #4332	2026-06-16 13:13:00 +02:00
Alexandre Teixeira	bf56010aad	test: split provider classification tests (#4392 )	2026-06-16 09:54:07 +00:00
Karl Jussila	ee72d71872	fix(auth): centralize password and username validation constants (#4120 ) Added PASSWORD_MIN_LENGTH and RESERVED_USERNAMES to src/constants.py as the single source of truth. Previously PASSWORD_MIN_LENGTH was hardcoded as 8 in four route handlers and all three JS validation paths; RESERVED_USERNAMES was an inline frozenset duplicated in core/auth.py, routes/assistant_routes.py, routes/research_routes.py, and src/task_scheduler.py. Added GET /api/auth/policy (unauthenticated) so the frontend reads the real values from the server instead of hardcoding them in JS. Added missing empty-username guard to /setup and admin POST /users. Both returned a misleading 500/409 on whitespace-only input. /signup already had the check; this makes all three consistent.	2026-06-16 09:52:15 +02:00
RaresKeY	2b519bf355	fix(routes): normalize session owner fallback helpers (#4313 ) * fix(memory): normalize import session fallback * fix(chat): use token owner for compaction scope * fix(background): honor session endpoint fallback	2026-06-16 06:07:42 +01:00
Kfir Sadeh	d795d9a923	feat(launcher): add portable windows launcher (#976 ) * feat(windows): add standalone portable executable, splash screen, and system tray * test: fix test_get_wsl_windows_user_profile_falls_back_to_users_dir on Windows * Refactor launcher: isolate desktop logic into launcher.py, clean app.py/requirements, update build scripts, and add tests * chore: clean launcher test whitespace --------- Co-authored-by: Alexandre Teixeira <alexandremagteixeira@gmail.com>	2026-06-16 04:58:16 +01:00
Tal.Yuan	648db61b45	docs(architecture): add Phase 0 runtime inventory document (#4148 ) * docs(architecture): add Phase 0 runtime inventory document Per #4082 requirements, this no-code planning document maps: - Largest runtime modules (Python + frontend) - Import dependency graph and cross-layer violations - Route ownership grouped by feature domain - Tool registry boundaries and split candidates - Risk-ranked candidate slices with recommended first 3 PRs - Safety guardrails and validation commands for follow-up work Closes #4082 Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * docs(architecture): correct inventory metrics per review feedback Address @alteixeira20 review on #4148 (CHANGES_REQUESTED): - src/ flat .py: ~60 -> 91; routes/: 52 -> 54 - core/database.py importers: 49 -> 94; src/agent_loop.py: -> 21 - src/ -> routes/ import lines: ~20 -> 38 - src/ subdirs: 3 -> 2 (agent_tools/, search/); drop non-existent agent/ - move main.py and src/agent/ out of current-structure into new section 10 'Future Direction (NOT current state)' - route grouping: frame as one domain per PR, not a broad reorganization (helper imports / registration / test path risk) * docs(architecture): round-2 fixes — move to specs/, correct counts, frame as candidate Per @alteixeira20 + @RaresKeY review on #4148: - Move docs/architecture-runtime-inventory.md -> specs/ (docs/ is GitHub Pages public content, per @RaresKeY) - src/ -> routes/ import lines: 38 -> 30 (direct grep of import lines referencing routes/, matching reviewer's count) - self-caught count drift: tests 552 -> 544; routes->src 349 -> 351; src->core 49 -> 99 - frame section 6 (rankings/package shapes/split order/route grouping) and section 10 (future direction) as candidate proposals pending maintainer agreement, not a committed plan (per @RaresKeY) * docs(architecture): round-3 reviewer fixes — fix tool categorization, counts, appendix Self-review as reviewer found: - §5.2 tool categories were wrong: listed filesystem/shell/email-sending tools that are NOT in tool_implementations.py (they live in src/agent_tools/). Rewrote to the actual 33 do_* functions grouped by domain (system/cookbook/calendar/notes/search/research/contacts/vault/image) - §2.1 builtin_actions.py: 0 -> 2 classes, ~26 -> ~24 functions - §5.1: '33+' -> '33' (exact count) - Appendix A: 'Complete File Listing' -> 'File Listing'; src noted as '61 of 91 shown' (was claiming complete but listed 61) - Last updated date refreshed * docs(architecture): round-4 — verify remaining counts, soften §6.3 framing - task_scheduler ~6 -> 5 funcs; tool_index ~580 -> 542 lines (verified vs dev) - §6.3 'Recommended First 3 Slices' -> 'Candidate' (ownership unsettled, per review) - verified §4 route-domain line counts, §2.2 frontend counts, mcp_servers=4 - full test suite: 3267 passed, 1 skipped, 0 failed * docs(architecture): refresh Phase 0 inventory metrics + document counting method Refresh every count against current dev (`b58af42`) per review on #4148: - src/ flat .py: 91 -> 95; tests/test_.py: 544 -> 583 - core.database importers: 94 -> 102; src.agent_loop importers: 21 -> 22 - src/ -> routes/ lines: 30 -> 31; routes/ -> src/: 351 -> 374; src/ -> core/: 99 -> 106 - Last updated: dev@9d7a3d6 -> dev@b58af42 Add a "How the metrics are computed" note under section 3.4 with the exact grep/find command for each count, so the numbers are reproducible and future dev drift is a one-command recheck instead of another review round (per the request to note the counting method). Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> docs(architecture): refresh remaining counts + add snapshot basis note Reviewer self-audit of the previous refresh caught more stale counts after the rebase onto dev@b58af42: - tool_implementations importers: 18 -> 17 (§3.2, §6.2, Appendix B) - core/database classes: 27 -> 28 (§2.1, §6.2) - mcp_servers .py files: 4 -> 5 (§1.1) - routes/ -> core/ import lines: 124 -> 126 (§3.4) Line counts in §2.1/§2.2 also drifted over the rebased range but are left as-is and covered by a new "Snapshot basis" note in the header: line counts are a snapshot that drifts as dev moves (recompute with wc -l), while the importer/file/import-line counts are the authoritative ones refreshed here. This keeps the inventory honest about live metric vs structural snapshot, so dev drift no longer triggers a review round. Documentation-only; no code changes. Co-Authored-By: Claude <noreply@anthropic.com> * docs(architecture): fix missed tool_implementations importer count in §6.3 Follow-up to the previous refresh: §6.3 Slice 1 still read "18 importers" after the 18->17 update elsewhere. Correct to 17 for consistency. Doc-only. Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: yuandonghao <yuandonghao@cohl.com> Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-16 04:57:24 +01:00
RaresKeY	260ce8ba59	fix(email): enforce MCP owner boundaries (#4335 ) * fix(email): enforce MCP owner boundaries * fix(email): fail closed for unowned MCP fallback	2026-06-16 04:31:24 +01:00
RaresKeY	2f9ae43a58	test(email): cover sender signature owner cache writes (#4278 )	2026-06-16 04:21:11 +01:00
RaresKeY	293bbfabf4	test(hwfit): cover SSH target validation regressions (#4279 )	2026-06-16 04:18:21 +01:00
Alexandre Teixeira	0086399656	test: add fire_and_forget to API chat webhook stub (#4383 )	2026-06-16 03:15:14 +00:00
RaresKeY	9d2989f386	test(auth): cover reserved username sentinel gate (#4276 )	2026-06-16 04:09:58 +01:00
RaresKeY	b5edbd3df7	fix(devops): harden docker config defaults (#4349 )	2026-06-16 04:03:43 +01:00
RaresKeY	33fe7276be	fix(endpoints): normalize URL handling (#4338 )	2026-06-16 03:59:18 +01:00
RaresKeY	a031a94a2e	fix(cookbook): harden remote serve host handling (#4345 )	2026-06-16 03:46:32 +01:00
RaresKeY	4d10c16d02	fix(auth): clean up rename and null-owner ownership (#4340 )	2026-06-16 03:33:02 +01:00
RaresKeY	745c10e0d7	fix(gallery): confine gallery image path resolution (#4352 )	2026-06-16 03:28:09 +01:00
Alexandre Teixeira	6b7a4c1e70	test: add oversized test split plan (#3987 ) * test: add oversized test split plan * test: refresh oversized split plan	2026-06-16 02:28:03 +00:00
RaresKeY	422f23fb12	fix(mcp): scope memory server by owner (#4315 )	2026-06-16 03:18:17 +01:00
TheDragonTail	0f966d6b9f	fix(embeddings): fall back to default cache dir when FASTEMBED_CACHE_PATH is empty (#3434 ) docker-compose.yml injects FASTEMBED_CACHE_PATH=${FASTEMBED_CACHE_PATH:-}, which sets the variable to an empty string when the host has not defined it. FASTEMBED_CACHE_DIR used os.getenv("FASTEMBED_CACHE_PATH", default), and os.getenv only returns the default when the variable is ABSENT -- so the empty value won and FASTEMBED_CACHE_DIR became "". os.makedirs("") then raised [Errno 2] No such file or directory: '', FastEmbed failed to initialise, and every vector feature (RAG, semantic memory, tool index) silently degraded on the default Docker stack. Treat an empty value like an absent one via `os.getenv(...) or default`. Add a regression test covering the empty, unset, and explicit cases. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 03:11:48 +01:00
Afonso Coutinho	7b09491557	fix: check-in calendar digest leaks every user's events (missing owner scope) (#1925 ) * fix: check-in calendar digest leaks every user's events (no owner scope) * Seed dtend on calendar events in digest test so the NOT NULL column is satisfied	2026-06-16 02:42:41 +01:00
Kenny Van de Maele	fafaf089c5	refactor(search): centralize the web-scraping User-Agent into one constant (#4325 ) The outbound UA for web_fetch / web_search was inlined in four places with two different values and nothing keeping them current: content.py pinned a mid-2021 Chrome 91 build, and providers.py sent a bare Mozilla/5.0 in three spots. Some sites serve a degraded or blocked page to a UA that old. Add WEB_FETCH_USER_AGENT to src/constants.py (env-overridable, matching the existing Copilot/Kimi UA-constant pattern) and import it in content.py and providers.py. Default to a current, common desktop UA so pages return their normal HTML: the market-leading desktop OS (Windows; NT 10.0 covers Windows 10 and 11) and browser (Chrome) on a current stable build. The version is now bumped in one place. Service-specific self-identifying agents (Copilot, Kimi, webhooks, cookbook) are intentionally left separate. Adds a regression pinning the constant shape, the env override, and a guard against a new inline Mozilla literal in the search sources. Closes #4324	2026-06-16 01:33:47 +00:00
RaresKeY	b58af4267b	fix(companion): require chat scope for model inventory (#4319 )	2026-06-16 01:15:05 +02:00
AkioKoneko	8ff76f083c	fix(cookbook): avoid launching Ollama during Windows cache scan (#4368 )	2026-06-16 01:00:40 +02:00
Wei Hong	2196869c86	fix(webhooks): route public emitters through fire_and_forget (#3964 ) (#4336 ) The three public webhook emitters in chat_helpers and webhook_routes schedule deliveries via asyncio.create_task(webhook_manager.fire(...)), which bypasses WebhookManager._bg_tasks. asyncio only holds a weak reference to the outer task, so the GC can collect it mid-delivery and the webhook is silently dropped. Route all three through webhook_manager.fire_and_forget() so the task is tracked by _spawn_tracked() and the manager owns the full lifecycle. Adds an AST-level guard test that scans routes/ for direct asyncio.create_task wrapping webhook_manager.fire(...) to prevent regressions.	2026-06-16 00:41:45 +02:00
holden093	dd2e23c9af	fix(agent): report phone numbers from resolve_contact when a matched contact has no email (#4327 ) When a CardDAV contact matched the search query but had no email address (only phone numbers), the tool silently dropped it and returned 'No contacts found'. Fall back to the contact's phone number(s) so the caller still receives usable information. Refs: #4178 (the contacts-domain classifier fix that made the model actually call resolve_contact for contacts queries, surfacing this pre-existing gap)	2026-06-16 00:03:33 +02:00
Fahim	facc50cb0f	fix(api): attribute bearer-token actions to the token owner on owner-scoped routes (#4054 ) * fix(api): attribute bearer-token actions to the token owner on owner-scoped routes Owner-scoped chat, session, and upload routes called get_current_user(), which resolves a bearer ody_ API token to the sandboxed "api" pseudo-user. A paired API-token client (companion, CLI, IDE extension) therefore saw and created a separate "api"-owned silo instead of the owner's data. effective_user() already exists for exactly this: it attributes a token's actions to request.state.api_token_owner, is identical to get_current_user() for cookie sessions, and falls back safely when a token has no owner. session_routes.py was already migrated; this completes the migration for the remaining owner-scoped routes: - chat_helpers.py: chat-privilege enforcement, message attribution, prefs/context - chat_routes.py: orphaned-endpoint owner, session-auth owner, message search - upload_routes.py: upload owner attribution + access checks The /api/models swap is intentionally omitted: #4292 already migrated it to effective_user (plus the chat-scope gate and ownerless-token 403), so this PR keeps dev's version of routes/model_routes.py unchanged. chat_routes.py keeps importing get_current_user for the workspace owner gate; session_routes.py drops the now-unused import. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * test: target effective_user in auth monkeypatches and owner-scope assertion The owner-scoped routes now call effective_user() instead of get_current_user(), so the tests that stubbed get_current_user (or asserted on it) follow suit: - test_chat_helpers.py, test_review_regressions.py, test_kv_cache_invalidation_2927.py: monkeypatch effective_user - test_session_endpoint_owner_scope.py: assert the owner-scope guard uses effective_user(request) Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-15 23:56:22 +02:00
Kenny Van de Maele	074a1e6eff	fix(search): add download budgets to web_fetch with truncation notice and hard ceiling (#3955 ) * fix(search): add download budgets to web_fetch with truncation notice and hard ceiling MAX_OUTPUT_CHARS only trims what the agent sees; fetch_webpage_content buffered and cached the entire response body first, so a large or hostile URL could pull arbitrarily many bytes into memory and the content cache. The fetch is now a capped streaming GET (SSRF redirect guard unchanged): a soft default budget (WEB_FETCH_SOFT_MAX_BYTES, 2 MB), a per-call override via full/max_bytes on the web_fetch tool, and a hard ceiling (WEB_FETCH_HARD_MAX_BYTES, 20 MB) that the override can never exceed. When Content-Length already declares a body over the ceiling the fetch is refused before any body bytes are buffered. Truncated results carry truncated/fetched_bytes/total_bytes, the tool output leads with a partial-content notice telling the model how to re-fetch with full=true, and the tool schema documents the flag. A truncated PDF is reported as a budget error since a cut PDF is unparseable. The effective cap is part of the content-cache key so a truncated fetch is never served to a full-budget request. Existing tests that faked httpx.get or the old _get_public_url signature are adapted to the streaming interface; behavior pins are unchanged. Fixes #3812 * fix(search): close compressed-body cap bypass and protect the partial notice Addresses RaresKeY's review on #3955: - Force Accept-Encoding: identity for the capped fetch. With gzip/deflate the wire bytes (and Content-Length) can be a fraction of the decoded body, so a tiny compressed response could pass the hard-cap preflight and then expand past the ceiling in a single decoded chunk before the streamed cap could slice it. Identity makes Content-Length the true body size and keeps each streamed chunk bounded by the network read, so the hard ceiling actually bounds memory. - Lead web_fetch output with the partial-content notice and cap the page title. The notice is the user-facing contract for partial fetches, but the title is untrusted, uncapped page content; placed ahead of the notice a giant title could push it past MAX_OUTPUT_CHARS and drop it. The notice now leads and the title is capped as a second guard. Adds regressions: the fetch advertises identity encoding, and a truncated result with an oversized title still surfaces the partial notice. * fix(search): reject compressed responses that ignore the identity request Requesting Accept-Encoding: identity is not enough on its own: a server can ignore it and still return Content-Encoding: gzip, and httpx.iter_bytes would decode that, so a tiny compressed body could balloon into one decoded chunk far past the hard cap before the streamed loop slices it (and Content-Length, the compressed wire length, makes the preflight and size metadata unreliable). Refuse a non-identity Content-Encoding before reading the body. Adds a regression where the server ignores the identity request and returns gzip; the fetch is refused before any body is decoded.	2026-06-15 17:38:09 +00:00
Kenny Van de Maele	2fab378c6a	refactor(search): import REQUEST_TIMEOUT from constants in providers.py (#4331 ) providers.py redefined REQUEST_TIMEOUT = 20 locally, shadowing the same value in src/constants.py and risking drift if the constant is bumped. Import it from src.constants and drop the local copy; same value, one source of truth. Closes #4329	2026-06-15 17:22:08 +00:00
Michael	5bafc30622	fix(api): normalize non-object JSON bodies to empty dict in token PATCH (#3976 ) * fix(api): normalize non-object JSON bodies to empty dict in token PATCH Valid non-dict JSON (e.g. [] or null) reaches payload.get(...) and raises AttributeError. Normalize to {} so the route returns a controlled response instead of an unhandled 500. Fixes #3966 * test(api): add regression tests for PATCH with non-object JSON bodies Covers array body ([]), null body, and normal object body as requested in alteixeira20's review of #3976. --------- Co-authored-by: michaelxer <michaelxer@users.noreply.github.com>	2026-06-15 18:05:15 +01:00
darius-f96	d6d2e17214	fix(hwfit): add GB10 unified-memory bandwidth so speed scores are real (#4270 ) NVIDIA Grace Blackwell GB10 / DGX Spark was missing from GPU_BANDWIDTH, so _lookup_bandwidth() returned None for it and _estimate_speed() fell through to the crude FALLBACK_K path (k/active-params). That over-stated tok/s and let speed scores saturate regardless of the box's real ~273 GB/s LPDDR5X pool — distorting model ranking on these 128GB unified-memory rigs. Add "gb10": 273 (GB/s). nvidia-smi reports the device name as "NVIDIA GB10", which substring-matches the new key, so detected GB10 boxes now estimate speed from the real bandwidth instead of the fallback.	2026-06-15 18:55:15 +02:00
Lucas Daniel	f4e8990635	chore: add warnings to silent except Exception blocks (#3212 ) * log(app): add warnings to silent except Exception blocks - Internal tool auth header failure now logs a warning instead of silently passing, making auth bypass easier to spot in logs. - Token last_used_at update failure now logs at DEBUG (fire-and-forget, non-critical, but useful when debugging token tracking issues). - Image ownership verification failure now logs a warning so unexpected access-check errors surface instead of silently allowing the request. * log(chat_routes): add warnings to silent except Exception blocks - clear_orphaned_session_endpoint: log before rollback so failures appear in traces when users see stale/deleted model options. - _endpoint_has_model (JSON parse): log malformed cached_models instead of silently treating endpoint as valid. - _has_any_visible_model (JSON parse): log malformed cached_models instead of silently returning empty list. - timezone header parse: log failure so time-zone-related tool bugs (wrong scheduled times, calendar events) are traceable. - attachments JSON parse: log failure so silently-dropped attachments are visible in server logs. * log(email_routes): add warnings to silent except Exception blocks - Email alias resolution failure now logs a warning instead of silently returning an empty list, making broken account configs diagnosable. * log(document_routes): add warnings to silent except Exception blocks - Export ZIP request body parse failure now logs a warning so empty exports caused by malformed requests are diagnosable. - clear_active_document failure on detach now logs a warning to help trace doc re-injection bugs like #1160. * log(agent_loop): add warnings to silent except Exception blocks - builtin tool overrides load failure now logs a warning so misconfigured settings don't silently fall back to defaults without a trace. - Timezone context injection failure now logs a warning to help debug incorrect scheduled times in agent-created tasks. - PDF form-backed document detection failure now logs a warning so broken form-doc UI is traceable to the root cause. * log(llm_core): add warnings to silent except Exception blocks - Malformed URL in _is_ollama_native_url now logs a warning so bad endpoint configs are traceable instead of silently returning False. - Model list fetch failure now logs a warning with the endpoint URL so endpoints that silently vanish from the model picker are diagnosable. * log: pass exception via exc_info instead of string interpolation * fix(logging): avoid logging raw URLs in llm_core error paths Drop the raw url/base_chat_url from the Ollama-detection and model-list-fetch warning logs added by this sweep, since these values can contain private hostnames, internal IPs, credentials, or other deployment details. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-15 17:49:27 +01:00
Kfir Sadeh	fc3a5e555e	feat(paths): abstract runtime path logic for frozen distribution packages (#969 ) * feat(core): abstract runtime path logic for frozen distribution packages * Address review feedback: revert browser MCP check, persistent data dir default when frozen, and add path tests	2026-06-15 17:44:10 +01:00
Hriday Ranka	270b8570fc	feat(email): add Google OAuth2 for Google Workspace / .edu IMAP & SMTP (#237 ) * feat(email): add Google OAuth2 for Google Workspace / .edu IMAP & SMTP Google deprecated basic-auth (password) access for Google Workspace accounts in May 2025. This means any .edu or org Google email account could no longer connect via IMAP/SMTP with a username + password — the email feature was silently broken for a large class of users. This PR adds full OAuth2 (XOAUTH2) support for Google accounts so Workspace / .edu emails work out of the box. ## What changed ### Backend - `core/database.py`: add `oauth_provider`, `oauth_access_token`, `oauth_refresh_token`, `oauth_token_expiry`, and `display_name` columns to `EmailAccount` + idempotent migration - `routes/email_helpers.py`: XOAUTH2 auth in `_imap_connect()` and `_send_smtp_message()`, automatic token refresh, OAuth fields in `_get_email_config()` - `routes/email_routes.py`: OAuth authorize + callback routes, `_smtp_ready()` fix, OAuth fields through `_deliver()` closure, `display_name` in `From:` header ### Frontend - `static/js/settings.js`: "Google Workspace / .edu" provider preset, "Connect with Google" button, success/error banner, display name field - `static/js/document.js`: `_accountCanSend()` recognises OAuth accounts as SMTP-capable * security: sign OAuth state, scope callback by owner, fix quotes & logs Addresses reviewer feedback on the email OAuth2 PR: - OAuth state is now HMAC-SHA256 signed (keyed with the app secret from secret_storage) encoding account_id + owner + a random nonce, and is verified with constant-time comparison in the callback before any token write. Replaces the bare account_id state, closing the CSRF / state-guessing gap. - Callback extracts the owner from the verified state and re-checks it against EmailAccount.owner before writing tokens, matching the ownership guards used elsewhere in the email routes. Single-user mode (owner == "") still accepts any account, consistent with _assert_owns_account. - Replaced curly/smart quotes in the Name/Email/Display Name input rows with plain ASCII so getElementById lookups and event wiring work. - Stripped account name, SMTP host/user, owner, and raw provider error text from send-config and OAuth logs; failures now surface as generic error codes in the redirect instead of raw exception strings. * test(email): add OAuth2 state, _smtp_ready, and XOAUTH2 tests Move the OAuth state sign/verify helpers out of the setup_email_routes closure into module-level make_oauth_state/verify_oauth_state in email_helpers.py so they can be unit-tested, then add tests/test_email_oauth.py: - signed state round-trips account_id + owner, nonce is unique per call - tampered account_id, forged signature, and garbage states are rejected - _smtp_ready treats an OAuth account (no password) as send-capable, and still rejects host+user-only accounts with neither password nor OAuth - _xoauth2_string / _xoauth2_bytes produce the correct SASL XOAUTH2 framing 14 new tests; existing test_security_regressions.py still passes (28). * refactor(email): single XOAUTH2 frame helper, use RuntimeError Polish from self-review before merge: - Collapse the XOAUTH2 framing to one source of truth: _xoauth2_raw() returns the unencoded SASL string used by both the SMTP and IMAP auth callbacks (each library base64-encodes it), and _xoauth2_bytes() is just its .encode(). Removes the unused base64 _xoauth2_string helper and the duplicated inline frame in _send_smtp_message. - Raise RuntimeError (not bare Exception) for the "OAuth token unavailable" path, matching the convention used across src/. - Update tests accordingly. All 14 OAuth tests + 28 security regressions pass; SMTP/IMAP XOAUTH2 verified live against a real Workspace account. * tests(email-oauth): cover the security-sensitive OAuth paths before merge The previous tests only exercised pure helpers (state signing, _smtp_ready, XOAUTH2 framing). This adds coverage for the actual token-custody and ownership behaviour, pinning the real route handlers rather than re-implementations of their logic. Real OAuth callback route (pulled live from setup_email_routes()): - missing code -> generic missing_code redirect, no account id / owner in URL - provider error -> generic google_error redirect, raw error not echoed - tampered/invalid state -> invalid_state redirect, auth code never leaked - signed state with owner mismatch -> token write refused (ownership_error), DB row left untouched - signed state with matching owner -> tokens written encrypted, and only to the intended account (a second account stays untouched) Real accounts-list route: - exposes oauth_provider status but never the access/refresh token values, encrypted or otherwise Token storage / refresh helpers (isolated in-memory SQLite, mocked HTTP): - refreshed access token stored encrypted; expiry is a timestamp, not a token - fresh token uses cache (no refresh call); expired token triggers refresh - refresh HTTP failure returns None silently, no exception or secret surfaced - missing client credentials short-circuits to None Password-account regression: - password IMAP accounts call conn.login(); OAuth accounts call XOAUTH2 authenticate() and never login() 28 tests pass (14 prior + 14 new). * fix(email-oauth): drop raw exception text from token-refresh log Google token refresh failures now log the account id only, matching the conservative logging used elsewhere on the OAuth path — no raw provider/exception details surfacing in logs. * fix(email-oauth): bring OAuth UI parity to the Integrations email form The Google Workspace / .edu provider preset, Display Name field, and Connect-with-Google flow were only wired into the Email-tab account form. The Integrations-tab form (a separate code path for the same account type) was missing all three, so the OAuth option was invisible from that entry point. Mirrors the same PROVIDERS entry, OAuth section, and connect handler so both forms behave identically. --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Alexandre Teixeira <111787685+alteixeira20@users.noreply.github.com>	2026-06-15 17:02:58 +01:00
Léo	0750486654	fix(notes): fail closed when an unauthenticated request reaches owner-scoped routes (#4062 ) * fix(notes): fail closed when an unauthenticated request reaches owner-scoped routes The notes CRUD routes resolved the acting user with bare get_current_user(). A request that reached them with no identity (auth-middleware regression, SSRF from a sibling service) came through as user=None — which every query treats as the single-user mode: list all accounts' notes, read/update/ delete/pin/archive any row, reorder globally. Resolve the owner through require_user() instead, which already encodes the right policy: 401 when auth is configured, while the documented anonymous modes (AUTH_ENABLED=false, LOCALHOST_BYPASS on loopback, unconfigured first-run) still resolve to the single-user path. fire-reminder in the same file already gated this way; the CRUD routes now match, and the inline require_user import there is folded into the module import. Extracted from #2940 (stabilization slice). Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * test(notes): drive fail-closed test via ASGITransport, not sync TestClient The focused fail-closed test hung at `TestClient(app).get(...)` on some environments. Starlette's sync TestClient runs the app in a background event-loop thread (anyio blocking portal) and then dispatches each sync endpoint onto a second worker thread; that handshake deadlocks on certain anyio/httpx/platform combos. The identity injection also used BaseHTTPMiddleware (@app.middleware("http")), the other known TestClient deadlock source. Switch to the repo's existing httpx.ASGITransport + AsyncClient idiom so the whole request runs on the test's own event loop (no portal thread, no BaseHTTPMiddleware). Identity now comes from a pure-ASGI shim that writes the same request.state fields the real auth middleware sets, and a non-loopback client peer keeps require_user's loopback fall-throughs out of the picture. Same assertions and coverage; production code unchanged. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Fable 5 <noreply@anthropic.com>	2026-06-15 17:43:28 +02:00
RaresKeY	d38e2cbc07	fix(ci): avoid duplicate CodeQL setup (#4297 )	2026-06-15 16:39:13 +01:00
Ashvin	7fd937fa57	fix(calendar): parse "mins"/"hrs" reminder offsets in manage_calendar (#4266 ) _reminder_minutes matched the offset with (?:m\|min\|minute\|minutes)\b and (?:h\|hr\|hour\|hours)\b. The trailing \b makes the common plural abbreviations "mins"/"hrs" fail to match (after "min" the "s" is a word char, so no boundary), so reminder_minutes "5 mins" or "2 hrs" returned None and the event was created with no reminder, silently. Widen the two unit regexes and the matching reminder_only description regex to a strict superset that also accepts mins/hrs. The sibling duration parser already accepts these forms (it has no \b), so this only brings the reminder parser in line.	2026-06-15 17:37:28 +02:00
Catalin Iliescu	c41caac438	fix(cookbook): only persist successfully stopped scheduled serves (#4267 ) Co-authored-by: Cata <cata@bigjohn.local>	2026-06-15 17:30:18 +02:00
Kenny Van de Maele	1747c13133	test: align README presentation guards with the #4306 refresh (#4311 ) * test: align README presentation guards with the #4306 refresh The 'Refresh README presentation' change (#4306) swapped the ASCII banner for a centered wordmark image and moved the native quickstart into docs/setup.md, which left four base tests failing on dev and froze the merge gate: - test_security_regressions::test_readme_native_quickstart_uses_loopback now also accepts the loopback guidance from docs/setup.md, where the quickstart moved (no behaviour change; the guidance is intact there). - test_readme_ascii_fenced guards the new wordmark title instead of the removed ASCII banner, and keeps a defensive check that any reintroduced box-drawing banner stays inside a code fence (the original #1390 mode). - The five unreferenced demo gifs under docs/ (chat, compare, document, notes, research) are removed so test_docs_no_orphan_images passes; they were de-referenced by the refresh. Recoverable from history if a docs page wants to embed them again. * chore: refresh PR checks --------- Co-authored-by: Alexandre Teixeira <alexandremagteixeira@gmail.com>	2026-06-15 16:25:38 +01:00
RaresKeY	ffd0aaf69b	fix(cookbook): validate adopt host (#4282 )	2026-06-15 16:44:24 +02:00
RaresKeY	81e7074d93	fix(gallery): confine replacement image path (#4285 )	2026-06-15 16:42:41 +02:00
RaresKeY	f66a23d19d	fix(ai): validate generated image result URLs (#4289 )	2026-06-15 16:40:49 +02:00
RaresKeY	f602819523	fix(models): scope API-token model listing (#4292 )	2026-06-15 16:38:41 +02:00
RaresKeY	85a773ea02	fix(personal): resolve upload delete path (#4291 )	2026-06-15 16:38:37 +02:00
PewDiePie	fb0a64fe4f	Merge pull request #4306 from pewdiepie-archdaemon/readme-refresh-default-branch docs: refresh README presentation	2026-06-15 23:28:20 +09:00
pewdiepie-archdaemon	bcf46dafb9	Refresh README presentation	2026-06-15 23:26:10 +09:00
pewdiepie-archdaemon	b118c33e37	test(provider): align lookalike-host URL expectations with /models behavior build_models_url returns /models (no /v1 prefix) for non-local generic OpenAI-compatible hosts (intentional, see endpoint_resolver.py:206). The tests added in #4272 expected /v1/models, which is the local/deepseek behavior. Match production semantics.	2026-06-15 23:21:49 +09:00

1 2 3 4 5 ...

1545 Commits