odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 09:45:24 -04:00

Author	SHA1	Message	Date
Vykos	4a9085d252	fix(endpoint): scope secondary endpoint lookups by owner * Scope secondary endpoint lookups by owner * Reject unregistered image endpoint URLs for non-admins * Adjust owner-scope tests for rebased routes * Allow non-admins to compare endpoints they own The compare owner-scope guard called _reject_raw_endpoint_url_for_non_admin with endpoint_id=None, so it rejected every signed-in non-admin /api/compare/start request — even for endpoints the caller owns — because compare resolves endpoints by URL and carries no endpoint_id. That locked non-admins out of compare entirely. Resolve the owned ModelEndpoint first and pass its id, so a registered endpoint the caller owns is allowed while only truly raw, unregistered URLs are rejected (mirrors the gallery inpaint/harmonize checks in this PR). Replace the source-only reject test with deterministic reject + allow regressions that no longer depend on the dev DB contents. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Bind compare sessions to the resolved owner-scoped endpoint /api/compare/start created the [CMP] helper sessions with the raw caller-supplied endpoint URL and only used the owner-scoped lookup to decide whether to copy an API key. That stopped key borrowing but still let a non-admin inject an arbitrary raw endpoint URL into the compare session path. Now, when the supplied URL resolves to a registered endpoint visible to the caller, the session binds to that row's own normalized base URL (build_chat_url(normalize_base(ep.base_url))) plus its headers — the same registered-endpoint shape session_routes uses. The raw URL survives only when ep is None, which non-admins already hit a 403 on, leaving raw URLs reachable solely for admins / single-user mode with no borrowed key. Adds compare-specific behavior tests: another user's private endpoint is rejected (nothing created), the session binds to the stored URL rather than the raw input, and an admin raw URL is allowed but carries no inherited key. Addresses the review on #1511. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Validate both compare endpoints before creating any session start_comparison resolved + created each [CMP] session inside one loop, so a request pairing a valid owned endpoint A with an unregistered raw endpoint B raised 403 only after A's session was already created — and its Authorization header copied in. The rejected request left a partial compare session with that header behind. Split the flow into two phases: phase 1 resolves and owner-validates both endpoints (running the raw-URL reject helper) and stashes the session URL + headers; phase 2 creates the two sessions only once both passed. A 403 on either endpoint now aborts with nothing created and no header copied. Adds a regression test: owned endpoint A + unregistered/raw endpoint B -> 403 with no sessions created. Addresses the follow-up review on #1511. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Resolve compare credentials by endpoint id, not URL alone Two endpoints visible to a caller can share a base_url but hold different api_keys. _owned_endpoint_by_url returned whichever row sorted first, so /api/compare/start could copy the wrong key into the [CMP] session. Add _owned_endpoint_by_id (same owner scoping) and optional endpoint_a_id/ endpoint_b_id form fields. The id pins the exact registered endpoint; URL resolution remains only for legacy/admin raw-URL callers. An id the caller can't see 404s instead of falling back to a same-URL row. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Loosen research-routes owner-scope assertion to the stable substring The rebased _resolve_research_endpoint generalized its owner derivation to honor an explicit owner arg first (owner = owner or getattr(sess, ...)), so the exact-line assertion broke CI. Assert the stable session-derivation substring instead of the full line. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 11:51:55 +01:00
stocky789	1e0d9b92af	feat: add ChatGPT Subscription provider (#2876 ) * feat: Add ChatGPT Subscription support and related features - Introduced a new provider option for ChatGPT Subscription in the endpoint selection UI. - Implemented OAuth flow for ChatGPT Subscription sign-in, including polling for authorization status. - Updated admin interface to handle ChatGPT Subscription, including disabling API key input and providing user guidance. - Enhanced cost tracking logic to differentiate between subscription and non-subscription endpoints. - Added new slash commands for managing skills, including listing, searching, and invoking skills. - Implemented caching for skill catalog to optimize performance. - Updated tests to cover new ChatGPT Subscription functionality and ensure proper endpoint probing. - Refactored existing code to accommodate new features and improve maintainability. * refactor: share provider device-flow setup - reuse one device-flow backend for Copilot and ChatGPT Subscription - add one frontend device-flow helper for Settings and /setup - put GitHub Copilot back into Add Models, now as a dropdown option - make provider selection just select; clicking Add starts sign-in - stop ChatGPT Subscription setup from opening auth tabs automatically - make /setup copilot and /setup chatgpt-subscription work from chat - show ChatGPT Subscription in the /setup suggestions - show the real error message when setup fails - add focused tests for the shared flow and setup UI * feat(chatgpt-subscription): harden credential lifecycle and streamline auth UX Backend: - Resolve runtime bearer for provider-auth endpoints at probe time via a shared _resolve_probe_key() that delegates to resolve_endpoint_runtime, applied across all probe/refresh call sites. - Skip live completion probes and health pings for discovery-only providers (centralized behind _is_discovery_only_provider) — the Codex/Responses API has no such endpoints, so status is derived from cached models. - Never persist the short lived ChatGPT bearer to the plaintext sessions table; proactively clear any stale bearer left by an earlier code path. - Revoke orphaned ProviderAuthSession credentials when the last endpoint backing them is deleted (_delete_orphaned_provider_auth), surfaced via cleared_provider_auth in the delete response. Frontend (admin.js): - Auto-start the device-auth flow on provider selection so the authorization panel (code + Authorize) shows immediately instead of behind a "Sign in" click. - Remove the redundant top button for device auth providers, move retry into the panel via an inline "Try again". - Drop the self-evident hint text and add an execCommand clipboard fallback so Copy works in non-secure (HTTP/LAN) contexts. * fix: harden chatgpt subscription provider * chore: remove PR media from branch * Fix chatgpt subscription recovery and token handling --------- Co-authored-by: 5p00kyy <admin@5p00ky.dev>	2026-06-08 10:19:18 +02:00
Mike	ac94885c84	refactor(constants): single source of truth for data dir (#3368 ) * refactor(constants): single source of truth for data dir + merge core/src constants Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * docs(contributing): use named src.constants for data paths, drop core/constants references Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-08 09:58:52 +02:00
Lucas Daniel	adc6ac9394	fix(compare): stream Compare panes directly to stop upstream promptly The previous approach polled request.is_disconnected() inside the async-for body of the chat/agent streaming loops. That happens too late: by the time the poll runs, __anext__() has already awaited and consumed the next upstream chunk, so a slow or silent generation could still run for a full round-trip (or until a read timeout) after the client disconnected. It was also unconditional, which would have made ordinary chat navigation/refresh/tab-close stop a run that the detached-run design intentionally keeps going server-side. Both problems trace back to the same root cause: chat_stream always wraps its generator in agent_runs (the detached-run manager), which decouples the generator's lifetime from the SSE response on purpose so normal chat/agent streams survive the client going away. Polling disconnection inside a detached generator can never be "prompt" — the generator isn't tied to that request anymore — and doing so defeats the whole point of detaching it. Compare panes don't need (or want) that: each pane's session exists only to drive that one generation, there's nothing meaningful to /resume, and the user expects the pane's Stop button — which aborts the fetch and closes the SSE — to cancel the upstream call right away. So route compare-mode requests around the agent_runs wrapper entirely and stream the generator directly as the SSE body. Starlette already cancels a streaming response's body iterator (raising CancelledError/GeneratorExit into it) the instant it notices the client disconnected — including while the generator is mid-await on the next upstream chunk — and the existing except (CancelledError, GeneratorExit) handlers in both the chat-mode and agent-mode loops already save the partial response exactly once. No polling needed; the redesign just stops getting in its own way. Normal (non-Compare) chat and agent streams are untouched and keep going through agent_runs, preserving detached-run semantics (surviving tab close / navigation / refresh, reconnect via /api/chat/resume). Replaces the source-text assertions in tests/test_compare_stop_disconnect_poll.py with runtime tests that actually exercise the cancellation contract: a Compare-shaped generator is cancelled mid-await (not after the next chunk arrives) and saves its partial exactly once; a normal completion still saves exactly once via the completion path; agent_runs keeps a detached run alive when its subscriber disconnects and only stops it on an explicit stop()/cancel (also saving the partial exactly once); and the cancellation contract is pinned for both chat-mode- and agent-mode-shaped chunk sequences.	2026-06-08 01:13:45 +01:00
Lucas Daniel	fa7c4f8ea9	fix(search): catch HTTPStatusError so 403/404 URLs degrade gracefully instead of 500 (#2203 ) raise_for_status() raises httpx.HTTPStatusError for 4xx/5xx responses, but the surrounding try/except only caught httpx.RequestError (network errors) and RateLimitError (429). Any other HTTP error code propagated uncaught up through chat_processor -> chat_helpers -> chat_routes and surfaced as a 500 Internal Server Error. Added an explicit except httpx.HTTPStatusError clause that logs a warning and returns an empty result, matching the behaviour already in place for network errors. Also adds focused regression tests that exercise the real fetch_webpage_content() path with a mocked _get_public_url: - 403/404 responses return the standard empty-result shape instead of raising, proving the new HTTPStatusError handling works end to end. - 429 responses still take their own dedicated rate-limit branch (the status_code == 429 check runs before raise_for_status() is reached), keeping that behaviour distinct from the new generic HTTPStatusError handling. Dropped the unrelated builtin_mcp.py change that had been carried over from a rebase; that fix is tracked separately in #2018 and this branch should stay scoped to the search content fetch path. Closes #2148	2026-06-08 01:09:21 +01:00
Alexandre Teixeira	77b75ca97e	docs(tests): define testing standard and taxonomy (#3372 )	2026-06-08 01:15:47 +02:00
Kenny Van de Maele	505d8bae5a	fix(cookbook): locate cookbook_state.json via DATA_DIR, not hardcoded /app/data (#3332 ) Three call sites hardcoded Path("/app/data/cookbook_state.json"), which only exists in Docker; on a native run the real path is <repo>/data, so the state file looked missing and cookbook serve-state was silently ignored. Two others used os.environ.get("DATA_DIR", "data") (a relative fallback, since DATA_DIR is never set as an env var). Route all five through core.constants.DATA_DIR so the path is consistent and absolute on both Docker and native. Part of #3331. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 00:13:47 +01:00
horribleCodes	9c90f62657	fix(platform): Improve WSL SSH remote compatibility (#3316 ) * fix(platform): add WSL compatibility functions and path translation fix(cookbook): enhance model scan script to support additional HuggingFace cache paths fix(hardware): improve cache key generation for remote SSH context test(tests): add tests for WSL detection and path translation functionality * fix(cookbook): prefer prebuilt wheels for llama-cpp-python and normalize package aliases * fix: enable StrictHostKeyChecking in nvidia probe refactor: consolidate ssh & powershell command execution to utility functions in core module refactor: consolidate nvidia path candidates in to single variables in core module tests: add tests for new utility functions * fix: correct wrong variable name	2026-06-08 00:33:50 +02:00
Lucas Daniel	73315e6ddc	fix(skill-extractor): walk all brace candidates so stray braces in prose do not swallow valid JSON (#2205 ) * fix(skill-extractor): walk all brace candidates so stray braces in prose do not swallow valid JSON The extractor sliced from the FIRST brace to the LAST brace to recover JSON embedded in surrounding commentary. When the model emits stray braces before the JSON object, the slice produces invalid JSON, json.loads raises, and the exception is swallowed -- the skill is silently lost. Fix: walk each brace candidate left-to-right and attempt json.loads on each slice. The first candidate that parses successfully wins. If none parse, json.loads on the original text raises and the existing JSONDecodeError handler logs and returns None as before. Tested locally -- 8/8 tests passed: tests/test_extract_skill_json_nonstring.py (2 passed) tests/test_skill_extractor_rows.py (1 passed) tests/test_search_content_extraction_parity.py (2 passed) tests/test_deep_research_search_error.py (3 passed) Closes #2199 * test(skill-extractor): add focused repro for stray-brace JSON recovery * test(skill-extractor): add regression test for leading invalid-brace fragment Addresses the remaining edge case from review: a response that starts with a brace but the leading fragment isn't valid JSON (e.g. '{not json}\n{"title": "Valid later", ...}') still needs to recover the valid skill object that follows. _extract_json_object (already on dev) handles this correctly — it tries the whole de-fenced string first, then walks each '{' candidate left-to- right regardless of whether the response begins with '{', so the leading invalid fragment no longer short-circuits recovery of the real object. Updates the comment at the call site to call this out explicitly and adds a regression test covering exactly the scenario described in review.	2026-06-07 23:31:12 +01:00
Alexandre Teixeira	a017108d41	refactor(tests): add temp sqlite helper (#2930 )	2026-06-07 23:44:16 +02:00
Alexandre Teixeira	9ad6a2809e	test(diffusion-server): exercise security middleware wiring (#3214 )	2026-06-07 23:42:11 +02:00
nubs	1a0e1c5d69	fix(documents): restore PDF library metadata and preview (#2483 ) PDF uploads are stored as markdown wrappers with pdf_source or pdf_form_source markers so the editor can preserve extracted text, form fields, and annotations. The library exposed that internal wrapper: auto-created PDF documents used the hashed storage filename as the title, and row/facet language reported markdown instead of pdf. Derive chat-upload PDF titles from the original upload name, derive document-library display language from the PDF source marker for rows, filters, and facets, and keep markdown wrappers excluded from the markdown facet when they represent PDFs. The expanded library card already renders PDF-backed documents through /api/document/{id}/render-pdf. Allow only that inline PDF preview endpoint to be framed by same-origin app pages while leaving normal routes on X-Frame-Options: DENY and frame-ancestors none. Also tighten the existing PDF marker regression assertion so it matches the actual historical corruption signature instead of contradicting the preserved [Page 1 text]: marker. Fixes #2468	2026-06-07 23:23:27 +02:00
Kenny Van de Maele	76c1f42ab0	fix: route all agent loopback calls through internal_api_base() helper (#3322 ) #2753 made the agent loopback base port-configurable but only for _COOKBOOK_BASE in tool_implementations. Several other in-process loopback calls still hardcoded http://localhost:7000 and broke off port 7000: cookbook_serve_lifecycle (model-endpoints x2, shell/exec), builtin_actions (model/serve), task_routes (calendar x3), and the gallery/email calls in tool_implementations. Extract the resolution (ODYSSEUS_INTERNAL_BASE / APP_PORT / 7000 fallback, 127.0.0.1 to avoid IPv6 ambiguity) into core.constants.internal_api_base() and route every call site through it. Rename the now-misnamed _COOKBOOK_BASE to _INTERNAL_BASE since it serves gallery/email/calendar/serve too. Adds a test for the resolver plus a regression guard against reintroducing the literal. Part of #2752. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 22:22:09 +01:00
Gunnar Arias	d85c5e335e	fix(security): harden untrusted_context_message against delimiter spoofing (#3086 ) * fix(security): harden untrusted_context_message against delimiter spoofing Root cause: untrusted_context_message() did not sanitise content before interpolating it into the <<<UNTRUSTED_SOURCE_DATA>>> / <<<END_UNTRUSTED_SOURCE_DATA>>> delimited sandbox block. Malicious content embedding the literal delimiter strings could prematurely close the sandbox and inject instructions that the LLM treats as trusted. Fix: add _escape_guard_markers() helper that replaces the guard marker strings with structurally inert tokens (<<<_UNTRUSTED_DATA>>> and <<<_END_UNTRUSTED_DATA>>>) before the content is wrapped. The function is applied in untrusted_context_message() after casting content to str. The existing ~13 call sites (chat_processor.py, agent_loop.py, deep_research.py, chat_helpers.py, chat_routes.py) are unaffected because they pass content through without inspecting the output delimiters. Regression tests added in tests/test_prompt_security.py covering: - _escape_guard_markers unit tests (open, close, both, benign passthrough) - untrusted_context_message integration tests (delimiter spoofing neutralisation, type coercion, None handling, metadata preservation) Resolves #3056 * fix(security): sanitize label for newlines and guard markers Addresses reviewer feedback on PR #3086: - Normalize label: strip CR/LF to prevent pre-guard line injection - Escape guard marker literals in label via _escape_guard_markers() - Add regression tests for label-based newline injection, GUARD_OPEN and GUARD_CLOSE in label, and exactly-one-structural-guard assertion * fix(security): move Source label inside GUARD_OPEN block The reviewer correctly identified that even after sanitizing the label, any user-derived label text (e.g. `f"web page: {url}"`) still appeared before GUARD_OPEN in the trusted framing zone, where the LLM treats it as trusted instructions. Fix: move the 'Source: {label}' line to inside the guarded block so only the hardcoded UNTRUSTED_CONTEXT_HEADER sits before GUARD_OPEN. The raw label is still kept in metadata["source"] for traceability. _sanitize_label() and _escape_guard_markers() are kept for defence-in- depth on the label stored inside the block. Update test_label_newline_injection_is_blocked to assert no label- derived instruction text appears before GUARD_OPEN (pre-guard zone is now empty of any user-derived content).	2026-06-07 22:15:50 +01:00
Syed Ali Jaseem	f939cb65ce	refactor(tests): replace local function copies in test_endpoint_resolver with real imports (#3359 ) * refactor(tests): replace local function copies in test_endpoint_resolver with real imports The test file carried 9 verbatim copies of src/endpoint_resolver.py functions to avoid import-pollution concerns, but these copies are a drift hazard — PR #3343 had to update both in parallel. Replace them with direct imports so future changes to endpoint_resolver are automatically exercised by the test suite. Also fixes _ollama_api_root in endpoint_resolver.py: the bare-URL Ollama case (e.g. http://nas:11434 with empty path) was already handled correctly in the test copy but was missing from the real function, which would return /chat instead of /api/chat for native Ollama endpoints without an explicit /api prefix. Closes #3351 * refactor: import _ollama_api_root from llm_core instead of duplicating it endpoint_resolver already imports _detect_provider and _host_match from llm_core. Add _ollama_api_root to that import and remove the local copy, collapsing two implementations to one source of truth. llm_core's version is a superset (also strips /api/chat\|tags\|generate paths), and since normalize_base already removes those suffixes upstream the result is identical for every input used here.	2026-06-07 22:47:57 +02:00
nubs	865e61450e	fix(upload): configure chat attachment size limit (#2439 )	2026-06-07 22:42:24 +02:00
nubs	8746c9c0df	fix(documents): discard pending AI diff before switching active doc (#2484 ) The document editor stores the AI-edit diff state (_diffModeActive, _diffOldContent, _diffNewContent, _diffChunks) as a module-global singleton bound to whatever document was active when the diff opened, and every document shares one #doc-editor-textarea. When the active document is switched while an unapproved diff is open, the stale diff must be discarded first or a later exitDiffMode (tab switch / Accept-Reject-All) flushes the old document's content into the new active document and overwrites it (issue #2467). Guard both paths that switch the active document for an AI update, while activeDocId still points at the previously-active doc: - handleDocUpdate(): a doc_update targeting a different document. - streamDocOpen(): the AI streaming a NEW document — this runs first on that path, so a guard only in handleDocUpdate would fire too late and still overwrite the streamed document. Both reuse the exact `if (_diffModeActive) exitDiffMode(true);` guard switchToDoc() and enterDiffMode() already use. Fixes #2467	2026-06-07 22:35:35 +02:00
nubs	f7c0b3f23b	fix document preview refresh after AI edits (#2259 )	2026-06-07 22:33:01 +02:00
Syed Ali Jaseem	e3e37ce526	fix(sessions): scope enrichment queries by owner, add LIMIT to auto_sort (#3350 ) GET /api/sessions fired full-table scans against sessions, documents, and gallery_images on every call. Added DbSession.owner == user (line 265), Document.owner == user (line 283), GalleryImage.owner == user (line 289), and .limit(2000) to auto_sort_sessions (line 1013). All follow the existing owner-scoping pattern at lines 700 and 1230. No behaviour change — the response was already correct; this eliminates the over-fetch.	2026-06-07 21:32:21 +02:00
adabarbulescu	a8859bb25c	fix(llm): Properly detect remote Ollama bare URLs as native endpoints (fixes #3252 ) (#3343 )	2026-06-07 21:19:19 +02:00
lekt8	accdc4fc53	Add hover tooltips for clipped model names (#1982 ) (#1985 ) Long model names are truncated with ellipsis in two places with no way to see the full name: the model-picker dropdown items and the chat-header model indicator. Add a native title tooltip carrying the full name to both — the dropdown item's name span (nameSpan.title = m.display) and the header label (label.title = the full model id; empty for the 'Select model' placeholder). Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 19:23:44 +02:00
RaresKeY	3a91c11ff8	fix: block app_api access to Cookbook host controls (#3231 )	2026-06-07 19:20:11 +02:00
Ashvin	00e8084969	fix(notes): handle time-first due_date phrases in parse_due_for_user (#3319 ) parse_due_for_user only matched day-first format ('today at 3pm'). Time-first strings like '3pm today' or '11pm today' — which the tool schema and tool_index both advertise as valid examples — fell through all branches, hit dateutil or the legacy _parse_dt fallback, and in many cases raised ValueError. do_manage_notes then stored the raw string verbatim, and the ISO-only reminder scanner (action_ping_notes) never fired the note. Add a time-first regex branch immediately after the day-first branch to handle '<time> today\|tonight\|tomorrow\|tmrw\|yesterday'. Existing day-first parsing is unchanged. Fixes #3302	2026-06-07 19:15:38 +02:00
ooovenenoso	681a2a3f2a	fix(cookbook): scan persisted HF cache paths (#3189 )	2026-06-07 18:19:47 +02:00
Sebastian Andres El Khoury Seoane	8d9d4ec9c6	feat(platform): Add support for APFEL as part of the dependencies and models for the Cookbook. (#2657 ) * feat(platform): add support for Apple Silicon detection in platform compatibility test(tests): enhance shell_routes tests for Apple Silicon compatibility * fix issues with missing import * fix: correct package name in package-lock.json and enhance package installation commands in shell_routes.py and cookbook.js * feat: add Apfel startup and health checks on macOS - bootstrap Apfel via Homebrew on arm64 macOS - start `apfel --serve --port 11435` detached for Odysseus - verify readiness via `/health` - clean up the Apfel process on exit or Ctrl+C * fix: duplicate variable declaration post-merge conflict - Should fix `node` CI issues. * fix: issues with the update status of the APFEL dependency. - fixed by changing the main conditional that determines the update. * Fix: Remove unnecessary whitespaces and formatting for the model_routes.py file. * Fix: whitespace issues with the model_routes file * Fix: Remove unnecessary whitespaces and formatting for the model_routes.py file. Final * Fix: Fixed updates using PIP for APFEL instead of custom cmd	2026-06-07 17:28:02 +02:00
Kenny Van de Maele	8f2c8d2dc8	fix(test): tolerate owner kwarg in compaction summary resolve_endpoint mock (#3304 ) #2996 made context_compactor call resolve_endpoint('utility', owner=owner), but the mock added by #2174 stubbed it as lambda which: ..., which rejects the owner kwarg. Each PR passed alone; merged on dev the two compaction tests fail with TypeError and the pytest job goes red. Widen the mock to lambda a, *k. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 17:23:06 +02:00
SurprisedDuck	b8463e3ac2	fix(email): decode headers without injected spaces (#2433 ) routes.email_helpers._decode_header joined the runs from email.header.decode_header() with " ". Those runs carry their own surrounding whitespace (e.g. (b"Re: ", None)), and RFC 2047 §6.2 requires the whitespace between two adjacent encoded-words to be dropped, so the join produced a double space after an ASCII prefix ("Re: Jóse"), a spurious space in "Name <addr>" senders, and a stray space between two adjacent encoded-words ("Café 日本"). _decode_header backs the inbox list, message read, search, and the background pollers, so the corruption hit essentially every non-ASCII subject/sender. Use email.header.make_header(...) for RFC-correct concatenation, keeping the existing lossy per-part fallback for malformed/unknown MIME charsets (make_header raises LookupError there) so the unknown-charset contract in tests/test_email_decode_header.py still holds. The sibling mcp_servers.email_server._decode_header was already fixed the same way (commit `46999de`); this brings the routes.email_helpers copy in line, with regression coverage. Supported by Claude Opus 4.8 Co-authored-by: SurprisedDuck <288741682+SurprisedDuck@users.noreply.github.com>	2026-06-07 16:56:20 +02:00
Mazen Tamer Salah	92ef01d4fa	fix(skills): tolerate a stray brace before the JSON in skill extraction (#2200 ) maybe_extract_skill() sliced the LLM response from the first '{' to the last '}'. When a model emits a stray brace in prose before the real object (e.g. "uses {placeholder} then {...}"), the slice starts at the prose brace, json.loads fails, and a valid skill is silently dropped. Factor parsing into _extract_json_object(), which tries the whole (de-fenced) string first and then each '{' start position, returning the first candidate that parses to a JSON object. Adds tests/test_skill_extractor_json.py.	2026-06-07 16:54:36 +02:00
max-freddyfire	43c16fc7e4	fix(context_compactor): return original messages when compaction summary fails (#2174 ) On summary LLM call failure, maybe_compact was returning system_msgs+recent (dropping the older half) with was_compacted=False, misleading the caller into thinking the list was unchanged. Return the original messages list unchanged so no history is lost; the next trim_for_context call handles length if needed. Fixes #2160 Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 16:40:16 +02:00
SurprisedDuck	c75d3e1975	fix(memory): record dislikes as dislikes, not preferences (#2435 ) _fallback_memory_candidates matched both positive (prefer/like/love) and negative (hate / do not like / don't like) sentiment verbs in one regex alternation, then formatted every hit as "User prefers {X}.". So "I hate cilantro" was stored as "User prefers cilantro." -- the inverse of what the user said. These fallback facts are persisted to memory and later re-injected into the model's context, so the inverted preference actively misleads the assistant. Capture the matched verb and branch on it: negatives become "User dislikes {X}.", positives stay "User prefers {X}." (still filed under the existing "preference" category). Supported by Claude Opus 4.8 Co-authored-by: SurprisedDuck <288741682+SurprisedDuck@users.noreply.github.com>	2026-06-07 16:36:07 +02:00
YotamPeled	adbcb3763f	fix(agent): don't abort legitimate tool batches as runaway loops (#3183 ) The loop-breaker's runaway backstop counted per-tool-type call totals and tripped whenever any tool was used >=15 times — treating 15+ DISTINCT calls to one tool as a stuck loop. A real batch (e.g. "add these 18 birthdays to my calendar" emits 18 distinct manage_calendar create_event calls in one round) got flagged "calling manage_calendar over and over", the calls were discarded (next round tools_sent=0), and 0 events were created. Count IDENTICAL repeated call signatures instead (same tool AND args), via a small, unit-testable _detect_runaway_call() helper. Genuine batches pass; a model truly stuck repeating one call still trips the backstop. Adds a regression test. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-07 16:16:17 +02:00
michaelxer	bdf4ec8b24	fix: fall back to /models probe when base URL returns 404 (#3205 ) _ping_endpoint() probes the bare base URL for non-Ollama endpoints. OpenAI-compatible servers like llama-swap return 404 on the /v1 prefix but 200 on /v1/models, causing endpoints to appear offline despite being fully functional. Add a /models fallback when the base URL returns a non-auth 4xx. Auth failures (401/403) are treated as definitive — probing /models would just repeat the same rejection. Fixes #3181 Co-authored-by: michaelxer <michaelxer@users.noreply.github.com>	2026-06-07 16:09:33 +02:00
danielroytel	5d3e3c7053	feat(tasks): assign folder='Tasks' at creation + backfill migration (#2834 ) * feat: assign folder='Tasks' to task sessions at creation Task sessions (LLM, action, research) now set folder='Tasks' on their DbSession row, matching the pattern used by the Assistant folder. This enables sidebar lens filtering without changing existing session behaviour. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: add backfill script for task session folders One-shot script to set folder='Tasks' on existing [Task]/[Research] sessions that predate the folder assignment in task_scheduler.py. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor: replace standalone backfill script with automatic migration Convert scripts/backfill_task_folders.py into _migrate_backfill_task_folders() in core/database.py, called from init_db(). The migration is idempotent (only touches rows where folder IS NULL/empty) and runs automatically on upgrade, so operators no longer need a manual step to tag pre-existing task sessions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-06-07 15:33:17 +02:00
Marius	04d6a5ccaa	Fix: CORS preflight 401'd by AuthMiddleware before CORSMiddleware (#3262 ) AuthMiddleware is the outermost middleware, so a credential-less CORS preflight (OPTIONS + Access-Control-Request-Method) was rejected with 401 before CORSMiddleware could answer it. That blocks every cross-origin browser/WebView client: the preflight fails, so the real request is never sent. Let a genuine preflight through at the top of AuthMiddleware.dispatch via a pure, unit-tested predicate (core.middleware.is_cors_preflight). Precise -- only OPTIONS carrying Access-Control-Request-Method; a credentialed request is never matched -- and no data access. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-07 15:23:23 +02:00
RaresKeY	a3784da172	fix: block app_api access to shell routes (#3225 )	2026-06-07 15:19:08 +02:00
Ashvin	cbbb41dfb1	fix: avoid double bcrypt on login by using create_session_trusted (#3236 ) * fix: avoid double bcrypt on login by adding create_session_trusted * fix: update test to expect create_session_trusted instead of create_session	2026-06-07 15:10:53 +02:00
Vykos	83b0ab7cd3	Scope auxiliary LLM endpoints by owner (#2996 ) * fix(auth): scope auxiliary llm endpoints by owner * fix(auth): scope auxiliary llm fallbacks by owner	2026-06-07 14:47:44 +02:00
Vykos	2149f0fb67	fix(rag): forward owner through manager wrapper (#2991 )	2026-06-07 12:56:57 +02:00
Vykos	83fca6ac62	fix(personal): require document privilege for rag upload (#2990 )	2026-06-07 12:56:53 +02:00
Vykos	000932a6d9	fix(auth): gate api tokens from user routes (#2992 )	2026-06-07 12:55:01 +02:00
Vykos	299538ea4e	Harden note reminder dispatch ownership (#2999 )	2026-06-07 12:52:27 +02:00
Vykos	67aeea4f8b	Scope gallery image endpoints by owner (#3001 )	2026-06-07 12:51:21 +02:00
Vykos	f2a79aaf5c	Tighten manage notes owner checks (#3002 )	2026-06-07 12:50:10 +02:00
Vykos	a6490ffb1b	Harden gallery album assignment scope (#3004 )	2026-06-07 12:49:03 +02:00
Vykos	06d28e23ac	Scope document session links by owner (#3005 )	2026-06-07 12:47:20 +02:00
Vykos	7b4e6c4c1b	Enforce task chain owner scope (#3006 )	2026-06-07 12:43:43 +02:00
Vykos	3cff06781e	Scope model helper endpoint resolution (#3007 )	2026-06-07 12:40:23 +02:00
Vykos	ff4508d396	Scope vision model resolution by owner (#3009 )	2026-06-07 12:39:02 +02:00
Lucas Daniel	34bd8f0491	fix(email): guarantee IMAP conn.logout() on all exception paths (#1530 ) Three IMAP connection leaks were recently fixed via try/finally (#1325, #1330, #1423). This commit applies the same pattern to the remaining callsites that still used inline logout-only cleanup. routes/email_helpers.py: - _fetch_sender_thread_context: conn was uninitialized when the outer try/except returned early on connect failure, causing the finally block to crash on conn.close()/conn.logout(). Merged the two separate try blocks into one and added conn=None guard. - _pre_retrieve_context: ctx_conn.logout() was inside the loop body with no finally, so any exception in the folder/search loop leaked the socket. Moved cleanup into a finally block with ctx_conn=None guard. mcp_servers/email_server.py: - _list_emails: multiple inline conn.logout() calls on early-return paths; exception between them leaked the socket. Wrapped in try/finally. - _read_email: same pattern — four separate logout() calls replaced by a single finally block. - _reply_to_email: logout() called before the error check, so an exception in conn.select() leaked the socket. Wrapped in try/finally. - _download_attachment: same pattern as _reply_to_email. Also adds tests/test_imap_leak_fixes.py with 9 regression tests (one per function/failure-mode) that monkeypatch _imap_connect and assert conn.logout() is called exactly once even when IMAP operations raise.	2026-06-07 05:09:28 +01:00
Joeseph Grey	f78539ba15	fix(caldav): disable redirects on the sync/write-back DAVClient (SSRF) (#2663 ) validate_caldav_url resolves and vets the initial host, but caldav's niquests session follows 3xx redirects by default, so a validated public URL can be redirected at request time to loopback/link-local/private space, re-opening the SSRF the host check closes. The existing redirect guard only covered the settings test-connection path. Add a shared _build_dav_client helper that pins the session to zero redirects (any 3xx then raises instead of silently following an attacker-chosen Location), and route both the pull (_sync_blocking) and write-back (_writeback_blocking) paths through it. Mirrors the follow_redirects=False already used on the test-connection path. Tests exercise the real DAVClient request path (a 302 toward an internal host is refused, the sink is never contacted; the PROPFIND is asserted to reach the public server first so the check can't pass vacuously), confirm the helper disables redirects on the installed client, guard against a raw DAVClient creeping back in, cover mixed public/internal DNS results in both orderings, and add the resolves-to-no-usable-records fail-closed branch.	2026-06-07 05:05:24 +01:00

1 2 3 4 5 ...

642 Commits