odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 17:55:26 -04:00

Author	SHA1	Message	Date
Vykos	7b4e6c4c1b	Enforce task chain owner scope (#3006 )	2026-06-07 12:43:43 +02:00
Vykos	ff4508d396	Scope vision model resolution by owner (#3009 )	2026-06-07 12:39:02 +02:00
Joeseph Grey	f78539ba15	fix(caldav): disable redirects on the sync/write-back DAVClient (SSRF) (#2663 ) validate_caldav_url resolves and vets the initial host, but caldav's niquests session follows 3xx redirects by default, so a validated public URL can be redirected at request time to loopback/link-local/private space, re-opening the SSRF the host check closes. The existing redirect guard only covered the settings test-connection path. Add a shared _build_dav_client helper that pins the session to zero redirects (any 3xx then raises instead of silently following an attacker-chosen Location), and route both the pull (_sync_blocking) and write-back (_writeback_blocking) paths through it. Mirrors the follow_redirects=False already used on the test-connection path. Tests exercise the real DAVClient request path (a 302 toward an internal host is refused, the sink is never contacted; the PROPFIND is asserted to reach the public server first so the check can't pass vacuously), confirm the helper disables redirects on the installed client, guard against a raw DAVClient creeping back in, cover mixed public/internal DNS results in both orderings, and add the resolves-to-no-usable-records fail-closed branch.	2026-06-07 05:05:24 +01:00
Karandeep Bhardwaj	3940297655	fix(webhooks): redact IPv6 addresses in sanitized error messages (#3038 ) * fix(webhooks): redact IPv6 addresses in sanitized error messages sanitize_error() only stripped IPv4 literals, so a failed webhook delivery to an internal IPv6 host (::1, fe80::/fc00:: ...) leaked the address into Webhook.last_error, which is surfaced in the UI. The module already treats internal IPv6 as sensitive (see _PRIVATE_NETWORKS and src/url_safety.py); the scrubber just didn't keep up. Add an IPv6 redaction pass covering bracketed, full 8-group, and ::-compressed forms. The pattern is scoped to leave clock times ("12:34:56"), MAC addresses, and C++ "::" tokens untouched, and the ::-branch uses a lookahead over a flat character class so there is no nested quantifier to backtrack on (no ReDoS on long colon/hex runs). Adds tests/test_webhook_sanitize_error_ipv6.py. * webhook: validate IPv6 candidates with ipaddress, not a regex grammar Per review on #3038: instead of hand-rolling the IPv6 grammar in a regex (brittle, and easy to over-match colon-heavy text), use a loose regex to find candidate tokens and let ipaddress.ip_address() decide. Only tokens it parses as IPv6 are redacted, so the false-positive guards (clock times, MACs, "std::vector") now come from the stdlib instead of a custom pattern. This also covers cases the old pattern missed -- zone ids (fe80::1%eth0) and IPv4-mapped addresses -- and no longer partially mangles invalid colon strings (a 9-group token is preserved whole rather than losing its first 8 groups). The bracketed branch is a single greedy class with no X:X backtracking; verified ~1ms on 40k-char adversarial input. Extends the test file with zone-id, IPv4-mapped, and invalid-token cases. * webhook: redact bracketed/scoped/IPv4-mapped IPv6 as one unit Review on #3038 found a few IP forms left partially redacted or malformed by sanitize_error(): [fe80::1%eth0]:8080 -> [[redacted]]:8080 [::ffff:192.168.0.1]:8080 -> [[redacted][redacted]]:8080 ::ffff:192.168.0.1 -> [redacted][redacted] Two causes: the bracketed branch's character class dropped zone ids, so scoped addresses fell through to the bare branch and left the brackets and port behind; and the IPv4 pass ran first, stripping the embedded v4 of an IPv4-mapped address so the v6 pass then redacted the "::ffff:" remnant separately. Fix: - run the IP-candidate pass before the IPv4 pass, so IPv4-mapped forms are matched and redacted whole - match the full bracketed authority ([...] + optional %zone + :port) as a single token, and redact a v4-or-v6 literal inside [ ] as one [redacted] - extend the bare branch with a bounded (exactly-3) dotted-quad tail for IPv4-mapped forms; exactly-3 so it can't swallow a partial suffix and accidentally preserve an otherwise-valid address Each form now collapses to a single [redacted]; the candidate finder stays linear (~1.3ms on 40k-char adversarial input). Adds regression tests for the three reported forms and keeps the timestamp/MAC/std::vector coverage.	2026-06-07 04:55:33 +01:00
Nicholai	a3cb15d0a1	fix(agent): enforce guide-only tool policy (#3088 )	2026-06-06 18:48:24 -06:00
Mohammed Riaz	6ccd4500d7	fix(chat): show requested and actual reply models Show requested and actual reply models in chat labels when fallback or provider routing changes the responding model.	2026-06-06 04:30:16 -06:00
Ocean Bennett	fb9c7cf3da	fix(calendar): accept list event range aliases	2026-06-06 03:47:18 -06:00
Nicholai	33edc40eae	fix: route misfenced web lookups to web tools Fixes #3067	2026-06-06 03:46:31 -06:00
Giuseppe	e87a1ad8d2	fix(deep-research): wrap fetched webpage content in untrusted-context sandbox The goal-based extractor passed raw fetched webpage content straight into the LLM prompt via string substitution, bypassing the prompt-injection hardening layer in src/prompt_security.py. Split EXTRACTOR_PROMPT into EXTRACTOR_SYSTEM (task instructions + goal, trusted) and a second message built with untrusted_context_message() (raw page content, sandboxed with <<<UNTRUSTED_SOURCE_DATA>>> guards). This aligns the extractor with every other external-content injection site in the codebase (agent_loop, chat_processor, chat_routes). Fixes #3044 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-06 03:37:10 -06:00
Nicholai	86abcb75d0	fix: split Chroma embedding lanes (#3046 )	2026-06-06 03:17:19 -06:00
Nicholai	463713c2c6	feat(search): unify session transcript search (#2877 )	2026-06-05 18:08:31 -06:00
Mateus Oliveira	c2017fa089	Phase 1: consolidate tool output constants into src/constants.py (#2989 ) MAX_OUTPUT_CHARS, MAX_READ_CHARS, and MAX_DIFF_LINES are now defined once in src/constants.py and imported by the three files that previously duplicated them (tool_execution.py, tool_implementations.py, agent_tools.py). agent_tools.py re-exports them for backward compatibility. Co-authored-by: mcnoliveira <mcnoliveira@gmail.com>	2026-06-05 23:05:02 +02:00
Fijar Lazuardy	66599b02a2	allow user who disable auth to use chat (#2548 ) * allow user who disable auth to use chat * only check non user on verify session owner * fix import source * rollback 401 to 403 for unauthorized error due to unit test * change unauthenticated http code error to 401 and fix unit tests	2026-06-05 22:54:19 +02:00
Logan Davis	f72e1bd412	feat(reminders): add generic webhook as a fourth reminder channel (#2952 ) Replaces any Discord-specific reminder channel with a generic outbound webhook channel. Users pick any saved Integration as the target and supply a JSON payload template with {{title}} and {{message}} placeholders — values are JSON-escaped before substitution. Works with Discord, Slack, Teams, ntfy (JSON mode), or any service that accepts a POST with a JSON body. - `src/settings.py` — reminder_webhook_integration_id + reminder_webhook_payload_template defaults - `routes/note_routes.py` — webhook delivery block; Integration lookup, template rendering, auth wiring; built-in preset defaults so discord_webhook works out of the box without a configured template; settings_override kwarg avoids test-button race condition - `routes/auth_routes.py` — discord_webhook preset test handler - `src/integrations.py` — discord_webhook preset with description + example templates; hides auth/key fields in the Integration form - `src/builtin_actions.py` — webhook_sent delivery check - `src/tool_implementations.py` — webhook aliases + enum updated - `static/index.html` — Webhook channel option; Integration picker + payload template textarea - `static/js/settings.js` — Integration list, populateWebhookIntegrations, syncChannelRows, hints, load/save, auto-fill preset templates, test-button override payload, hide auth/key for URL-auth presets Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-05 22:47:57 +02:00
Paweł Drużyński	f4aa661502	fix ambiguous naming, remove redundant json imports, fix _MCP_ARG_PARSERS type annotations (#2874 )	2026-06-05 21:30:22 +02:00
nubs	08e543d1ff	fix(tool-parsing): don't ship unconvertible <invoke> fence content to the code executor (#2926 )	2026-06-05 21:08:54 +02:00
nubs	47a47bf71d	fix(llm): guard against null arguments in streaming tool-call accumulator (#2923 )	2026-06-05 20:57:36 +02:00
michaelxer	71dda5b106	fix: respect user round count in deep research (#2896 ) The STOP_PROMPT did not include the target round count, so the LLM could decide to stop after 2-3 rounds even when the user requested 8. Additionally, min_rounds was capped at 3 regardless of max_rounds. - Add max_rounds to STOP_PROMPT so the LLM knows the target - Change min_rounds from min(3, max_rounds) to max(2, max_rounds - 2) Fixes #2863 Co-authored-by: michaelxer <michaelxer@users.noreply.github.com>	2026-06-05 20:49:42 +02:00
Logan Davis	ad82ee1c83	feat(calendar): support multiple CalDAV accounts (#2942 ) * feat(calendar): support multiple CalDAV accounts Replaces the single CalDAV credential slot with a named account list so users can sync both a personal and work calendar simultaneously. - Add `account_id` column to `CalendarCal` + startup migration - `_load_caldav_accounts()` in caldav_sync.py reads `caldav_accounts` list from prefs, auto-migrating the legacy single `caldav` key on first use (no user action required) - `sync_caldav()` iterates all accounts and aggregates counts/errors - `writeback_event()` resolves credentials via `CalendarCal.account_id`, falling back to the first account for legacy rows - New REST endpoints: GET/POST/PUT/DELETE `/api/calendar/config/accounts` - Legacy GET/POST `/api/calendar/config` preserved for backward compat - Settings UI: one card per account with Label, URL, Username, Password fields; Test button works for both unsaved (inline creds) and saved (by account_id) accounts; delete removes only that account - Update test_caldav_url_hardening.py mock to include `_save_for_user` and updated `_sync_blocking` signature * fix(calendar): restore #2765 PK scoping and #2819 writeback URL validation Two regressions introduced by the multi-account refactor: 1. PK collision (#2765): _stable_cal_id was back to hashing only the URL, so two users — or one user with two accounts on the same server — would collide on the primary key. Restore owner+account_id in the hash key (format: "{owner}\n{account_id}\n{url}") and thread both values through _sync_blocking → _writeback_blocking → push_event → find_remote_calendar so the hash round-trips correctly on write-back. 2. URL validation dropped (#2819): _load_caldav_accounts imported _save_for_user at function scope, causing an ImportError on test mocks that only provide _load_for_user, which prevented writeback_event from reaching the validate_caldav_url call. Move the import inside the migration branch and wrap in try/except (best-effort save; next call re-migrates from the still-present legacy key). Update fake_writeback_blocking in test_caldav_writeback.py to accept the new owner/account_id optional params.	2026-06-05 20:32:50 +02:00
nubs	fa9f62b44c	fix(compactor): shrink oversized tool_calls arguments so trim_for_context can fit a tool-only turn (#2949 )	2026-06-05 20:23:38 +02:00
Kenny Van de Maele	8ce945d338	feat: Add plan mode to the chat agent (#638 ) * feat: Add plan mode to the chat agent Adds a plan mode: the agent investigates read-only, proposes a checklist, and waits for approval before changing anything. On approval it runs with full tools and checks items off as it goes. Enforcement reuses the existing disabled_tools gate. Includes a slash command: `/plan [on\|off]` (and `/toggle plan`) to flip the plan toggle from the chat input. - src/tool_security.py, src/mcp_manager.py: read-only allowlist (tools + MCP). - src/agent_loop.py, routes/chat_routes.py: union the disabled set, prepend the plan directive, force agent mode. - static/: plan toggle pill, Approve & Run, dockable plan window, task-list checkboxes, and the /plan slash command. - tests/test_plan_mode.py. * Plan mode: persistent re-referenceable plan + agent write-back Three improvements so a long plan survives a weak model and stays in reach: 1. Re-reference the plan (out-of-context fix). On the execution turn the frontend sends the approved checklist back (`approved_plan`); the backend pins it as a top-of-context `## ACTIVE PLAN` system note (kept by the context trimmer), so the agent can always re-read the plan instead of losing the thread on a long run. New `build_active_plan_note()` (unit-tested). 2. Re-open / dock the plan anytime. The plan checklist is stored per-session (localStorage). When a plan exists, the plan-mode button opens a small menu ("Show plan" / "Plan mode: On/Off") that re-opens the side-dockable plan window — so it can stay docked while the agent works. The window live-refreshes as the plan changes. 3. Agent write-back: new `update_plan` tool. The agent calls it to tick steps `- [x]` after finishing them, or to revise steps when the user asks. Marker tool (no I/O) → `plan_update` SSE event → the stored plan + docked window update live. The ACTIVE PLAN note instructs the agent to use it. Backend: src/agent_loop.py (param + pin + note builder + emit + prompt blurb), src/tool_execution.py (update_plan handler), routes/chat_routes.py (parse `approved_plan`, relay `plan_update`), registration in tool_schemas / agent_tools / tool_index (always-available, not admin-gated). Frontend: static/js/chat.js (plan store, send `approved_plan`, handle `plan_update`, capture restated checklists), static/app.js (plan-button menu), static/js/planWindow.js (`isPlanWindowOpen`), static/js/storage.js (PLAN key). Tests: tests/test_plan_mode.py (plan-note), tests/test_update_plan_tool.py. * Plan mode: drop bash/python, rely on read-only discovery tools Shell can mutate (write files, hit the network) and can't be constrained to read-only at the tool layer, so plan mode no longer relies on a prompt to keep it well-behaved — bash/python are removed from the read-only allowlist and added to the fail-closed block set. Discovery is covered by the dedicated read-only tools (read_file, grep, glob, ls) instead. Rewrites the plan-mode directive to state shell is disabled and lists the available read-only tools positively. Addresses review feedback on #638. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Comment: note _MCP_READONLY_VERBS are prefixes not whole words Clarifies that entries like "summar" are intentional stems matched via startswith (covers summarise/summarize/summary), not typos. Addresses review feedback on #638. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Plan mode: clarify why gating inverts the allowlist into a denylist Rename _PLAN_MODE_FALLBACK_BLOCK -> _PLAN_MODE_KNOWN_MUTATORS and rewrite the comments. The tool gate is a denylist (disabled_tools); plan mode's policy is an allowlist, so it returns the inverse (all known tool names minus the allowlist). The static mutator set is a backstop for the schema-derived name list, which misses XML-only tools and can fail to import. Addresses review feedback on #638. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Plan mode: stop hardcoding the read-only tool list in the directive The model is already shown its available (read-only) tools by _assemble_prompt, which removes every disabled tool. Enumerating them again in the directive only duplicated that list and would drift as tools change. Point at the tools listed below instead. Addresses review feedback on #638.	2026-06-05 16:32:25 +02:00
nubs	6973c5427c	fix(model-context): count tool_calls in estimate_tokens so compaction sees real size (#2751 )	2026-06-05 15:56:54 +02:00
nubs	8354948a1c	fix(llm): route harmony thinking streams (#2449 )	2026-06-05 15:22:08 +02:00
L1	8159733c6c	fix(caldav): pull Google Calendar events from the events collection, not the /user principal (#2531 ) * fix(caldav): pull Google Calendar events from the events collection, not the /user principal Google serves its CalDAV principal at .../caldav/v2/<id>/user but events live under .../caldav/v2/<id>/events. The caldav library's principal->home-set discovery does not reliably enumerate calendars from Google's /user endpoint, so _sync_blocking fell into its 'treat the URL as a single calendar' fallback and ran every calendar-query REPORT against the principal URL. /user holds no VEVENTs, so the REPORT returned a clean but empty 200 for every date range: auth succeeded, the calendar stayed empty (Apple Calendar works because iCloud exposes standard discovery at the pasted URL). Add _google_caldav_events_url() to map a recognised Google principal URL to its events collection, and route both discovery-less fallbacks through _open_url_as_calendar() so Google syncs hit /events while other servers' URLs are used unchanged. Fixes #2507 * fix(caldav): also map Google's legacy www.google.com/calendar/dav principal URL Some Google accounts authenticate against the older CalDAV endpoint (https://www.google.com/calendar/dav/<id>/user) rather than the newer apidata.googleusercontent.com/caldav/v2 form (reported on #2507). Both have the same principal-vs-events split, so map the legacy /user URL to its /events collection as well. The legacy branch is gated on the /calendar/dav/ path so an unrelated www.google.com URL ending in /user is left untouched.	2026-06-05 15:18:16 +02:00
Yiğit Egemen	ec8fbf5d8f	Add support for EMBEDDING_API_KEY (#2691 ) * feat: support for embedding API key * feat: encrypt and decrypt embedding API key * test: add unit tests for EmbeddingClient authorization header behavior	2026-06-05 14:47:24 +02:00
the_peaceful	b5c45326e4	Fix Windows Cookbook background tasks, exit statuses, and empty SSH logs wrapper (#1389 ) This commit consolidates all Windows Cookbook background fixes into a single comprehensive commit based on the latest main branch. Key fixes included: 1. React looksSuccessful Mismatch: Append 'DOWNLOAD_OK' for pip install commands in routes/cookbook_routes.py. 2. Local Windows SSH Wrapper & Log Directory Mismatch: Bypassed ssh wrappers and dynamically selected odysseus-tmux logs for local tasks in static/js/cookbookRunning.js. 3. WSL Bash Filtration: Filtered out the WSL bash stub at C:\Windows\System32\bash.exe in core/platform_compat.py. 4. Drive-Colon Path Normalization: Replaced .as_posix() with git_bash_path() in routes/shell_routes.py and src/bg_jobs.py. 5. GGUF-Only Hardware Fitting: Restructured local Windows recommendations to rank GGUF only in services/hwfit/fit.py. 6. Safe Win32 Process Liveness Probe: Replaced os.kill(pid, 0) with a safe Win32 API probe using GetExitCodeProcess in core/platform_compat.py. 7. Prebuilt llama-cpp-python Wheels: Supply the CPU extra index during compilation failure fallback. 8. Enforce UTF-8 log encoding: Set PYTHONIOENCODING=utf-8 on Windows bootstrap runners. 9. Fix Linux Llama.cpp Build script syntax error in routes/cookbook_helpers.py. 10. Page Reload Status Check: Run sys.executable instead of 'python3' to bypass Microsoft Store execution stubs on local Windows hosts. 11. Llama.cpp serve build bypass: Bypassed cmake compilation checks on local Windows and verified python bindings directly. 12. Serve Command Path Validation: Masked safe GGUF path printf subshells '' inside the serve command validator. 13. CPU Mismatch Diagnostics: Intercepted AVX2-lacking '0xc000001d' (Illegal Instruction) crashes in static/js/cookbook-diagnosis.js and guided users to Ollama. 14. Windows Pytest stability: Fixed stub import leakage in test files.	2026-06-05 14:41:07 +02:00
Vykos	370ae5d451	Harden DAV outbound URL validation (#2819 )	2026-06-05 13:22:21 +02:00
Vykos	6d64055328	Constrain research handler JSON paths (#2846 )	2026-06-05 13:20:02 +02:00
Ocean Bennett	2a1febdeef	fix(actions): scope scheduled model resolution to owner (#2773 )	2026-06-05 13:13:13 +02:00
nsgds	0f8d12363a	fix(images): render agent-generated images in chat (#2809 ) * fix(images): render agent-generated images in chat When a chat model calls generate_image mid-conversation (agentic flow), the image does not display — it survives only as a URL the model echoes in prose. generate_image runs as a text-only MCP server, so result['image_url'] is never populated and the existing buildImageBubble render path never fires. Promote the image URL out of the tool's stdout in tool_execution so the agent loop's existing forwarding renders it via buildImageBubble — deterministically, no dependence on the model echoing the URL. Backend-only; reuses dev's image bubble, forwarding, and the tool's existing parseable output. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * feat(images): fully-qualified, valid generated-image links The chat model often mangled the generated-image URL it echoed in prose (relative path, or copying the 'image_url:' label into the link href). Build a fully-qualified link by prefixing the existing app_public_url setting (empty default keeps relative paths), and present it as a clean 'Direct link:' the model can echo verbatim (the frontend auto-links bare https URLs). One file; independent of how the image is rendered. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(images): cover _promote_image_fields; make exit-code guard self-contained Adds the unit tests requested in review on #2809: absolute URL, relative URL, no URL (result unchanged), and non-zero exit_code (not promoted). Moves the dict/exit_code==0 guard from the call site into _promote_image_fields so the function is self-contained and the failure case is unit-testable; call-site behavior is unchanged. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-05 13:04:33 +02:00
Kenny Van de Maele	0a2adc9c96	Add ask_user tool: agent-posed multiple-choice questions (#2111 ) Let the agent pause and ask the user a multiple-choice question when a task is genuinely ambiguous and the answer changes what it does next — choosing between approaches, confirming an assumption, picking a target — instead of guessing. Modeled on the existing `ui_control` marker pattern: the `ask_user` tool returns an `ask_user` payload that the agent loop emits as an SSE event and then ends the turn. The frontend renders the question with clickable option buttons, a free-text "Other" input, and an x to dismiss; the user's choice is sent as the next message and the agent resumes with it in context. - src/tool_execution.py: `ask_user` handler — pure UI marker, no I/O. Validates a non-empty question + 2..6 options, normalizes string/object options, returns the payload. - src/agent_loop.py: emit the `ask_user` event and break the round loop so the turn ends and waits for the user's selection. Stream the question as assistant text so it persists/replays (prevents a re-ask loop). - Registration: TOOL_TAGS, ALWAYS_AVAILABLE, BUILTIN_TOOL_DESCRIPTIONS, FUNCTION_TOOL_SCHEMAS, the system-prompt blurb. Not admin-gated (any user can be asked); the structured args serialize via the default json.dumps path. - routes/chat_routes.py: relay the `ask_user` event to the client. - static/js/chat.js + static/style.css: render the question card (options + free-text Other + dismiss x; removed once answered). Reuses CSS vars and the .modal-close button; emoji go through the monochrome-SVG pipeline. Bump chat.js cache pin. - tests/test_ask_user_tool.py: payload, multi flag, string options, option cap, validation errors, serializer round-trip, registration.	2026-06-05 11:49:11 +02:00
Kenny Van de Maele	367858a587	Merge branch 'main' into dev Bring main's maintainer-curated work (cookbook scheduler, calendar rendering/sync, settings polish, agent debug loop) into dev so dev is a superset of main (resolves the dev/main drift, #2543).	2026-06-05 10:50:51 +02:00
Vykos	11ba46505b	Constrain generated-image paths to image root (#2837 )	2026-06-05 10:33:47 +02:00
nubs	8b386a172e	fix(calendar): route read requests to agent (#2452 )	2026-06-05 09:24:04 +01:00
Nicholai	4df4cfeaff	Merge pull request #2387 from cirim-au/fix/manage-memory-always-available fix(tool_index): add manage_memory to ALWAYS_AVAILABLE	2026-06-05 02:14:10 -06:00
nubs	5271d529d6	fix(tool-schemas): preserve web_search time_filter through native tool-call conversion (#2757 )	2026-06-05 08:00:59 +01:00
pewdiepie-archdaemon	fbd34334a5	Calendar overnight-event rendering + clickable [View note] link from chat - Calendar overnight events render proportionally across day boundaries via --start-frac / --end-frac CSS vars instead of bleeding as full-day on day 2. - Recurring-event delete strips the master uid + all master::* sibling instances optimistically so the row clears immediately instead of waiting for the next sync re-render. - manage_notes(create) now returns note_id + open_url, and agent_loop appends a markdown [View note](#note-<id>) link mirroring the deep-research pattern. - chatRenderer's hash-link router (already wired for #note-id) reaches the new notes.openNote(id) helper, which force-closes/reopens the Notes panel, polls for the target card, and runs a brief outline flash so the user can locate it on long lists.	2026-06-05 14:41:48 +09:00
pewdiepie-archdaemon	e2f449f4ef	Cookbook scheduler + serve: schedule via Tasks, Stop verifies kill, Ollama auto port-pick - Schedule cookbook serves through the existing ScheduledTask system: the serve preset gets a ^ button next to Launch that opens a daily/hourly/ weekly form mirroring the admin-switch style; the schedule action runs action_cookbook_serve, which delegates to /api/model/serve and stamps the resulting task with _scheduledStopAtMs. A background cookbook_serve_lifecycle loop ticks every 60s and kills any serve whose window has ended, also dropping the auto-registered endpoint so the model picker doesn't keep pointing at a dead server. - Stop and remove on a Running serve now awaits the SSH/tmux kill, re-checks tmux has-session, and surfaces an error toast (leaving the row) when the kill failed. Previously fire-and-forget, so a failed SSH/tmux call silently left the live serve running while the row vanished from the UI. - Cookbook tasks/status orphan-adoption sweep no longer requires the serve-/cookbook- session-id prefix; any tmux session whose pane is running a known model-server process gets auto-pulled into Running. Without this loosening, a cookbook-launched serve whose tmux id fell back to a bare number was invisible — you couldn't see it, let alone stop it. - Ollama serve always launches a fresh process under cookbook's tmux (no more monitor-mode reattach to a systemd/Docker ollama Stop can't reach). The handler pre-picks a free port by probing the target host over SSH and mutates req.cmd's OLLAMA_HOST so the runner script AND the auto-registered endpoint agree on the same bind port. - Auto-register uses host.docker.internal (when running inside Docker) instead of localhost, matching the URL /setup adds for Ollama by hand. Local cookbook serves now produce a chat-reachable endpoint on first launch. - Cascade-delete: removing a scheduled cookbook task also deletes any linked calendar event (cookbook_task_id marker in the description). - Tasks list groups cookbook_serve under a "Cookbook" category that sorts above the rest, so scheduler-launched serves are easy to find.	2026-06-05 14:41:43 +09:00
pewdiepie-archdaemon	f8aaeab245	Merge remote-tracking branch 'origin/dev'	2026-06-05 12:14:34 +09:00
pewdiepie-archdaemon	f19ac6ed03	Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus # Conflicts: # static/js/cookbookRunning.js	2026-06-05 11:23:15 +09:00
nubs	ae48ea7064	fix(mcp): sanitize and cap rendered MCP tool param hints (#2682 )	2026-06-05 03:00:22 +02:00
nubs	19a3fc59c9	fix(model-context): key context-window cache by (endpoint, model) (#2614 ) get_context_length() cached the resolved context window by model id alone, so two different remote endpoints serving the same model id (e.g. a capped proxy at 8k vs. the full provider at 200k) collided: the first to resolve won process-wide and the other endpoint was served the wrong window. That silently over-trims conversations on the larger-window endpoint (it feeds context_compactor) or overflows the smaller one (provider 400s). Key the cache on (endpoint_url, model). Local endpoints already always re-query, so they are unaffected. Fixes #2603	2026-06-05 02:50:56 +02:00
L1	f8cf791491	fix(caldav): don't prune locally-created events on sync (#2706 ) The CalDAV pull prunes events in the synced calendar+window whose UID the server didn't just return, to propagate upstream deletions. But CalendarEvent had no field distinguishing a server-pulled row from a locally-created one, so the prune also deleted events that were never on the server: events created by the agent / email triage (which never write back to the server) and UI events whose best-effort write-back failed. Result: silent, unrecoverable loss of the user's appointments (hard db.delete, no soft-delete). Add an 'origin' column to calendar_events (lightweight idempotent migration, mirroring _migrate_add_calendar_is_utc), set origin='caldav' on rows the sync inserts/updates, and gate the prune on origin == 'caldav'. Locally-created events carry origin NULL and are never pruned. On the first sync after the migration nothing is pruned (all rows NULL until re-marked), erring toward keeping data. Fixes #2704	2026-06-05 02:48:03 +02:00
Abylaikhan Zulbukharov	1d80bf5e65	feat(mcp): add Streamable HTTP transport with OAuth 2.0 (#1033 ) * feat(mcp): add Streamable HTTP transport with OAuth 2.0 Odysseus could only reach MCP servers over stdio and SSE, so modern remote servers like https://mcp.higgsfield.ai/mcp (Streamable HTTP, gated behind OAuth) could not be connected. Add an `http` transport that connects via the SDK's streamablehttp_client and authenticates with the SDK's OAuthClientProvider: RFC 9728 protected-resource discovery, RFC 8414 authorization-server metadata, Dynamic Client Registration, authorization-code + PKCE, and token refresh. A small bridge (src/mcp_oauth.py) connects the SDK's blocking callback to the existing web callback route via an asyncio.Future keyed by the OAuth `state`, and the dynamic client registration plus tokens persist per-server in a new encrypted `oauth_tokens` column. The connect runs as a bounded background task so the "Add server" request returns immediately; redirect_handler publishes needs_auth + auth_url to connection state as soon as discovery/DCR completes (which can exceed the bounded wait), and the UI polls until connected. Remote users finish via the existing paste-back flow. The Google OAuth path is left unchanged. - core/database.py: encrypted oauth_tokens column + migration - src/mcp_oauth.py: OAuth provider, DB-backed TokenStorage, state registry - src/mcp_manager.py: http dispatch, background connect, _connect_http - routes/mcp_routes.py: http validation, needs_auth/auth_url, callback bridge - static/js/settings.js: Streamable HTTP option + OAuth flow with polling - tests: 5 new unit tests (transport dispatch, registry, token storage) Verified against the live Higgsfield server: discovery, DCR (client_id issued), loopback redirect accepted, and a PKCE authorization URL with needs_auth status. No regressions (full suite delta is only the 5 added passing tests). * fix(mcp): address PR #1033 review feedback - mcp_oauth: derive redirect URI from OAUTH_REDIRECT_BASE_URL/APP_PUBLIC_URL (default http://localhost:7000) instead of hardcoding the port - mcp_oauth: leave OAuth scope unset so the SDK derives it from the server's WWW-Authenticate/protected-resource metadata; hardcoding an OIDC scope broke non-OpenID MCP servers (verified: Higgsfield still gets its server-derived scope) - mcp_oauth: prune abandoned OAuth flows (_prune_stale + _pending_ts) so the module-level registries can't grow unbounded - mcp_oauth: persist tokens/client-info in a single DB session/commit (_update) instead of a load+save double round-trip - mcp_manager: cancel and drop the background connect task in disconnect_server so a deleted server stops publishing status - database: document why the oauth_tokens migration uses TEXT while the model declares EncryptedText (encryption is applied at the Python layer) - settings.js: surface persistent OAuth-poll failures and an explicit timeout message instead of silently swallowing errors - tests: cover the stale-flow pruning * static/js/settings.js now shows an in-flight loading state on the buttons that fire requests:	2026-06-05 02:40:52 +02:00
Isaiah Gardner	134c608466	fix: degrade missing/None content key in system messages to empty string (#2570 )	2026-06-05 00:10:11 +02:00
Kenny Van de Maele	2be3779e6e	feat: Add workspace: confine agent tools to a folder (#1103 ) * feat: Add workspace: confine agent tools to a folder Pick a server folder as the agent's workspace so its file/shell tools work there and don't touch files outside it. File tools are hard-confined; bash/ python run with cwd set to the folder. Includes a slash command: `/workspace` (alias `/ws`) — show / `set <path>` / `clear` / `pick` (open the directory browser). - routes/workspace_routes.py: GET /api/workspace/browse (admin-only). - src/tool_execution.py: hard path confinement for read_file/write_file; bash/python cwd. Threaded route → stream_agent_loop → execute_tool_block. - src/agent_loop.py: workspace note prepended to the system prompt. - static/: overflow menu item, input-bar pill, directory-browser modal, and the /workspace slash command. - tests/test_workspace_confine.py. * Wire workspace confinement into tools that landed after this PR edit_file (#1239) and grep/glob/ls (#1670) merged after workspace-confine was written, so they bypassed the workspace boundary. Thread the workspace through: - edit_file: _do_edit_file resolves via _resolve_tool_path_in_workspace - grep/glob/ls: _resolve_search_root confines to the workspace (root + paths) - bash/python/bg cwd: workspace or _AGENT_WORKDIR (keep the #2586 data-dir default when no workspace is set) Tests cover edit_file + grep/ls confinement (inside ok, outside rejected). * Workspace picker: editable path bar + modal style cohesion + cross-platform hardening - Make the current-folder strip an editable address bar: type/paste a full path and press Enter to navigate (also reaches other Windows drives and hidden dirs the up-only browser cannot). - Reuse shared modal CSS: drop bespoke .workspace-modal-content/.workspace-btn* in favour of base .modal-content/.modal-body and the .confirm-btn button family; separators/hover use var(--border). Net -31 CSS lines. - Fix the path field overflowing the modal right edge (flex stretch + margin vs an overflow:auto scrollbar-feedback loop): full-bleed, no h-margin. - Cross-platform confinement: normcase the workspace commonpath check so containment holds on case-insensitive filesystems (Windows/macOS). - Make tests OS-portable: sibling temp dirs instead of /etc, python os.getcwd() instead of pwd. 5 pass.	2026-06-05 00:06:37 +02:00
Kenny Van de Maele	7b4365fe57	Make write_file/edit_file always-available like read_file (#2684 ) read_file/grep/glob/ls are in ALWAYS_AVAILABLE but the on-disk write tools (write_file, edit_file) were only surfaced via per-query tool-RAG retrieval. On a bare 'edit X' request the retriever could miss them, so the model was never offered edit_file/write_file and wrongly fell back to edit_document (editor panel) or improvised with bash sed. Add both to ALWAYS_AVAILABLE next to read_file; they stay admin-gated by tool_security so non-admin exposure is unchanged. Fixes #2683	2026-06-05 00:02:14 +02:00
pewdiepie-archdaemon	a260e0abd4	Revert calendar-based cookbook scheduler Reverts `b98ee04` + `4ed48ba` + `a19b6d2`. Calendar events turned out to be the wrong abstraction for scheduling model serve windows. Pivoting to the existing ScheduledTask infrastructure (cron / daily / weekly recurrence, next_run tracking, edit-from-Tasks-tab UI) in a follow-up commit. The ScheduledTask path: - reuses dispatch logic the rest of the app already understands - drops the calendar dependency entirely (no auto-created "Cookbook" calendar, no calendar.js hook) - shows up in the Tasks UI that already exists for everything else What this revert removes: - src/cookbook_scheduler.py — calendar reconciler - routes/cookbook_schedule_routes.py — /api/cookbook/schedule/* endpoints - static/js/cookbookSchedule.js — Schedule modal / settings card - cookbook_scheduler_enabled + cookbook_schedule_calendar_href settings keys - The window.cookbookOpenScheduleForm hook in calendar.js - The Schedule button + paired-button CSS in cookbookServe.js + style.css	2026-06-05 06:57:21 +09:00
Michiel Van de Velde	7ddc5eaef4	Merge pull request #2529 from NubsCarson/codex/2509-mcp-tool-input-params fix(mcp): expose MCP tool input parameters to the agent	2026-06-04 23:07:42 +02:00
Kenny Van de Maele	64d65b73c1	feat: round-limit handling — Continue affordance at the cap + configurable cap (#1999 ) * feat: round-limit handling — Continue affordance at the cap + configurable cap When the agent loop runs out of rounds (per-message step cap, default 20) while still actively using tools, it stopped silently mid-task. Now: 1. The loop emits a `rounds_exhausted` SSE event at the cap, and the UI shows a "Continue" pill at the bottom of the chat that resumes the task from where it left off. Repeated cap-hits each get a fresh Continue (multiple continues in a row). 2. The cap is configurable in Settings → Agent ("Max steps per message"), validated on the client, at the save endpoint, and at the read site. - src/agent_loop.py: track `_exhausted_rounds` (set only when a full tool-executing round completes on the last allowed round — i.e. the agent wanted to keep going); emit `{"type":"rounds_exhausted","rounds":N}` (logged). - routes/chat_routes.py: read `agent_max_rounds` (clamped 1..200), pass as `max_rounds`; forward the new event through the SSE relay. - routes/auth_routes.py: validate numeric settings on save (int + clamp; agent_max_rounds 1..200, agent_max_tool_calls 0..1000; 400 on non-int). - src/settings.py: default `agent_max_rounds = 20`. - static/: Settings input + client-side clamp; the Continue pill (reuses the existing .stopped-indicator / .continue-btn classes and theme vars --border/--fg/--bg/--accent); appended to the chat container so it survives the message re-render at stream finalize. chat.js cache version bumped. * test: cover rounds_exhausted emission (cap-hit vs normal finish) Drives the real stream_agent_loop with mocked LLM stream / tool exec / settings: a tool block every round exhausts the cap and must emit rounds_exhausted; a plain answer hits the done-break and must not. Guards the for/else logic.	2026-06-04 22:36:05 +02:00

1 2 3 4 5 ...

356 Commits