odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 09:45:24 -04:00

Author	SHA1	Message	Date
pewdiepie-archdaemon	9112861d8e	cookbook agent debug loop: persistent log files, auto-adopt orphan tmux, Codex/Claude skill parity Three converging fixes so the chat agent + external Codex/Claude skills can actually debug a crashed serve instead of staring at a post-crash neofetch banner: * Serves now `tee` to /tmp/odysseus-tmux/SESSION.log on the host running them. Runner saves fds 3/4 before the tee and restores them right before `exec ${SHELL}`, so the post-crash interactive zsh banner does NOT pollute the log file. * `tail_serve_output` (chat agent) and `/api/codex/cookbook/output/{sid}` (Codex+Claude skills) both prefer the persistent log file over the tmux pane. Pane is fallback for sessions predating the tee runner. Default tail bumped 150 -> 400. * `list_served_models` "recent log" snippet seeks to the Traceback line instead of showing the last 6 lines (which was always the bash prompt). Cookbook auto-adoption sweep on `/api/cookbook/tasks/status`: every 20s (rate-limited) the cookbook SSHes each configured server, finds `serve-` / `cookbook-` tmux sessions running an actual model process (vllm/python/llama-server/etc., filtered via `pane_current_command`), and writes them into state.tasks. So when the agent falls back to raw ssh+tmux, the session appears in the Cookbook UI on the next poll. `serve_model` error path now reads `data["detail"]` in addition to `data["error"]` so the FastAPI HTTPException message ("Invalid characters in cmd") actually reaches the agent instead of being swallowed as a generic "Serve failed". Tool description updated to warn against `cd …`/`source …`/`&&` prefixes. Intent-without-action supervisor in agent_loop: when the model writes "Let me tail the output" / "I'll check the logs" / "Let me investigate" and ends the turn without emitting a tool call, the loop injects a sharp system nudge ("You said you would X — DO IT NOW") and continues. Capped at 2 nudges per chat so a model that genuinely cannot use the tool does not pin the loop. Codex/Claude skill parity: adds `/cookbook/cached`, `/cookbook/presets`, `/cookbook/preset/{name}`, `/cookbook/adopt` so external agents have the same surface as the chat agent. SKILL.md docs + odysseus_api.py wrapper updated for both bundles. `adopt_served_model` promoted to the always-on tool set so the agent has a documented fallback when serve_model rejects a cmd. Also various cookbook UI tweaks accumulated alongside the above (cookbook.js, cookbookRunning.js, cookbookServe.js, cookbook-diagnosis.js, settings.js, style.css).	2026-06-04 23:27:18 +09:00
Giuseppe	f6a5f6592f	fix: log warnings on silently swallowed agent and endpoint failures (#2367 ) get_builtin_overrides() was swallowing all exceptions with a bare `except Exception: pass`, so misconfigured tool-description overrides would silently produce wrong agent behaviour with no log trace. The background endpoint refresh loop had the same pattern: any probe failure was silently ignored, giving operators no signal that the refresh was broken. Also removes a circular self-import (`from src.agent_loop import _build_base_prompt`) inside _build_system_prompt; the function is already in scope and the import created a latent circular reference risk. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-04 12:29:31 +01:00
Giuseppe	68cb715914	fix(endpoint): import ModelEndpoint from core database ModelEndpoint is defined in core.database, not src.database. The wrong import silently prevented the module from loading in deployment configurations that do not have a src/database.py shim, resulting in an ImportError at startup. Also adds a warning log when resolve_endpoint finds no usable model (all models hidden or the list is empty), making the otherwise-silent failure visible in operator logs. The test_auth_regressions stub for src.endpoint_resolver was missing the build_models_url attribute, which caused test collection errors. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-04 11:51:47 +01:00
Marius Popa	dc365a1b27	Fix Ollama agent single-token responses (#1591 ) Agent mode treated local /v1 endpoints, including Ollama on :11434, as native-tool-capable by host/model heuristics. On Ollama's OpenAI-compatible surface some models that advertise tool support stop after a single token when schemas are sent (issue #1567). Default local Ollama /v1 back to fenced tool blocks unless the endpoint explicitly has supports_tools=True. Also compare both the runtime chat URL and the normalized endpoint base when reading ModelEndpoint.supports_tools. That keeps a saved base URL such as http://localhost:11434/v1 effective when the active session URL is /v1/chat/completions. Tests: .venv/bin/python -m pytest tests/test_tool_support_heuristic.py	2026-06-04 11:45:10 +01:00
ooovenenoso	e163384015	fix: treat Nix files as readable uploads (#2249 )	2026-06-04 12:06:24 +02:00
Nicholai	4dc11cfe6b	refactor(memory): canonicalize memory imports (#50 )	2026-06-04 05:31:15 +01:00
Yuri	a2e691da2b	fix(models): stabilize proxy endpoint refresh behavior * fix: support large proxy model endpoint refresh Large OpenAI-compatible proxy endpoints can expose hundreds of models and make /v1/models slow. Treating those endpoints like local model servers caused model picker opens and background probes to repeatedly hit /models, producing timeouts and making otherwise usable endpoints appear offline. Make model endpoint discovery cached-first for normal UI usage, add explicit proxy/API classification and refresh policy fields, exclude proxy/API endpoints from aggressive local probing, and preserve cached models when refresh fails. Manual Test/Add/Refresh actions still fetch the full model list with longer timeouts so users can intentionally import large proxy model lists without blocking normal model picker usage. * fix: preserve endpoint ping status semantics	2026-06-04 04:56:11 +01:00
Sushanth Reddy	eee2167502	Stop API key save() from writing other providers' keys as plaintext (#1944 ) save() called load(), which DECRYPTS every stored key, then re-encrypted only the key being saved and wrote the whole dict back. The other providers' keys were thus persisted in plaintext; on the next load() Fernet raised InvalidToken on them and they were silently dropped. Add _load_raw() that returns the still-encrypted on-disk dict (reusing the existing missing/corrupt-file guards) and have save() build on that, so untouched providers keep their ciphertext. load() now also goes through _load_raw(), keeping its behavior identical. Fixes #1914 Co-authored-by: EkaTantra Dev <dev@ekatantra.com>	2026-06-04 04:47:13 +01:00
Afonso Coutinho	49c14af5c7	fix(calendar): scope CalDAV event lookup by calendar * fix: CalDAV sync hijacks another user's event sharing a VEVENT uid * Seed schema-valid dtstart/dtend in caldav uid-scope test fixture	2026-06-04 04:01:21 +01:00
Vykos	5f58f9a45f	fix(ai): scope tool model resolution by owner * Stabilize full test collection * Scope AI tool model resolution by owner	2026-06-04 00:37:28 +01:00
Vykos	aaef6b1c49	fix(search): align content URL guards * Stabilize full test collection * Align search content URL guards	2026-06-04 00:34:06 +01:00
Vykos	193dc2f085	fix(uploads): bound direct upload reads * Stabilize full test collection * Add bounded reads for direct uploads	2026-06-04 00:32:50 +01:00
pewdiepie-archdaemon	089246614d	feat: Claude Agent integration + cookbook reconnect + UI polish - Claude Agent integration: AGENT_CONFIGS.claude, INTG_TYPES.claude, setup_claude_routes + integrations/claude/ skill bundle. Wired in app.py alongside the existing Codex integration; same scope-gated /api/codex/* backend; agent form has new description so users know it's setup for an external CLI, not an agent streamed inside Odysseus. - Remove mark_email_boundaries action: not good enough yet. Stripped from task UI, scheduler defaults, registry, tool schema, clear-cache route. Added to RETIRED_HOUSEKEEPING_ACTIONS so existing rows + their task_runs auto-purge on startup. - Cookbook download reliability: "Reconnect" fix button in the crash diagnosis runs _reconnectTask after probing has-session. 30s confirm window before marking a download "done" — kills the Finished/Downloading flicker when tmux briefly drops between captures. - Mobile UX: tap anywhere on a note card body opens the editor; Update button morphs to Archive when no text was edited; bell icon accent-colored; chip-trashing notif pills fade so only the icon rotates into the trash zone. - Settings integrations: SVG-per-provider in email + API preset dropdowns, custom drop-up-aware menus, accent sub-header icons (IMAP/SMTP), consistent card styling between list + edit, contacts Edit/Delete icons, agent form description copy.	2026-06-04 08:27:26 +09:00
pewdiepie-archdaemon	6861c41580	Reapply "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus" This reverts commit `cc8fe2f6e3`.	2026-06-03 22:47:00 +09:00
pewdiepie-archdaemon	cc8fe2f6e3	Revert "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus" This reverts commit `8161c1253d`, reversing changes made to `8c2705b42a`.	2026-06-03 22:46:19 +09:00
Alexandre Teixeira	b1a4ed13b0	Harden API-token chat endpoint selection Validate only token-supplied direct base_url values for API-token chat requests, while keeping admin-configured endpoints available for local/LAN providers. Scope configured endpoint fallback selection to the API token owner, fail closed for unknown token owners, and preserve strict session ownership checks when resuming sessions from chat-scoped API tokens. Add focused regression coverage for direct base_url SSRF rejection, configured endpoint fallback behavior, token-owner scoping, URL validation, and null-owner session/endpoint handling.	2026-06-03 13:05:13 +01:00
Alexandre Teixeira	a75dd4a231	fix(search): apply recency UTC fix to live ranking module	2026-06-03 12:49:32 +01:00
Shaw	49bf73b228	fix(forms): keep PDF-form export from dropping values when the label has '' (#1407 ) parse_markdown_to_values — the read-back path for export-pdf, the export preview, and prepare-signed-reply — matched the bold field label with [^]+, so it could not match a label containing '' (the near-universal required-field marker: "Email ", "State ", "Signature "). The value then stayed empty, so the exported PDF and the signed-reply attachment came out blank for that field with no error — a whole form of required fields could export completely empty. Match the label non-greedily (.+?) so '' in labels is tolerated while still splitting at the first ':' / '[', which also preserves a value that itself contains ':*'. Adds tests/test_form_markdown_roundtrip.py (render -> parse roundtrip): asterisk text/choice/signature labels survive (fail before, pass after); plain labels and colon-bearing values are unaffected. Co-authored-by: NubsCarson <nubs@nubs.site>	2026-06-03 14:24:07 +09:00
Afonso Coutinho	b55c970ec5	fix: sports-hint ranking penalty fires on 'transport'/'passport' substrings (#1473 ) * fix: sports-hint ranking penalty fires on 'transport'/'passport' substrings * Apply word-boundary sports-hint fix to src/search/ranking.py as well	2026-06-03 14:23:52 +09:00
Paulo Victor Cordeiro	1feb2ae7d5	fix: close AsyncExitStack on MCP init/tool-discovery failure (#1493 ) If session.initialize() or list_tools() raises after the stdio subprocess or SSE connection is already open, the AsyncExitStack is never closed — leaking the child process or HTTP connection. Wrap the setup phase in try/except to aclose() the stack before re-raising.	2026-06-03 14:23:46 +09:00
ghreprimand	8c4ea484a9	Cap inline attachment context across files (#1498 ) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-03 14:23:43 +09:00
Lucas Daniel	398892cced	fix(settings): catch PermissionError in load_settings + error-path tests (#1570 ) PermissionError was not in the except tuple so an unreadable settings.json would crash the app instead of falling back to defaults. Added alongside the existing FileNotFoundError/JSONDecodeError/ValueError catches. Also adds test_settings_error_paths.py covering all four failure modes: missing file, corrupted JSON, wrong type, and permission denied.	2026-06-03 14:23:27 +09:00
danielroytel	39848a168b	fix: recognize Gemma 4 as a thinking model and add context entry (#1642 ) Gemma 4 returns reasoning_content in streaming responses via llama-server, but the model wasn't listed in _THINKING_MODEL_PATTERNS, causing reasoning tokens to be mishandled. Add "gemma" to the pattern list and register Gemma 4's 128K context window in KNOWN_CONTEXT_WINDOWS so the agent loop budgets context correctly. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-06-03 14:23:18 +09:00
Afonso Coutinho	b45611e9c5	fix: _strip_reasoning_prose discards the answer when reasoning trails it (#1643 )	2026-06-03 14:23:15 +09:00
Afonso Coutinho	3e33cf6439	Anchor shell-verb intent patterns to imperative or can-you position (#1664 )	2026-06-03 14:23:10 +09:00
Afonso Coutinho	8a0b79bc84	fix: deep research runs the prompt's example queries when the model echoes them (#1666 )	2026-06-03 14:23:07 +09:00
Afonso Coutinho	b396252af6	fix: monthly tasks scheduled for day 29-31 skip every short month (#1668 )	2026-06-03 14:23:01 +09:00
Afonso Coutinho	1161040efe	fix: visual report drops photos whose URL slug contains icon or logo (#1685 )	2026-06-03 14:22:45 +09:00
Shaw	eb5727abda	fix(agent): coerce non-object tool-call arguments instead of crashing (#1370 ) A native function/tool call whose `arguments` field is valid JSON but not an object — a bare array like ["ls -la"], or a string/number/bool/null — parsed fine in function_call_to_tool_block and then every branch called args.get(...), raising AttributeError ('list'/'str' object has no attribute 'get'). That propagated out of the streamed agent loop (no surrounding try/except at the call site in stream_agent_loop) and aborted the user's entire turn. Weaker and local models routinely emit malformed args like this. Coerce non-dict parsed arguments to {} (mirrors the existing empty-arguments behavior), so the tool runs with empty args instead of killing the stream. Adds tests/test_function_call_non_object_args.py covering array/string/number/ bool/null arguments — they fail before this change and pass after.	2026-06-03 14:14:37 +09:00
ghreprimand	41d2767b30	Replace task scheduler utcnow calls (#1456 ) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-03 14:14:30 +09:00
Marius Oppedal Ringsby	4f03f5ccdd	Replace cleanup service datetime.utcnow calls (#1494 ) datetime.utcnow() is deprecated in Python 3.12 and removed in 3.14. Swap the five calls in src/cleanup_service.py for a local _utcnow() helper returning naive UTC, matching the naive DateTime columns the archive/delete cutoffs compare against (same approach as the task-scheduler and core-database slices). Add a regression test asserting the helper stays naive so the cutoff math can't hit a naive/aware TypeError. Part of #1116	2026-06-03 14:14:27 +09:00
ghreprimand	6fd52cf317	Replace webhook manager datetime.utcnow calls (#1499 ) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-03 14:14:23 +09:00
red person	56cd8add18	Fall back from invalid preset stores (#1402 )	2026-06-03 14:12:31 +09:00
Afonso Coutinho	33ae982968	fix: context_compactor token helpers crash on non-string message text (#1634 ) * fix: context_compactor token helpers crash on non-string message text * fix: _truncate_text_to_token_budget returns an empty string for non-string text, not the raw value	2026-06-03 14:12:14 +09:00
Shaw	63aa15d155	fix(scheduler): fail closed on malformed scheduled_time instead of 500 (#1410 ) compute_next_run parsed scheduled_time as "HH:MM" with int(parts[0]), int(parts[1]) and no validation, so "9", "9am", "25:00", "9:" or ":30" raised IndexError/ValueError. The POST /tasks create route passes the user/LLM-supplied scheduled_time before its try block (and only validates the cron field), so a bad value surfaced as an unhandled 500 rather than the clean 400 used for other invalid fields — and the same crash could fire inside the scheduler loop when recomputing next_run for an already-stored bad row. Guard the parse and fail closed (warn + return None), matching the existing invalid-cron handling in the same function. Adds tests/test_scheduler_scheduled_time_validation.py — malformed values return None (fail before with IndexError/ValueError), valid HH:MM still computes.	2026-06-03 14:12:07 +09:00
red person	db8c0b3dac	Ignore non-string background stream deltas (#1549 )	2026-06-03 14:11:45 +09:00
red person	38bfa85ad0	Reject invalid Tailscale discovery JSON (#1556 ) * Reject invalid Tailscale discovery JSON * Guard nested Tailscale IP shapes	2026-06-03 14:11:31 +09:00
Afonso Coutinho	1453458519	fix: is_public_blocked_tool crashes on a truthy non-string tool name (#1620 ) * fix: is_public_blocked_tool crashes on a truthy non-string tool name * fix: is_public_blocked_tool fails closed (blocks) on a malformed non-string tool name	2026-06-03 14:11:14 +09:00
red person	d1309f3bd6	Ignore non-object settings scrub inputs (#1645 )	2026-06-03 14:11:05 +09:00
red person	b409b20940	Handle non-string src search queries (#1646 )	2026-06-03 14:11:02 +09:00
red person	558d6ddf24	Ignore invalid background job store rows (#1261 )	2026-06-03 14:07:14 +09:00
red person	34efabdec8	Ignore invalid integration rows (#1404 )	2026-06-03 14:07:11 +09:00
Afonso Coutinho	1571d8bba0	fix: agent_tools._truncate crashes on non-string input (#1624 ) * fix: agent_tools._truncate crashes on non-string input * fix: agent_tools._truncate returns a string for non-string input, not the raw value	2026-06-03 14:06:39 +09:00
Afonso Coutinho	3a741edbf1	fix: visual_report markdown helpers crash on a non-string input (#1633 )	2026-06-03 14:06:35 +09:00
red person	8af1f85665	Ignore non-string email thread bodies (#1654 )	2026-06-03 14:06:31 +09:00
Afonso Coutinho	28dbd5346c	Treat non-string research summaries as low quality Filter malformed non-string research summaries instead of letting the broad exception path classify them as usable, with regression coverage.	2026-06-03 13:42:24 +09:00
Afonso Coutinho	a880b17624	Skip malformed personal keyword index rows Make personal keyword retrieval tolerate corrupted non-dict index entries and missing chunk lists, with regression coverage.	2026-06-03 13:42:05 +09:00
Afonso Coutinho	35b9509da3	fix: memory entry validation crashes on a non-dict row from memory.json (#1691 )	2026-06-03 13:38:02 +09:00
Afonso Coutinho	f0b172020e	fix: require_privilege 500s on a non-dict privileges blob from auth.json (#1693 )	2026-06-03 13:37:54 +09:00
Afonso Coutinho	02ff2e3cb0	fix: updating a calendar event ignores user timezone and shifts the time (#1695 )	2026-06-03 13:37:39 +09:00

1 2 3 4 5

233 Commits