odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 17:55:26 -04:00

Author	SHA1	Message	Date
BSG-Walter	c0466274ed	fix: resolve DuckDuckGo redirect URLs in HTML fallback search The DuckDuckGo HTML fallback returns redirect URLs (//duckduckgo.com/l/?uddg=...) instead of actual page URLs. This caused fetch_webpage_content() to reject them instantly because _public_http_url() requires an http/https scheme, making search results unfetchable in deep research mode. Added _resolve_url() to: - Convert protocol-relative URLs to absolute (https:) - Convert path-relative URLs to absolute - Extract the real URL from DuckDuckGo's /l/?uddg= redirect parameters	2026-06-01 19:42:01 -03:00
pewdiepie-archdaemon	ab0a480f30	Show Ollama models in Cookbook Serve	2026-06-02 07:38:45 +09:00
Ernest Hysa	cb6f6b65ea	fix(research): validate session_id to block path traversal Every research endpoint interpolates session_id into filesystem paths (Path('data/deep_research') / f'{session_id}.json') without checking for traversal sequences. A crafted ID like '../../data/auth' reaches arbitrary JSON files — readable via research_detail (which also leaks file paths in error messages), writable via research_archive, and deletable via research_delete. Add _validate_session_id() which rejects anything outside [a-zA-Z0-9-]{1,128}. Called before filesystem access in all 12 endpoints that accept a session_id path parameter.	2026-06-01 23:25:38 +01:00
pewdiepie-archdaemon	cd53ad01e8	Clarify AI tasks and skipped activity rows	2026-06-02 07:11:40 +09:00
pewdiepie-archdaemon	81109b85d3	Fix Brain tab panel visibility	2026-06-02 07:07:51 +09:00
pewdiepie-archdaemon	cd0c5fec03	Match mobile task state button height	2026-06-02 07:06:17 +09:00
pewdiepie-archdaemon	ed946d8e61	Polish task activity icons	2026-06-02 07:04:52 +09:00
pewdiepie-archdaemon	1ff8669199	Compact mobile task controls	2026-06-02 07:02:26 +09:00
pewdiepie-archdaemon	7f9afe75e2	Remove mobile notes close button	2026-06-02 07:00:40 +09:00
pewdiepie-archdaemon	b7477d063a	Clarify task status controls on mobile	2026-06-02 06:57:53 +09:00
pewdiepie-archdaemon	59516ec126	Make favorite dot feedback transient	2026-06-02 06:52:03 +09:00
pewdiepie-archdaemon	637c7511a2	Add model favorite dot feedback	2026-06-02 06:50:22 +09:00
pewdiepie-archdaemon	4a112175e2	Remove broken remind slash command	2026-06-02 06:48:41 +09:00
pewdiepie-archdaemon	eda0f1258a	Nudge model picker favorite dots	2026-06-02 06:46:05 +09:00
pewdiepie-archdaemon	d5c7e3d3e4	Add direct tool slash commands	2026-06-02 06:44:29 +09:00
pewdiepie-archdaemon	3959eec602	Refresh slash command hints	2026-06-02 06:40:23 +09:00
pewdiepie-archdaemon	5a5e0e9823	Adjust model picker favorite dot alignment	2026-06-02 06:36:10 +09:00
pewdiepie-archdaemon	3c1e0edea3	Polish model picker favorites	2026-06-02 06:33:53 +09:00
tanmayraut45	2d7d7b2412	Fix TOCTOU race in chat stream status endpoint The /api/chat/stream_status handler did a membership test against _active_streams followed by an indexed read of the same key. Between those two ops, a sibling stream's finally block (or a stop / cleanup path) can pop the entry, turning the indexed read into a KeyError that bubbles up as a 500. The race is the exact one _stream_set was already written to avoid; the comment on the helper at the top of the module spells out why a single .get() is the right pattern here too. Collapse the two-step into a single .get() call so the lookup either returns the live record or None, and report 'detached' / 404 based on that single read. No behavior change on the happy path; the failure mode under concurrent stream cleanup is now handled deterministically. Closes #658.	2026-06-02 03:02:30 +05:30
pewdiepie-archdaemon	e5cae37d15	Merge branch 'pr-673' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	5f2509d6a8	Merge branch 'pr-644' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	7242431335	Merge branch 'pr-738' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	2e0b384d72	Merge branch 'pr-480' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	b3765c7b63	Merge branch 'pr-611' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	b041e53c0b	Merge branch 'pr-506' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	0ae30211d8	Merge branch 'pr-550' into visual-pr-playground	2026-06-02 06:26:32 +09:00
pewdiepie-archdaemon	6873b60721	Merge branch 'pr-594' into visual-pr-playground	2026-06-02 06:26:31 +09:00
pewdiepie-archdaemon	c1cb6f0d55	Merge branch 'pr-575' into visual-pr-playground	2026-06-02 06:26:31 +09:00
pewdiepie-archdaemon	664acf73ee	Merge branch 'pr-469' into visual-pr-playground	2026-06-02 06:26:31 +09:00
pewdiepie-archdaemon	7ef7791ac8	Merge branch 'pr-684' into visual-pr-playground	2026-06-02 06:26:31 +09:00
pewdiepie-archdaemon	3224cd2ec7	Merge branch 'pr-668' into visual-pr-playground	2026-06-02 06:26:31 +09:00
pewdiepie-archdaemon	49ae46001c	Merge branch 'pr-696' into visual-pr-playground	2026-06-02 06:26:31 +09:00
tanmayraut45	d2bad10781	Fix searxng container permission errors during setup A fresh `docker compose up -d` shows the searxng container failing its healthcheck with permission errors at setup (reported in #721 — the service comes up under names like `odysseus_searxng_1` and never goes ready, which then blocks the main odysseus container because of the `depends_on: searxng: condition: service_healthy` gate). Root cause: the official `searxng/searxng:latest` image runs as the non-root `searxng` user but its entrypoint still needs to 1. chown /etc/searxng on first boot so the persisted named volume is owned by the searxng user inside the container, 2. su-exec to drop / re-assert privileges before launching uwsgi, and 3. let our wrapper entrypoint (which seeds settings.yml into the named volume on first boot) write the file through the volume mount. Without explicit `cap_add`, the container has neither CHOWN nor DAC_OVERRIDE nor SETUID/SETGID, so the entrypoint aborts at the first chown / su-exec / redirection with EACCES. The upstream searxng-docker compose file solves this with the standard "drop everything, grant only what's needed" capability pattern. Fix: mirror the upstream cap_drop ALL / cap_add CHOWN, SETGID, SETUID, DAC_OVERRIDE on the searxng service. This grants only the four caps the entrypoint actually needs, matches what searxng-docker ships with, and leaves ports, volumes, env, healthcheck, and the wrapper entrypoint unchanged. Closes #721.	2026-06-02 02:47:30 +05:30
pewdiepie-archdaemon	7b9ef95b60	Stabilize auth session revocation tests	2026-06-02 06:02:49 +09:00
Lohinth	a8d9a180d9	Scope document tools to caller owner Co-authored-by: Lohinth <lohinth25@proton.me>	2026-06-02 06:00:02 +09:00
Ernest Hysa	d42e6a7acc	Scope skill mutations to caller owner SkillsManager.update_skill walks every SKILL.md on disk and matches by slug only; the 'owner' key in its scalar_keys whitelist meant a caller could pass updates={'owner': 'attacker', 'description': 'pwned'} and the first matching file on disk got silently re-owned. Two users with the same slug under different category directories (which is supported by the on-disk layout <category>/<name>/SKILL.md) could each stomp the other's skill via the manage_skills tool or the in-process callers in tool_implementations.py (edit, patch, publish, delete). update_skill and delete_skill now require the caller's owner and only match a file whose parsed owner field matches. The default of None means 'no scope' and only matches ownerless skills, so an unsafe call without an explicit owner is now a no-op. 'owner' is also removed from scalar_keys so the updates dict cannot be used to reassign ownership even when the manager is called from an in-process path that didn't supply the owner argument. The in-process callers in tool_implementations.py are updated to pass owner=owner (which was already in scope at every call site) so the HTTP and agent paths both go through the scoped check. The HTTP route at routes/skills_routes.py:1499 was already owner-scoped via sm.load(owner=user); the fix brings the in-process path up to the same standard.	2026-06-02 05:59:43 +09:00
Alexandre Teixeira	5dd5847d4b	Revoke stale sessions after password change After a successful password change, revoke all browser sessions for the same user except the one that submitted the request. This prevents stale sessions on other devices from remaining valid after credentials are updated. Keep API-token behavior unchanged. The current browser session is preserved so the user can continue from the tab that changed the password. Add focused regression tests for preserving the current session, revoking other sessions, persisting revocation, and avoiding revocation when the current password is incorrect.	2026-06-02 05:59:22 +09:00
SurprisedDuck	7d10fb6260	Reserve internal sentinel usernames `core.middleware.require_admin` grants admin to any request whose `request.state.current_user == "internal-tool"` — the sentinel meant only for the in-process tool-loopback path. But the normal cookie auth path (app.py) sets `current_user` to the raw username, and neither `create_user` nor the signup route reserved that name. As a result an account literally named "internal-tool" was silently treated as admin by every `require_admin`-gated route. With self-service signup enabled this is an anonymous -> admin privilege escalation. Reserve the full synthetic-owner set the codebase already special-cases — "internal-tool", "api", "demo", "system" (see `_SYNTHETIC_OWNERS` in routes/assistant_routes.py and the matching guards in src/task_scheduler.py and routes/research_routes.py). "api" collides with the bearer-token owner sentinel; "demo"/"system" would leave a real account denied an assistant and inconsistently owner-scoped. Refuse to create or rename into any reserved name (case/space-normalized), and reject empty usernames while we're here. Adds a regression test. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-02 05:58:58 +09:00
SurprisedDuck	b70ae56ffa	Sanitize preserved markdown HTML `mdToHtml` deliberately stashes literal <details> blocks and <a> tags from the source text before the global HTML-escape pass and restores them verbatim into the string callers assign to `innerHTML` (e.g. chatRenderer's `b.innerHTML = ...processWithThinking(text)`). Nothing scrubbed those fragments, so message/agent content containing `<details><img src=x onerror=...></details>` or `<a href="javascript:..." onmouseover=...>` executed arbitrary script in the authenticated page. Route both stashed fragments through `sanitizeAllowedHtml()`, which parses them in an inert <template> (no resource loads, no script execution), removes script-capable elements, and strips event-handler attributes plus javascript:/vbscript:/data: URL schemes. Hardening details: - Compare tag names case-insensitively and drop the SVG/MathML foreign- content roots. An SVG-namespaced <script> has the lower-case tagName 'script', so an HTML-only upper-case check would miss it — a real bypass. - Sanitize to a fixpoint (re-parse + re-clean until stable) to blunt mutation-XSS, where re-serializing/re-parsing reshapes the tree. Benign anchors and <details> blocks are preserved unchanged. Verified under jsdom against the obvious vectors plus mutation-XSS probes (svg/math-namespaced <script>, foreignObject, ns-confusion, comment breakout, template smuggling): no script/iframe element, event handler, or javascript:/data: URL survives, and benign markup is kept. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-02 05:58:38 +09:00
ghreprimand	491a8a5480	Harden backup restore tar extraction Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-02 05:55:03 +09:00
Alexandre Teixeira	26483661da	Restrict provider discovery to admins Require admin access before serving provider discovery data from GET /api/providers. This prevents normal authenticated users from triggering provider discovery or receiving cached provider host data. Keep GET /api/models available to normal users and leave the existing admin-only GET /api/discover behavior unchanged. Add a focused regression test to ensure unauthorized callers cannot trigger discovery and cannot receive cached provider data.	2026-06-02 05:54:40 +09:00
SurprisedDuck	7268c49992	Make LLM host health maps thread-safe The synchronous llm_call() runs in FastAPI's threadpool (sync route handlers such as POST /sessions/auto-sort), while llm_call_async() runs on the event loop. Both mutate the module-level _response_cache, _host_fails and _dead_hosts dicts, so these are touched from multiple OS threads concurrently. Two races result: - _set_cached_response() snapshots 64 keys then deletes them with `del _response_cache[key]`; if another thread evicts the same key first, the del raises KeyError mid-eviction. Switched to pop(key, None). - _mark_host_dead() does get()+1+set() on _host_fails with no lock, so concurrent connect failures lose increments and a genuinely dead host can stay under its cooldown threshold. Guarded the host-health maps with a threading.Lock (also applied to _is_host_dead / _clear_host_dead for consistent reads). Adds tests/test_llm_core_concurrency.py with deterministic regression tests (phantom snapshot key for the eviction race; a slow-read dict that forces the lost-update window for the counter). Both fail on the unpatched code and pass with the fix.	2026-06-02 05:54:23 +09:00
ooovenenoso	cd6041477c	Refresh local model context after restart Co-authored-by: Kevin <120500656+oooindefatigable@users.noreply.github.com>	2026-06-02 05:54:06 +09:00
Prakhya	a96593a99b	Improve Ollama endpoint error messages	2026-06-02 05:53:50 +09:00
SurprisedDuck	7a830e504d	Escape email fold summary metadata The email reader folds quoted history into <details> summaries via `_foldSummary()` (static/js/emailLibrary/signatureFold.js), which builds a sender/date "meta" chip into the summary HTML and assigns it to innerHTML. The server-side thread parser (`_extract_quote_meta`, src/email_thread_parser.py) strips tags but then un-escapes HTML entities and preserves `<...>` patterns, and that raw meta reaches `_foldSummary` unescaped via `_renderTurnsFromServer` (`t.meta`) — so an inbound email whose quoted attribution contains `From: <img src=x onerror=...>` runs script when the victim merely opens the message (stored XSS). Make `_foldSummary` the single escaping chokepoint: escape `primary` and `subMeta` with the module's existing `_esc`. The client-side `_extractQuoteMeta` previously pre-escaped its output, and every consumer of it routes through `_foldSummary`, so drop that now-redundant escaping to avoid double-encoding (e.g. "Ben & Jerry" -> "Ben &amp; Jerry"). Verified (jsdom): server-raw and client-extracted malicious metas yield 0 live elements and 0 event-handler attributes; benign "Ben & Jerry" renders single-escaped. Co-authored-by: Claude <noreply@anthropic.com>	2026-06-02 05:50:53 +09:00
Yatsuiii	63d93ff211	Normalize stored usernames on auth load verify_password() and create_session() both call .strip().lower() on the incoming username, but _load() stored keys verbatim from auth.json. Any mixed-case key (e.g. written by manual edit or a future migration) would never match, producing a permanent 'Invalid credentials' error. Fix: lowercase all keys at load time so the in-memory dict always matches what the login path expects. Fixes #423	2026-06-02 05:50:36 +09:00
Afonso Coutinho	5da662441c	Validate slash command time minutes * fix: reject hour > 23 in 'today/tomorrow' reminder time parsing * fix: reject minute > 59 in reminder time parsing	2026-06-02 05:50:19 +09:00
Elle	d885c70462	Treat Docker host gateway as local When running Odysseus in Docker and connecting to a local LLM on the host machine (e.g. `llama.cpp` or `Ollama`), the standard endpoint `http://host.docker.internal` is used to breach the container network. Because `host.docker.internal` was missing from `_LOCAL_HOSTS`, Odysseus incorrectly treated local self-hosted models as cloud APIs. This triggered the fallback behavior where actual API-reported context limits were being ignored and overridden by hardcoded fallbacks in `KNOWN_CONTEXT_WINDOWS`. Changes - Added `"host.docker.internal"` to the `_LOCAL_HOSTS` whitelist in `src/model_context.py` so that Dockerized deployments correctly trust and respect the context limits of locally hosted models. Checks Ran - [x] Syntax check (`python -m py_compile src/model_context.py`) - [x] Tested manually in Docker (`docker compose up -d --build`) on a Windows host using `llama-server`. The correct API context length is now correctly reported in the UI instead of falling back to the 131k hardcode.	2026-06-02 05:49:59 +09:00
2revoemag	3ef88fc7ff	Recognize Gemma as tool-capable Gemma models (gemma-2/3/4) support OpenAI-style function calling, but "gemma" was missing from the _model_supports_tools heuristic in stream_agent_loop(). On a non-allowlisted endpoint (e.g. a self-hosted OpenAI-compatible server), a Gemma-backed agent therefore never receives native tool schemas and falls back to the prompt-text tool-call convention — which Gemma does not follow. The result is that tool calls are emitted as raw text and never execute. Add "gemma" to the capability keyword list alongside the other tool-capable families. Co-authored-by: 2revoemag <2revoemag@users.noreply.github.com> Co-authored-by: Claude <noreply@anthropic.com>	2026-06-02 05:49:43 +09:00
pewdiepie-archdaemon	e03491664a	Stabilize security regression tests	2026-06-02 05:48:59 +09:00

... 3 4 5 6 7 ...

401 Commits