odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 17:55:26 -04:00

Author	SHA1	Message	Date
James Arslan	6776c7d691	Surface silent model fallback instead of masking it (#868 ) When the selected model fails before producing output, stream_llm_with_fallback quietly switches to the next candidate and the reply is shown under the originally selected model's name, so a misconfigured provider looks like it works. (Concretely: a Bedrock gateway that 400s every Anthropic/Claude request appears fine because another model silently answers under the Claude label.) Emit a `fallback` SSE event ({selected_model, answered_by, reason}) the first time a non-primary candidate produces output, forward it through the agent loop and both chat-route paths, stamp the response metrics with the model that actually answered, and show a notice + relabel the reply in the UI. Tested: python -m pytest tests/test_llm_core_fallback.py (3 pass); python -m py_compile src/llm_core.py src/agent_loop.py routes/chat_routes.py; node --check static/js/chat.js.	2026-06-02 11:37:25 +09:00
tanmayraut45	0e31c38be0	Support in-place endpoint updates and recover empty-model sessions (#786 ) The "don't wipe endpoint_url/model on endpoint delete" half of #587 landed in `6a78b02` (Fix endpoint model preservation for tasks). The three remaining follow-up pieces from the original PR — flagged in the review on #786 — are: - routes/model_routes.py: toggle_model_endpoint (PATCH) now accepts api_key and base_url, so the admin UI can rotate a key or fix a typo'd URL without going through delete+recreate. base_url is normalized the same way the POST handler does (strip /models, /chat/completions, /completions, /v1/messages, then _normalize_base). Cache invalidation matches the POST/DELETE paths and the response includes base_url so the frontend can confirm what was saved. - routes/chat_routes.py: new _recover_empty_session_model picks cached_models[0] from the endpoint that matches sess.endpoint_url and persists it onto the Session row before the LLM call goes out. Wired into both /api/chat and /api/chat_stream after the existing _clear_orphaned_session_endpoint guard, so the order is: drop truly-orphaned sessions first, then heal the "picker showed it, session never knew" case. - routes/chat_routes.py: when recovery fails (no endpoint, no cached models) raise HTTP 400 with a clear message instead of letting model="" reach the upstream as 401/503. Closes #587.	2026-06-02 11:26:38 +09:00
Mahdi Salmanzade	4a84a895a0	Keep reasoning (thinking) tokens out of the saved chat reply (#856 ) Streamed deltas flagged thinking:true (reasoning-model traces) were being folded into full_response and persisted as part of the assistant message, so saved replies were polluted with the model's chain-of-thought. Forward those deltas to the client (for a live thinking indicator) but exclude them from the accumulated saved reply, in both chat and research-stream paths. Mirrors the existing rewrite path's handling.	2026-06-02 11:17:41 +09:00
tanmayraut45	2d7d7b2412	Fix TOCTOU race in chat stream status endpoint The /api/chat/stream_status handler did a membership test against _active_streams followed by an indexed read of the same key. Between those two ops, a sibling stream's finally block (or a stop / cleanup path) can pop the entry, turning the indexed read into a KeyError that bubbles up as a 500. The race is the exact one _stream_set was already written to avoid; the comment on the helper at the top of the module spells out why a single .get() is the right pattern here too. Collapse the two-step into a single .get() call so the lookup either returns the live record or None, and report 'detached' / 404 based on that single read. No behavior change on the happy path; the failure mode under concurrent stream cleanup is now handled deterministically. Closes #658.	2026-06-02 03:02:30 +05:30
Rifqi Akram	5b1e56407b	Add SSRF-guarded web fetch agent tool * feat(web-fetch): add web_fetch tool to read a specific URL's content * test(web-fetch): add SSRF coverage and fail closed on empty DNS resolution Add explicit SSRF regression tests for the web_fetch path covering loopback, private LAN ranges, link-local/metadata, IPv6 private/local, redirect-into-private, and unsupported schemes. Harden _public_http_url to fail closed when a hostname resolves to no addresses.	2026-06-01 16:57:28 +09:00
Alexander Kenley	cb8a0b268d	Route calendar action requests to tools Co-authored-by: Alex Kenley <Alex.Kenley@threatvectorsecurity.com>	2026-06-01 14:32:41 +09:00
Collin	0a7de1fdf4	fix: stop leaking DB connections when persisting session mode (#64 ) chat_routes.py persisted a session's "mode" in three best-effort spots — reading the current mode, writing the effective mode, and setting research_pending on the stream path. Each opened a session with SessionLocal() and called .close() as the LAST statement inside a try/except, so if anything before close() raised (e.g. a SQLite "database is locked" under concurrent chat streams) the except only logged and the connection was never returned to the pool. DATABASE_URL defaults to file-backed SQLite, whose engine uses SQLAlchemy's default QueuePool (5 connections + 10 overflow). Repeated leaks on these hot paths exhaust the pool; later requests then block for pool_timeout and fail with "QueuePool limit ... reached", taking the app down until restart. Move the logic into two best-effort helpers in core.database, next to the existing session helpers (update_session_last_accessed, get_session_by_id): - get_session_mode(session_id) -> Optional[str] - set_session_mode(session_id, mode) -> bool Both route through the existing get_db_session() context manager, which commits on success, rolls back on error, and always closes in a finally, so the connection is returned to the pool on every path. chat_routes.py now calls these instead of hand-rolling sessions, also removing three copies of the same try/except. Add tests/test_session_mode_helpers.py: the helpers commit+close on success and, on a mid-operation DB error, swallow + roll back + close (no leak). The error-path tests fail against the old close()-inside-try pattern.	2026-06-01 13:57:48 +09:00
pewdiepie-archdaemon	fc7f107b22	Improve Ollama setup and model endpoint handling	2026-06-01 10:00:15 +09:00
pewdiepie-archdaemon	e5c99a5eee	Odysseus v1.0	2026-05-31 23:58:26 +09:00

9 Commits