odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 17:55:26 -04:00

Author	SHA1	Message	Date
tanmayraut45	0e31c38be0	Support in-place endpoint updates and recover empty-model sessions (#786 ) The "don't wipe endpoint_url/model on endpoint delete" half of #587 landed in `6a78b02` (Fix endpoint model preservation for tasks). The three remaining follow-up pieces from the original PR — flagged in the review on #786 — are: - routes/model_routes.py: toggle_model_endpoint (PATCH) now accepts api_key and base_url, so the admin UI can rotate a key or fix a typo'd URL without going through delete+recreate. base_url is normalized the same way the POST handler does (strip /models, /chat/completions, /completions, /v1/messages, then _normalize_base). Cache invalidation matches the POST/DELETE paths and the response includes base_url so the frontend can confirm what was saved. - routes/chat_routes.py: new _recover_empty_session_model picks cached_models[0] from the endpoint that matches sess.endpoint_url and persists it onto the Session row before the LLM call goes out. Wired into both /api/chat and /api/chat_stream after the existing _clear_orphaned_session_endpoint guard, so the order is: drop truly-orphaned sessions first, then heal the "picker showed it, session never knew" case. - routes/chat_routes.py: when recovery fails (no endpoint, no cached models) raise HTTP 400 with a clear message instead of letting model="" reach the upstream as 401/503. Closes #587.	2026-06-02 11:26:38 +09:00
LittleLlama	54ecfa39cf	Provider detection: match by hostname instead of substring (re #768 ) (#815 ) * Dedupe URL routing helpers and tighten adjacent hostname checks * Match providers by hostname, not substring, in _detect_provider _detect_provider used `"anthropic.com" in url`-style substring checks, so a URL that merely contained a provider's domain in its path or query — or a look-alike host like `anthropic.com.example` — was misclassified and picked the wrong auth-header/payload shape. Switch it to the existing `_host_match` helper (hostname exact/subdomain match), the same way the human-readable labels and curated model lists already work, finishing that migration. Also harden `_host_match` against trailing-dot FQDNs. Not a credential-leak fix: _detect_provider only classifies a URL the admin already configured next to its key, and the URL — not this function — decides where the request goes. This is a correctness/consistency cleanup. Adds tests that import the real helpers (test_endpoint_resolver.py tests local copies, so it can't catch this) covering the substring false-positives. Refs #768. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Import build_headers under its real name in model_routes It was imported as `build_headers as _provider_headers`, which collides with the unrelated llm_core._provider_headers(provider, headers) — same name, different signature. Use the real name to remove the confusion. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * Use hostname matching in URL builders, not raw suffix checks PR review flagged that _detect_provider() was hardened to match on hostname, but several helpers still used raw host.endswith("anthropic.com") / host.endswith("ollama.com"), which match adjacent hosts like notanthropic.com / notollama.com. Route the remaining checks through _host_match(): _is_ollama_native_url and _ollama_api_root in llm_core, and _anthropic_api_root / _ollama_api_root in endpoint_resolver. With _detect_provider already hostname-correct, the trailing "or host.endswith(...)" clauses in build_chat_url / build_models_url are redundant, so drop them rather than fix the substring match in place. Add builder-level tests asserting look-alike and domain-in-path hosts route to the OpenAI-compatible default. They import the real builders and fail on the pre-fix code. Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 11:11:17 +09:00
wundervrc	3f6d630b56	Never resolve to a disabled endpoint model (#861 ) Background tasks (e.g. the Email Tags / check_email_urgency action) resolve their model through resolve_endpoint("utility") → Default Chat. When the configured model is one the user has since disabled on the endpoint, the resolver still dispatched to it — on Groq that surfaces as every email failing with "HTTP 400: model ... requires terms acceptance". Two paths fed this: - The auto-pick fallback selected from cached_models without excluding the endpoint's hidden_models, so a disabled model listed first won. - A stale default_model left pointing at a now-disabled model (seeded at endpoint registration from raw model_ids[0]) was used verbatim. Fix resolve_endpoint / resolve_endpoint_by_id to drop a configured model that's in hidden_models and to pick the first ENABLED chat model. Also seed default_model on registration via _first_chat_model so we never pin the global default to an embedding/tts entry a provider lists first. Checks: python -m pytest tests/test_endpoint_resolver.py tests/test_model_routes.py tests/test_model_context.py (all pass); python -m py_compile app.py routes/model_routes.py src/endpoint_resolver.py. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 11:10:43 +09:00
pewdiepie-archdaemon	6a78b02976	Fix endpoint model preservation for tasks	2026-06-02 09:44:24 +09:00
Alexandre Teixeira	26483661da	Restrict provider discovery to admins Require admin access before serving provider discovery data from GET /api/providers. This prevents normal authenticated users from triggering provider discovery or receiving cached provider host data. Keep GET /api/models available to normal users and leave the existing admin-only GET /api/discover behavior unchanged. Add a focused regression test to ensure unauthorized callers cannot trigger discovery and cannot receive cached provider data.	2026-06-02 05:54:40 +09:00
Prakhya	a96593a99b	Improve Ollama endpoint error messages	2026-06-02 05:53:50 +09:00
Abhinav	9e8de43f25	fix: clear session headers on endpoint deletion (#477 )	2026-06-01 22:19:54 +09:00
Alexander Kenley	2c4b8b57dd	feat(ai): add OpenRouter and Ollama Cloud providers (#231 ) Co-authored-by: Alex Kenley <Alex.Kenley@threatvectorsecurity.com>	2026-06-01 14:26:10 +09:00
Sirsyorrz	09acf955f1	models: dedupe endpoints by base_url on create (#266 ) POST /api/model-endpoints always inserted a new row, so Settings -> Add Models -> Scan for Servers re-added any endpoint a user had already registered manually — once under its model name (from the earlier manual add) and again under its host:port (auto-generated when scan posts without a name). The success toast then misreported the result as "added N new". Look up an existing endpoint with the same base_url accessible to the caller (shared or owned by them) before inserting. If found, return it with `existing: true` so the client can tell the difference between an actual add and a dedupe hit. Toast now reads, e.g., "Found 1 server with 1 model — 1 already added". Tested: POSTing the same base_url three times (incl. trailing-slash variation) returns the same id each time; only one row exists.	2026-06-01 14:22:06 +09:00
pewdiepie-archdaemon	fc7f107b22	Improve Ollama setup and model endpoint handling	2026-06-01 10:00:15 +09:00
pewdiepie-archdaemon	e5c99a5eee	Odysseus v1.0	2026-05-31 23:58:26 +09:00

11 Commits