odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-15 17:25:26 -04:00

Author	SHA1	Message	Date
pewdiepie-archdaemon	b118c33e37	test(provider): align lookalike-host URL expectations with /models behavior build_models_url returns /models (no /v1 prefix) for non-local generic OpenAI-compatible hosts (intentional, see endpoint_resolver.py:206). The tests added in #4272 expected /v1/models, which is the local/deepseek behavior. Match production semantics.	2026-06-15 23:21:49 +09:00
Ashvin	d792b61722	test(gallery): point delete-ordering tests at the tmp image dir (#4300 ) The two delete-ordering tests did monkeypatch.chdir(tmp_path) and wrote the image under tmp_path/data/generated_images, but DATA_DIR (and therefore gallery_routes.GALLERY_IMAGE_DIR) is always an absolute path, so the delete resolver pointed at the repo's real data dir and ignored the chdir. test_file_removed_on_successful_delete therefore failed on dev (the file at the tmp path was never the one being removed), and test_file_kept_when_commit_fails passed only by accident. Set GALLERY_IMAGE_DIR to the seeded tmp dir via monkeypatch so both tests exercise the real path and pass deterministically.	2026-06-15 14:07:49 +00:00
Kenny Van de Maele	e87b44126c	test(hwfit): fix non-Apple guard to assert the Apple matcher (unblocks pytest gate) (#4303 ) * test(hwfit): assert the Apple matcher, not the general lookup, in the non-Apple guard `f7aa2de` (#2564) added test_non_apple_gpu_with_cores_does_not_match, which asserts _lookup_bandwidth(RTX 4090) is None. But '4090': 1008 has been in the general GPU_BANDWIDTH table since v1.0, so _lookup_bandwidth correctly returns the card's real bandwidth and the test fails (expected None, got 1008) - reddening the required pytest gate on dev and, by inheritance, every open PR. The guard's actual intent is that the Apple-specific bandwidth path does not false-match a non-Apple card that carries a gpu_cores count. Point the two asserts at _lookup_apple_bandwidth, which returns None for any name without 'apple' regardless of the general table. The general-lookup behavior (4090 -> 1008) is correct and untouched. * fix(hwfit): route string GPU names through the Apple bandwidth helper Second half of the #2564 regression (RaresKeY review on #4303). That change moved the Apple tiers out of the generic GPU_BANDWIDTH table into the dict-only _lookup_apple_bandwidth, but _lookup_bandwidth only called that helper for dict inputs. A bare-string caller like _lookup_bandwidth("Apple M3 Max") therefore fell through to the generic table, found no Apple key, and returned None instead of the conservative tier. Route both dict and string inputs through the Apple helper (a string carries no gpu_cores, so it gets the model's lowest tier). Regression added for the string path plus a non-Apple string control.	2026-06-15 14:01:05 +00:00
Ahmad Naalweh	f7aa2de410	fix(hwfit): distinguish Apple Silicon bandwidth variants (#2564 ) * fix: resolve Apple Silicon bandwidth variants * fix(hwfit): preserve string lookup path in _lookup_bandwidth * fix(hwfit): guard Apple bandwidth lookup against false GPU matches Add "apple" not in gn check to _lookup_apple_bandwidth() so that non-Apple GPUs with "m3"/"m4"/"m5" in their names (e.g. NVIDIA Quadro M4 000) don't incorrectly match Apple bandwidth tiers. Addresses @o3LL review comment on PR #2564.	2026-06-15 15:13:03 +02:00
Ashvin	514d345334	test(models): pin lookalike hosts to the generic OpenAI branch (#4272 ) #4159 (`4b0a977`) made build_models_url insert /v1 for path-less bases, so the TestBuildersRejectLookalikeHosts model assertions that expected /models started failing and turned the pytest gate red on dev. Both the generic OpenAI branch and the real Anthropic branch now end in /v1/models, so a URL-only assertion no longer proves a lookalike host dodged the Anthropic/Ollama branch. Assert _detect_provider == "openai" directly and keep the /v1/models expectation.	2026-06-15 12:43:33 +00:00
andrewemer	cd02ac7ef6	fix(agent): skill-prescribed tools never reach the model's schema list (#4008 ) * Agent: make skill-prescribed tools actually callable The skill index and matched-skill procedures are injected into the prompt, but tool selection never followed: manage_skills wasn't in the RAG-selected schema list (so the model substituted manage_memory), and a matched skill could prescribe tools (grep, read_file) the model had no schema for. Now: - manage_skills rides along whenever the owner has any skills indexed - a Jaccard-matched skill's requires_toolsets join the selection - viewing a skill mid-turn via manage_skills unlocks its requires_toolsets for subsequent rounds - admin-intent turns send _ADMIN_TOOLS schemas, matching the prompt text _build_base_prompt already advertises - index_for(active_toolsets=None) no longer hides requires_toolsets skills from callers that don't know the active set Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> * Agent: validate skill requires_toolsets against known tools, not TOOL_SECTIONS grep/glob/ls ship as function schemas without a prompt-prose section, so gating on TOOL_SECTIONS silently dropped them from a skill's requires_toolsets. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com> --------- Co-authored-by: Claude Fable 5 <noreply@anthropic.com>	2026-06-15 20:32:43 +09:00
cirim	e7abb7559d	fix(research): keep Discuss chats grounded on their report (#4006 ) * fix(research): preserve Discuss spin-off primer during context trimming trim_for_context() kept only system_msgs[:1] as essential and dropped the rest under budget pressure. A research "Discuss" spin-off seeds the report as a system message that sits after the preface system messages, so it landed in extra_system and was the first thing evicted once the chat grew — the conversation then lost its grounding and drifted off task. Treat any system message carrying research_spinoff_from metadata as essential, alongside the leading system prompt, so the seeded report survives trimming. maybe_compact already retains all system messages. Tests: tests/test_context_compactor.py::TestResearchPrimerPreserved * fix(research): ground Discuss spin-off chats on the seeded report build_chat_context injected global memory (pinned + hybrid-retrieved) and personal-doc RAG every turn, keyed off the user-level memory_enabled pref and a request-scoped use_rag flag — never the session. A research spin-off, whose primer declares the report the sole knowledge base, thus had unrelated keyword-matched facts pulled in ("wrong data") competing with the report; its rag=False flag was also ignored (use_rag defaulted on). Add _session_is_research_spinoff(sess) (detects the primer research_spinoff_from metadata; handles ChatMessage and dict forms) and, for such sessions, disable memory injection and force RAG off. Tests: tests/test_chat_helpers.py spin-off detection cases --------- Co-authored-by: Dan (cirim) <claude@cirim.org>	2026-06-15 20:31:57 +09:00
Max Hsu	172a8ea7b0	fix(skills): keep edit mode open on outside-the-textarea click (#4011 ) Clicking the card body outside the edit <textarea> bubbled to the card's click handler and collapsed the card, silently discarding unsaved skill edits (issue #4002). The textarea's own stopPropagation only shields clicks landing on it. Bail out of the card click handler while a .skill-md-editor is present so the card only leaves edit mode via Save (Cancel button is handled separately by #3580). Mirrors the same guard into the built-in capability card, which shared the bug. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-15 20:31:11 +09:00
Josh Patra	f5d3e5098a	fix(llm): omit temperature for Kimi K2.5 and K2.6 (#3960 )	2026-06-15 20:29:22 +09:00
Josh Patra	4ee5ed4dce	fix(memory): return complete memory lists (#3885 )	2026-06-15 20:28:25 +09:00
Josh Patra	f2bfe9b91f	fix(memory): exempt audits from request timeout (#3886 )	2026-06-15 20:27:46 +09:00
Dividesbyzer0	627a52ac44	fix(cookbook): shim Windows Store python3 alias (#2610 )	2026-06-15 20:25:30 +09:00
Vishnu	933ec8fec9	fix(memory): reject ambiguous multi-object outputs during skill extraction (#3985 )	2026-06-15 10:44:43 +00:00
Merajul Arefin	8fe98cf471	feat(auth): add per-user admin promote/demote toggle (#3078 ) * feat(auth): add per-user admin promote/demote toggle Admin-only API and Users-tab control to grant/revoke admin rights; refuses to demote the last admin. * fix(auth): restore pre-admin privilege restrictions on demotion Promoting now stashes the user's privilege map (privileges_before_admin) and demoting restores it instead of resetting to defaults, so a promote/demote round trip can no longer broaden a restricted user's access. Users without a stash (created as admin, or promoted before this fix) still demote to DEFAULT_PRIVILEGES so a born-admin's stored all-True map — including can_use_bash — can't survive demotion. --------- Co-authored-by: K M Merajul Arefin <merajul.arefin@therapservices.net>	2026-06-15 10:44:27 +00:00
nubs	55b4a5e6ff	fix(ui): restore all-edge modal snap zones (#2260 )	2026-06-15 12:36:34 +02:00
nubs	e75a52efbb	fix(notes): reset search filter on panel reopen so stale query doesn't hide notes (#2920 )	2026-06-15 11:55:46 +02:00
Mazen Tamer Salah	f28703adf6	fix(gallery): remove image file only after the delete commit succeeds (#2196 ) delete_gallery_image() deleted the on-disk file before setting is_active=False and committing. If that commit failed and rolled back, the record stayed active but its file was already gone — a broken, unviewable image (data loss). Soft-delete and commit first, then remove the file best-effort, so a missing or locked file can no longer 500 a delete that already succeeded logically. Adds tests/test_gallery_delete_file_ordering.py covering the commit-failure (file kept) and success (file removed) paths.	2026-06-15 11:00:32 +02:00
Kfir Sadeh	d8e7cc7053	feat(ui): add real-time diagnostic logs console (#974 ) * feat(diagnostics): add admin-gated real-time diagnostics logs terminal UI * feat(ui): resolve diagnostics logs feedback and optimize client-side caching * feat(ui): resolve diagnostics logs feedback	2026-06-15 10:32:51 +02:00
Achilleas90	ffc0f1dccc	Harden CalDAV write-back with retries (#1193 ) Co-authored-by: Alexandre Teixeira <111787685+alteixeira20@users.noreply.github.com>	2026-06-15 15:59:31 +09:00
Syed Ali Rizvi	57646300a4	fix(security): encrypt CardDAV password at rest in settings.json (#1741 ) * fix(security): encrypt CardDAV password at rest in settings.json CardDAV password was stored in plaintext in data/settings.json, while other secrets (email, CalDAV) are encrypted using src.secret_storage. On read (_get_carddav_config): decrypt the password via decrypt(). On write (update_config): encrypt the password via encrypt() before saving to settings.json. decrypt() is a no-op on plaintext, so existing deployments upgrade transparently on the first read after the next config save. * test: add coverage for CardDAV password encryption Nine tests covering: - encrypt-on-save and decrypt-on-read round-trip - encrypted value is stored with enc: prefix (plaintext absent from file) - legacy plaintext passthrough - CARDDAV_PASSWORD env var passthrough (not decrypted) - empty password / no settings file - double-save does not corrupt - encrypt() idempotent on already-encrypted value	2026-06-15 15:58:14 +09:00
spooky	f23e2e6ffb	docs: add agent migration manifest helper (#3028 ) * docs: add agent migration manifest helper * fix: use stat+streamed hash for metadata-only archive scans When include_content is false, skip reading full file content and only stat+stream-hash for size and sha256. Avoids spurious skipped- content warnings and keeps large-export previews fast and clean. Closes review feedback on PR #3028. * fix: skip symlinked migration inputs * fix: stream archive traversal warnings * feat: stage conversation threads in agent migration manifests	2026-06-15 15:57:33 +09:00
KYDNO	955455b797	fix(kimi): resolve Kimi Code API 403 errors and User-Agent restrictions (#3549 ) * fix(kimi): resolve Kimi Code API 403 errors and User-Agent restrictions Kimi Code subscription keys require a whitelisted coding-agent User-Agent to avoid access_terminated_error 403s. This adds User-Agent probing and caching for Kimi Code endpoints. Co-authored-by: Cursor <cursoragent@cursor.com> * fix(kimi): omit temperature for kimi-for-coding API calls Kimi Code rejects any non-default temperature with HTTP 400, which broke deep research probes and low-temp LLM rounds. Co-authored-by: Cursor <cursoragent@cursor.com> --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-15 15:56:54 +09:00
Karthik Rajesh	674457384a	feat(cookbook): surface Docker hardware visibility warnings (#3658 )	2026-06-15 15:51:04 +09:00
Alexandre Teixeira	2cf8bd14ae	test: add report-only order-sensitivity runner (#3982 ) * test: add report-only order-sensitivity runner * test: report cwd in order-sensitivity runner	2026-06-15 15:49:47 +09:00
Abhishek Kumbhar	a172522d87	fix(integrations): prevent blank API integrations (#3840 ) * fix(integrations): validate unified API form fields * fix(integrations): validate API integration fields server-side	2026-06-15 15:40:36 +09:00
Max Hsu	65c7321ace	fix(cookbook): recover completed downloads from DOWNLOAD_OK in background reconciler (#4000 ) The dashboard background status reconciler (_pollBackgroundStatus) only recovered "done" for dependency installs when the backend reported a finished task as "stopped". A real model download whose tmux pane is gone after DOWNLOAD_OK (so the dead-session check misses the landed snapshot) fell through to `task.type === 'download' ? 'crashed'`, so a completed download was shown as crashed (and stalled on the Serve tab). Recover "done" from the terminal DOWNLOAD_OK sentinel, mirroring the dep-install recovery already present. The background poll runs blind, so it keys off the conclusive exit-0 sentinel only — not the `/snapshots/` path, which can be printed mid-stream for multi-file downloads and would risk marking an incomplete download done. Fixes #3897 Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-15 15:36:39 +09:00
Ashvin	23837f4571	fix(cookbook): report dead finished downloads as completed instead of stopped (#4025 ) When a download's tmux pane is gone, the status endpoint trusted only the HF-cache probe to tell completed from stopped. The probe derives its cache root from its own environment, but the download runner exports HF_HOME=<local_dir> (the #2722 fix), so custom-dir downloads land in <local_dir>/hub where the probe never looks - and ollama pulls don't touch the HF cache at all. Finished downloads were reported as stopped forever, and tasks already persisted as completed were demoted back to stopped on the next poll. This is the backend half of #3897, deliberately left out of the frontend fix in #4000. - honor the conclusive runner markers first: DOWNLOAD_OK -> completed (keeping the "Fetching 0 files" error guard), DOWNLOAD_FAILED -> error - pass the task's local_dir through to the cache probes so they check the cache the download actually wrote to, keeping the env-var fallback for default-cache downloads - move the probe scripts and marker classification into routes/cookbook_output.py (dependency-free) with behavioral tests Fixes #4017	2026-06-15 15:26:55 +09:00
Dividesbyzer0	b28aa1f2c4	fix(cookbook): allow local Windows Diffusers serving (#4077 )	2026-06-15 15:21:01 +09:00
Dividesbyzer0	33c26bab88	fix(agent): parse raw json web search calls (#4088 )	2026-06-15 15:19:38 +09:00
cyq	e52d078ea1	fix(agent): detect Polish web lookup intent (#4091 )	2026-06-15 15:19:03 +09:00
nsgds	7ae6133d7f	fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 ) * fix(agent): don't let a materialized default budget defeat context scaling #1230 scales agent_input_token_budget to the model's context window unless the user explicitly set a budget, detected via is_setting_overridden(). But the settings-save path materializes every DEFAULT_SETTINGS key into settings.json (load_settings merges defaults; handlers persist the merged dict), so the persisted default 6000 reads as "overridden" and the budget code takes the min(6000, ctx) branch — silently re-capping long-context models at 6000 for anyone who has ever saved a setting. This reintroduces the exact regression #1170/#1230 set out to fix. Add is_setting_customized() (saved value != default) and gate the scaling on it instead of mere presence. A persisted default is not a user choice. is_setting_overridden has exactly one consumer (this budget path), so the change is contained. Tests cover the materialized-default regression, a deliberately-chosen budget still being honoured, and the absent-key case. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(agent): rework context-budget fix per review (#4122) Address RaresKeY's review: P2 (explicitness): is_setting_customized treated a saved value equal to the default as "not explicit", which ALSO blocked a user from deliberately pinning the default budget. Reframe the default value itself as the AUTO sentinel — agent_input_token_budget == DEFAULT_BUDGET means "scale to the model's context window", any other value is an explicit cap. A materialized default still reads as auto (fixing the original regression), and any non-default value the user chooses is now honoured. Drop the now-unused is_setting_customized helper. P2 (fallback context): auto-scaling trusted get_context_length() even when it returned only the bare DEFAULT_CONTEXT fallback (no endpoint-reported / known window), over-allocating on self-hosted/proxy setups. Add get_context_length_known() (also returns whether the window was actually discovered); the budget block passes 0 when unknown so auto-scaling stays conservative instead of inflating to an unproven window. hard_max stays auto-only — a deliberate explicit budget wins (#1190); kept that contract and answered the reviewer's question rather than silently reversing it. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(agent): lock the materialized-default budget regression (review on #4121) Per WGlynn's review on the issue: add an end-to-end regression that saves an UNRELATED setting (which makes the settings-save path materialize the budget default into settings.json) and asserts the budget still auto-scales rather than re-reading as an explicit 6000 cap — locking the exact reopening shut. To make the test bite the production decision (not just re-derive it), extract `budget_is_explicit()` into src/context_budget.py and use it from the agent loop. It keys off value-vs-default (the default is the auto sentinel), NOT settings presence — which is the whole point, since the save path materializes defaults. Note: after this PR's rework, is_setting_overridden has ZERO production callers, so the merged-dict materialization smell can't reach any setting through a presence check today (WGlynn's durability concern). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(agent): bind the budget context window to its own provenance (review #4122) RaresKeY caught a correctness bug in the fallback-context guard: stream_agent_loop kept only the `known` flag from get_context_length_known() and budgeted off the passed-in `context_length`, which can come from a different lookup. Two failures: - local endpoints are re-queried, so the passed value can be a stale DEFAULT_CONTEXT fallback while the fresh probe proves the real (smaller) served context — we'd scale off the stale value; - callers that don't pass context_length (scheduled tasks, teacher escalation, skill test runs, bg_monitor) were capped at 6000 even when a long window is discoverable. Extract budget_context_for_model() which returns the freshly-probed window when known else 0, binding the flag to the value it proves; the agent loop uses it. Regression tests cover the stale-fallback, no-arg-caller, and probe-error paths. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * docs(agent): fix stale budget comments + tighten to the contract (review #4122) - settings.py: an explicit budget is clamped to the window only — hard_max is auto-only (#1190); drop the incorrect "and to hard_max". - is_setting_overridden docstring: drop the stale "adaptive budgets" example; point value-sensitive callers at context_budget.budget_is_explicit. - Tighten the budget-block comments to the contract (default = auto sentinel, non-default = explicit cap, hard_max = auto-only ceiling). Comment/docstring-only; no behaviour change. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * docs(agent): correct budget issue citations (#1190 → merged #1230/#1273) The context-budget contract (auto-sentinel, explicit budgets honoured, hard_max auto-only) merged via #1230 — #1190 was the earlier, closed, superseded PR. Re-point the contract comments at #1230 (the live source, already cited for the auto-sentinel two lines up in settings.py). The configurable hard_max setting (`agent_input_token_hard_max`) was a reviewer requirement first raised on #1190, omitted from the merged #1230, and actually added in #1273 — credit #1273 for it and correct the test comment's history (it previously implied this PR completed the requirement). Comment/docstring-only; no behaviour change. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-15 15:17:28 +09:00
Dividesbyzer0	589fcd314a	fix(image): patch realesrgan torchvision compatibility (#4110 )	2026-06-15 15:16:41 +09:00
cyq	5e0cdb6cbb	fix(mcp): share oauth redirect URI (#4087 )	2026-06-15 15:15:53 +09:00
Max Hsu	039431f5ea	fix(mcp): detect npx cache entries before probing (#4034 )	2026-06-15 15:14:48 +09:00
cyq	aac589ee49	fix(cookbook): diagnose sglang native deps (#4112 )	2026-06-15 15:14:37 +09:00
Dividesbyzer0	8cff1f87ee	fix(cookbook): stop local Windows process trees Track the inner Bash runner PID for local Windows Cookbook tasks and stop the full child process tree during cleanup.	2026-06-15 15:12:48 +09:00
Dividesbyzer0	ec4f91afdd	fix(cookbook): normalize llama-cpp-python cache types Map llama-cpp-python --type_k/--type_v cache names to integer enum values after serve-command validation while preserving native llama-server flags.	2026-06-15 15:12:18 +09:00
Dividesbyzer0	7f571c8f7e	fix(agent): keep gpt-oss on text tool mode Treat gpt-oss local OpenAI-compatible models as text/fenced-tool models unless the endpoint explicitly declares native tool support.	2026-06-15 15:11:52 +09:00
cirim	056d1fb960	fix(llm): make connect timeout configurable Use a configurable LLM_CONNECT_TIMEOUT for call and stream connect budgets instead of the previous hard-coded 3s default.	2026-06-15 15:11:38 +09:00
Muhammed Midlaj	4b0a977988	fix(models): probe /v1/models for path-less LM Studio endpoints Probe /v1/models for path-less OpenAI-compatible model endpoints and surface clearer LM Studio diagnostics with the actual probed URL.	2026-06-15 15:09:50 +09:00
Dividesbyzer0	ece6cebc03	fix(cookbook): create bin dir before llama-server link Ensure ~/bin exists before the llama.cpp accelerated build script creates the llama-server link.	2026-06-15 15:03:55 +09:00
holden093	4c41834dc7	fix(youtube): consolidate duplicate handler Make src.youtube_handler a compatibility wrapper around services.youtube.youtube_handler so transcript state, URL parsing, and timeout behavior no longer diverge.	2026-06-15 15:03:41 +09:00
holden093	96052c5e8a	fix(agent): add contacts domain to tool classifier Add a contacts domain rule pack and deterministic contact intent detection so contact prompts surface resolve_contact/manage_contact tools.	2026-06-15 15:03:19 +09:00
adabarbulescu	afc81bdd7b	fix: drop thinking deltas from background agent loops Skip thinking-only deltas when accumulating background, scheduled-task, and teacher captured reply text.	2026-06-15 15:03:09 +09:00
osmanakkawi	71ccd59b54	fix(chat): make resend message non-destructive Keep normal resend from truncating session history while preserving replace-from-here behavior for regenerate flows.	2026-06-15 15:02:48 +09:00
Ashvin	b20cea347a	fix(hwfit): serve profiles for sub-8192 context models Allow serve-profile generation for models whose trained context window is below 8192 while preserving the 8K shrink floor for larger models.	2026-06-15 15:02:22 +09:00
Dividesbyzer0	a07fe35936	fix(agent): honor explicit web search requests Promote explicit web-search phrasing to tool use and keep web_search/web_fetch available for that turn even when the stale web toggle is false.	2026-06-15 15:02:10 +09:00
RaresKeY	a7766d0b7f	fix(agent): honor auth-disabled tool access after setup Check explicit auth-disabled mode before configured-admin ownership checks so single-user mode keeps full agent tool access after setup.	2026-06-15 15:01:48 +09:00
nopoz	6824fbb729	fix(gallery): validate upstream result image URLs Validate image URLs returned by upstream diffusion/OpenAI responses before server-side fetches to prevent SSRF through result image retrieval.	2026-06-15 15:01:28 +09:00
nopoz	f14ea6d67d	fix(codex): validate stored SSH host and port Validate cookbook task remoteHost and sshPort values before building SSH shell commands in the Codex bridge.	2026-06-15 15:01:03 +09:00

1 2 3 4 5 ...

786 Commits