odysseus

mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 01:35:36 -04:00

Author	SHA1	Message	Date
pewdiepie-archdaemon	1ee51be420	Sessions: Esc + outside-click also close the Move-to-folder submenu The session-dropdown Esc handler only closed .session-dropdown-menu, leaving the .session-folder-submenu (Move to folder → folder list) orphaned on screen. Same gap on the click-away path. Extend both selectors to cover the submenu so a single Esc / outside-click dismisses the whole stack.	2026-06-10 22:39:23 +09:00
pewdiepie-archdaemon	94931ba59f	Chats sidebar: 'manage' label sits in flex flow so its area is clickable Email's 'new' label is absolutely positioned to the LEFT of the '+' icon, which works there because the '+' is the visible/clickable anchor. The chats manage button has no visible glyph at rest, so the label was rendered outside the button's bounding box — hovering 'manage' lost the :hover state and clicking it missed. Override list-item-plus-label inside chats-manage-btn: position: static (in flex flow) + max-width:0 / max-width:80px expand-on-hover so the button's clickable rect grows alongside the text. Hover stays sticky; click hits.	2026-06-10 22:39:05 +09:00
pewdiepie-archdaemon	49ecd806a2	Chats sidebar: 'manage' slides in from the side like email's 'new' The list-item-plus-label slide-in needs a visible anchor element so the button takes up consistent width and the absolutely-positioned label can fly in to the left of it. Email uses the '+' SVG as that anchor; here we use an empty 13x13 spacer span instead — same footprint, no glyph. Result: empty button at rest (still visible per the chats-manage-btn fade rules), 'manage' slides in from the left on direct hover.	2026-06-10 22:37:06 +09:00
pewdiepie-archdaemon	1eaa5c2a81	Sessions sort: nudge auto-sort icon + 'Group' text 4px left (10→6 left padding)	2026-06-10 22:35:21 +09:00
pewdiepie-archdaemon	e107c5876e	Chats sidebar: drop library SVG from manage button — text-only 'manage' Removed the book/library SVG and list-item-plus-btn/-label classes. The button is now a plain text button styled like email's 'new' label (9.5px, 0.02em letter-spacing), reusing the existing chats-manage-btn opacity hover-reveal rules so it still fades until you hover the section.	2026-06-10 22:35:06 +09:00
pewdiepie-archdaemon	4f7061fd61	Settings overhaul + UI polish pass Two months of iteration on the Settings panel, integration forms, and small visual nudges across the app. Highlights: Settings restructure - Add Models: split into separate Local + API cards (no more in-card tabs); each fuses Type/Provider with the URL input. - Added Models: new dedicated sidebar tab, with Probe + Clear-offline pulled into its header; Local/API sub-section icons accent-tinted. - Search: Web Search and a new Deep Research card (Model + tuning), with a cross-link to AI Defaults. Provider hints use real clickable anchors; Web Search Test button shows a whirlpool spinner. - AI Defaults: Image Generation card returns; Research Model card carries only Endpoint+Model with a cross-link to Search; Vision / Default / Utility fallbacks unified under one numbered-row design matching Search's chain. - API Permissions (was 'API Tokens'): per-row rename, inline Permissions toggle that expands the scope-edit panel, in-field copy icons (icon→check on success). Empty state accent-tinted. - Integrations: + Add Integration drops a type-picker menu directly under the button (drop-up on tight viewports); each integration form (API, CalDAV, CardDAV, Email, Codex/Claude, Vault, MCP) uses the same accent-outlined Save/Test/Cancel buttons right-aligned. - Danger Zone: Wipe→Delete with trash icons; new 'Delete everything' row at the bottom that loops every category. AI Synthesis (Reminders) - Persona dropdown sourced from PROMPT_TEMPLATES + custom preset. - src/reminder_personas.py mirrors the five built-ins for the server-side synthesis path. - dispatch_reminder() reads reminder_llm_persona and uses the persona's system prompt; empty/unknown falls back to warm-neutral. Esc handling - Kebab menus and the provider picker intercept Esc in capture phase so dismissing a popup no longer closes the whole Settings modal. Accent tinting - Scoped CSS rule across data-settings-panel=ai/services/added-models/ search/integrations/reminders for card h2 icons + the Added Models sub-section icons. Codex/Claude integration form - No more auto-creation on form open — explicit Create token button. - New tokens start with every scope granted; existing tokens move out of the integration form into the API Permissions card. - Setup reveal: copy buttons inline inside the token + setup code blocks; shorter subtitle wording. Misc visual polish - Save/Test/Cancel uniformly accent-outlined and right-aligned on every integration form. - Provider logos render inline next to the search fallback selects and the Deep Research Search dropdown. - Trash icons in fallback rows bumped to 20x20 so they fill the 32px button. - Image generation default flipped to off.	2026-06-10 15:15:13 +09:00
pewdiepie-archdaemon	7690860ab1	Settings/Add Models: bump Local Type select width 57→62px	2026-06-09 15:12:57 +09:00
pewdiepie-archdaemon	b6366e9da5	Settings/Add Models: fuse Local Type select + URL input into one bordered group	2026-06-09 15:12:12 +09:00
pewdiepie-archdaemon	64122269e9	Settings/Add Models: shrink Local Type select by 15px (72→57)	2026-06-09 15:11:07 +09:00
pewdiepie-archdaemon	1bdd515941	Settings/Add Models: drop 'Type:' label, keep the LLM/Image select	2026-06-09 15:10:48 +09:00
pewdiepie-archdaemon	8ac0ae72dc	Settings/Add Models: Local card — Type and Add inline with URL field Lift the LLM/Image Type select to the left of the URL input and the Add button to its right, so the primary action (URL + Add) sits on one row. Scan / Ollama / Key / Test stay on the action row below.	2026-06-09 15:09:28 +09:00
pewdiepie-archdaemon	b2458f9891	Settings/Add Models: split Local and API into separate cards, always show API key Drop the in-card Local/API tab strip — each is now its own admin card with a normal h2 heading (Local on top, API below). The API key input is always visible (no more click-to-reveal toggle), matching how cloud providers actually work. Local keeps the optional key reveal since local servers usually don't need one. Dead code removed: wireModelsTabs IIFE and the adm-epApiKeyBtn toggle wire.	2026-06-09 14:57:42 +09:00
pewdiepie-archdaemon	2252776a97	Settings: promote Added Models to its own sidebar menu Move the Added Models endpoint lists out of the Add Models card into a dedicated sidebar tab between Add Models and AI Defaults. The card now focuses purely on adding (Local / API tabs), while the new panel owns the existing endpoints + Probe and Clear-offline controls. admin.js: defensive fallback so a stale 'added' value in localStorage falls back to 'local' instead of leaving both panes hidden.	2026-06-09 14:52:48 +09:00
pewdiepie-archdaemon	c9fecd53dc	Settings: third 'Added Models' tab in Add Models card Move the Added Local + Added API lists out of the per-type tabs into a dedicated third tab. Each Add tab is now just the form; the new tab collects both lists together with Local / API subheadings. Card layout: Add Models [Probe] [Clear offline] [Local] [API] [Added Models] Tab content: Local → Add Local form API → Add API form Added Models → Local list + API list (subheadings) All endpoint list/form IDs preserved. Tab switcher JS is generic so the new 'added' tab works without code changes.	2026-06-09 14:47:21 +09:00
pewdiepie-archdaemon	75268e7f43	Fix Cookbook serve server selection	2026-06-09 14:45:22 +09:00
pewdiepie-archdaemon	8ef9b8b215	Settings: tabbed Add Models card with Local / API tabs Earlier split into 4 flat cards wasn't what was asked for. Restore to a single 'Add Models' card with two tabs at the top: Local → Add form + Added Local Models list API → Add form + Added API Endpoints list Probe / Clear-offline live on the card header and act on both lists. Active tab is remembered in localStorage so the user lands back where they were. All form/list IDs preserved (adm-epLocalUrl, adm-epList-local, adm-epList-api, etc.) so admin.js continues to work unchanged. Replaces the .adm-section-toggle fold-open JS with a tab-switcher; the fold elements no longer exist so the old handler was already a no-op.	2026-06-09 14:43:28 +09:00
pewdiepie-archdaemon	459b825daa	Settings: split Add/Added Models into 4 flat cards (no folds) The previous 'Add Models' card had two collapsible folds (Local + API) inside it and 'Added Models' had two inline subsections. Both folded states added a click-to-expand step that wasn't earning its keep — users coming to Settings to add a model don't want a fold, they want the form. Reshape: four flat admin-cards in the Services panel, each with its own h2 title matching the rest of Settings: Add Local Model (was Add Models → Local fold) Add API (was Add Models → API fold) Added Local Models (was Added Models → Local subsection) Added API Endpoints (was Added Models → API subsection) The collapsible JS hook in admin.js already guards on 'if (!head) return' so removing the .adm-section-toggle headers turns it into a clean no-op — no breakage. All input/list IDs preserved (adm-epLocalUrl, adm-epList-local, adm-epList-api, etc.) so the rest of admin.js continues to work unchanged. Probe / Clear-offline live on the Local card and act on both lists together (existing behavior).	2026-06-09 14:36:44 +09:00
pewdiepie-archdaemon	3247773447	Hide Teacher Model settings card (2.0 'harden the core' deferral) The Teacher Mode feature stays out of the default UI per the 2.0 roadmap — backend escalation is already dormant when teacher_model is unset (its default) and we want to focus on core reliability before surfacing escalation as a feature. Nothing removed from the backend: - src/teacher_escalation.py still gates on get_setting('teacher_model') - agent_loop.py's run_teacher_inline hook is a no-op without the setting - settings backup/restore round-trips the teacher_model key unchanged - power users can still set it via manage_settings or the JSON backup settings.js's initTeacherModel already early-returns when the card's DOM ids are missing, so the JS side is clean. To re-surface the card, revert this commit.	2026-06-09 14:31:04 +09:00
pewdiepie-archdaemon	013beab861	Add Codex and Claude document draft integration	2026-06-09 14:27:53 +09:00
pewdiepie-archdaemon	c5230e85a9	Change project license to AGPL-3.0-or-later	2026-06-09 14:25:04 +09:00
pewdiepie-archdaemon	d397b3db2f	Restore dropped regression fixes	2026-06-09 10:31:43 +09:00
pewdiepie-archdaemon	1a529d63d9	Fix remaining CI regressions	2026-06-09 10:21:56 +09:00
pewdiepie-archdaemon	37c573d865	Fix model endpoint route test regressions	2026-06-09 10:16:38 +09:00
pewdiepie-archdaemon	6f29b287f6	Remove stale plan slash toggle	2026-06-09 09:54:46 +09:00
pewdiepie-archdaemon	4715a5505d	Fix duplicate cookbook server helper export	2026-06-09 09:53:41 +09:00
pewdiepie-archdaemon	84ca74f04b	Restore cookbook server key exports	2026-06-09 09:51:53 +09:00
pewdiepie-archdaemon	e6b1009b89	Remove non-merge-ready workspace and terminal agent hooks	2026-06-09 09:48:59 +09:00
pewdiepie-archdaemon	fa8c93ec0a	Cookbook UI: Ollama browser, advanced serve fold, API tokens form, diagnosis toolbar, polish Surface a lot of accumulated cookbook + UI work as a single non-agent commit so the agent rework lands cleanly. Highlights: - Ollama as a first-class backend in the Cookbook: * Download input accepts ollama-style names (name:tag) → backend=ollama * /api/cookbook/ollama/library (cached scrape of ollama.com + curated fallback so classic models like qwen2.5 stay reachable) * "Browse Ollama library" toggle below Download with size chips * Engine=Ollama in hwfit toolbar merges the Ollama library into the main scan list as per-tag rows with the same Fit/Param/Quant/VRAM columns; click → fills Download input - API Tokens form added to Integrations panel (matching wired loadTokens()/initTokenForm() that had no HTML) - Serve panel polish: Advanced fold tightening (-8px nudges on vLLM checks, Extra args, Spec row), n_cpu_moe + Split Mode controls pulled up 8px to align with the row's checkboxes, GGUF File dropdown exposed for Ollama backend, GPU re-render on Edit serve restore, _forceBackend flag so saved serveState wins over backend detection, cookbook:servers-changed CustomEvent so panels don't need refresh - Models page redesign: Add Models row (URL + hidden API key reveal + Type select + Scan/Ollama/Key/Test/Add icon buttons), Probe All + Clear-offline buttons in Added Models toolbar, offline-pill removed (opacity already conveys state), Engine dropdown gains Ollama option - _ping_endpoint probes /v1/models then base, accepts 4xx as reachable (vLLM returns 404 on bare /v1, fully working endpoints were showing offline) - Diagnosis card: × dismiss + Copy bundle buttons restored on the serve error feedback card - Orphan tmux sweep re-enabled behind a 60s rate-limit + background Thread (off the main event loop) so dead serves get discovered - cookbook_routes auto-register watchdog: drops the endpoint if the serve session exits non-zero within the first ~3min - ollama-rocm sidecar awareness in download wrapper (`docker exec ollama-rocm ollama pull` when host ollama isn't installed) - Skill extractor sets initial_status="published" when auto_approve_skills pref is on (audit demotes later) - Skill list / model list / cookbook scan misc polish	2026-06-09 09:46:19 +09:00
pewdiepie-archdaemon	646f8bd2a9	Remove remaining plan mode frontend code	2026-06-09 09:44:22 +09:00
pewdiepie-archdaemon	2a2a93d845	Remove plan mode from merge-ready UI	2026-06-09 09:40:20 +09:00
pewdiepie-archdaemon	06a04efc59	Merge branch 'dev' # Conflicts: # routes/task_routes.py # src/caldav_sync.py	2026-06-09 09:36:01 +09:00
pewdiepie-archdaemon	3b01760e95	Prepare tested main sync cleanup	2026-06-09 09:34:42 +09:00
Ocean Bennett	db1bbfe588	fix(sessions): keep fresh chats during auto tidy (#1871 ) Co-authored-by: Alexandre Teixeira <111787685+alteixeira20@users.noreply.github.com>	2026-06-09 01:06:20 +01:00
Kenny Van de Maele	2404b00f18	refactor(uploads): centralize upload byte-limits in upload_limits.py (#3364 ) (#3518 ) Move every per-route upload byte-limit into src/upload_limits.py as a validated, env-overridable constant via read_byte_limit_env: - Add GALLERY_UPLOAD_MAX_BYTES, GALLERY_TRANSFORM_UPLOAD_MAX_BYTES, MEMORY_IMPORT_MAX_BYTES, PERSONAL_UPLOAD_MAX_BYTES, EMAIL_COMPOSE_UPLOAD_MAX_BYTES, STT_MAX_AUDIO_BYTES, ICS_MAX_BYTES. - Routes import their constant instead of defining it locally: replaces 4 raw int(os.getenv(...)) and removes 3 hardcoded literals. - The 3 previously-hardcoded limits (email compose, STT audio, calendar ICS) are now env-overridable with the same ODYSSEUS_*_MAX_BYTES naming. - Defaults unchanged, so behavior is unchanged unless an env var is set; an invalid value now fails fast with a clear message instead of a bare int() ValueError. - Document all env vars in .env.example and the README. Fixes #3364	2026-06-09 01:24:30 +02:00
Alexandre Teixeira	a240f28af9	test(taxonomy): auto-mark tests by area and sub-area (#3491 )	2026-06-09 01:13:28 +02:00
Ocean Bennett	e7c1d75884	fix(models): query v1 models for llama-server endpoints (#3380 ) * fix(models): query v1 models for llama-server endpoints * test(models): accept owner kwargs in llama-server regression	2026-06-09 01:09:02 +02:00
Mateus Oliveira	f7ae85590b	refactor(tools): consolidate duplicated _truncate and get_mcp_manager into src/tool_utils (#3478 ) * refactor(tools): consolidate duplicated _truncate and get_mcp_manager into src/tool_utils Move all copies of _truncate(), get_mcp_manager(), and set_mcp_manager() into a single leaf module (src/tool_utils.py) that imports only from src.constants. This eliminates the lazy-import hack ('from src import agent_tools' inside function bodies) in tool_execution.py and tool_implementations.py, and fixes a latent bug: the _truncate copy in tool_execution.py was missing the isinstance guard and would crash on None. Also deletes mcp_servers/_common.py — it was dead code with zero callers anywhere in the codebase, containing its own copy of truncate() and constants that already exist in src/constants.py. * fix(tools): route remaining get_mcp_manager imports to src.tool_utils The maintainer's feedback flagged src/task_scheduler.py:1857 and routes/task_routes.py:977. A project-wide search found a third call site in src/agent_loop.py that also imported get_mcp_manager from src.agent_tools instead of src.tool_utils. All three are now sourced from the canonical location in src.tool_utils. --------- Co-authored-by: mcnoliveira <mcnoliveira@gmail.com>	2026-06-09 01:05:30 +02:00
Ocean Bennett	62ffcb6236	fix(cookbook): preserve same-host ssh profile selection (#3373 ) * fix(cookbook): preserve same-host ssh profile selection * fix(cookbook): resolve same-host ssh profiles in running tab and port lookups	2026-06-09 00:36:10 +02:00
Wes Huber	85c6056c87	test(models): add regression coverage for Z.AI coding endpoint probing (#2244 ) Add focused tests for the z.ai/api/coding path override: - _match_provider_curated: 5 tests verifying coding vs base key - _probe_endpoint: 3 tests verifying model preservation, curated append on partial response, and base-zai exclusion Rebased onto dev per reviewer request. Fixes #2230 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Alexandre Teixeira <111787685+alteixeira20@users.noreply.github.com>	2026-06-08 23:07:29 +01:00
Rohith Matam	049833e309	fix: skip malformed document tool call items (#3494 )	2026-06-08 23:25:31 +02:00
Cookiejunky	4e497f4878	fix(cookbook): guard break-system-packages pip flag (#3510 )	2026-06-08 23:10:20 +02:00
Lucas Daniel	5462030cde	fix(auth): per-user allowed-models checklist ignores cache, [None] doesn't block (#3355 ) Three issues combined to make the per-user 'Allowed models' checklist unreliable (#3032): 1. admin.js _loadModelsForUser fetched /api/models, which is backed by cached_models — endpoints that haven't been probed yet (e.g. a freshly-added DeepSeek API endpoint) simply didn't show up in the checklist. Switched to /api/model-endpoints, which always reflects every configured endpoint regardless of cache state. 2. _saveModels sent allowed_models: [] both when the admin clicked [All] (no restriction) and [None] (block everything) — the backend had no way to distinguish the two. 3. _enforce_chat_privileges treated an empty allowed_models list as 'no restriction' (falsy -> skip the check), so [None] had no effect. Added an explicit block_all_models privilege flag (defaulting to False, and forced to False for admins) that admin.js now sets when zero models are checked. _enforce_chat_privileges checks it first and 403s regardless of allowed_models contents.	2026-06-08 22:52:39 +02:00
Lucas Daniel	0a324f20d2	fix(agent): stop treating illustrative Markdown fences as tool calls for native function-calling models (#3356 ) * fix(agent): stop executing illustrative Markdown fences as tool calls for native function-calling models _resolve_tool_blocks fell back to the textual parse_tool_blocks() fenced-block parser whenever a model produced no native tool_calls, regardless of whether that model has a reliable native function-calling channel. Native models (GPT/Claude/Grok/Qwen3/DeepSeek-V, etc. - _is_api_model true) commonly write illustrative ```bash/```python/```json examples in guide-only prose; the fallback parser matched these and executed them as real commands, sometimes looping for several rounds as the model tried to clarify with more examples (#3222). Restrict the textual fenced-block fallback to non-native models, which rely on it as their only tool-invocation channel. Native models are trusted to use their structured tool_calls channel for real invocations; when they don't emit one, a bare fence in their response is prose, not an action. The native tool_calls path itself is untouched. This sits one layer below #3088's guide-only policy enforcement: that PR blocks tool exposure/execution on explicit no-tools requests, while this fixes the parser so ordinary illustrative fences are never misread as calls in the first place, on any turn. * fix(agent): gate only the fenced-example pattern for native models, preserve DSML/invoke recovery and persistence _resolve_tool_blocks previously short-circuited the entire textual parser (tool_blocks = [] if is_api_model else parse_tool_blocks(...)) for native function-calling models with no native tool_calls. That also dropped Patterns 2-5 (explicit [TOOL_CALL]/<invoke>/<tool_code>/DSML markup leaked into content as text), which are real calls a model couldn't emit on its structured channel (e.g. DeepSeek-V falling back to DSML), not illustrative examples. parse_tool_blocks/strip_tool_blocks now take a skip_fenced flag that gates ONLY Pattern 1 (the fenced ```bash/```python/```json block matcher). _resolve_tool_blocks passes skip_fenced=is_api_model so fenced examples stop being executed for native models while [TOOL_CALL]/<invoke>/<tool_code>/DSML stay fully active and recoverable. cleaned_round mirrors the same gate when persisting round text, so an illustrative fence that wasn't executed isn't stripped from saved/reloaded history either (it was streaming once and then disappearing on reload).	2026-06-08 22:25:28 +02:00
Mazen Tamer Salah	8e494cc1c4	fix(chat): keep balanced trailing ')' when extracting URLs (#3406 ) extract_urls() stripped any trailing ')' unconditionally via `re.sub(r'[.,;:!?\)]+$', '', url)`. That corrupts URLs that legitimately end in a parenthesis — most commonly Wikipedia disambiguation links like https://en.wikipedia.org/wiki/Python_(programming_language), which became ...Python_(programming_language and then 404 when fetched by the web/research tools. Strip trailing sentence punctuation as before, but only drop a ')' when it is unbalanced (more ')' than '('), so a prose-glued "(see https://example.com)" still loses its closing paren while balanced URLs keep theirs. Added tests/test_extract_urls.py covering balanced, unbalanced, nested, and trailing-punctuation cases.	2026-06-08 21:33:29 +02:00
nubs	932b7f2446	fix(email): close IMAP socket when connect/login fails (#3174 ) (#3363 ) * fix(email): close IMAP socket when connect/login fails (#3174) _imap_connect opened a live socket via _open_imap_connection and then called conn.login() with no try/finally, and _open_imap_connection called conn.starttls() unguarded. When auth fails (e.g. an Office 365 app password on an MFA-enabled tenant, #3174) or STARTTLS is rejected, the already-open socket was orphaned. Every IMAP caller funnels through _imap_connect, including the 30-minute _auto_summarize_poller, so a persistently misconfigured account leaked one descriptor per pass toward FD exhaustion. The previously merged leak fixes (#1325/#1330/#1423/#1530) only guard the post-connect body and monkeypatch _imap_connect to succeed, so this connect-time path was uncovered. Wrap login() and starttls() so a failure calls conn.shutdown() (low-level close; logout() can't run pre-auth) before re-raising. Adds two regression tests that fail without the guard. * fix(email): guard MCP IMAP+SMTP connect-time leaks too (#3174) Folds in the sibling connect-time leaks vdmkenny flagged on #3363, so the whole connect-then-step leak class is closed in one place: - mcp_servers/email_server.py::_imap_connect — guard starttls() and login(); close pre-auth with conn.shutdown() before re-raising. - mcp_servers/email_server.py::_smtp_connect — guard starttls() and login(); SMTP has no shutdown(), so close with conn.close() (socket close, no QUIT). Routes SMTP (_send_smtp_message) is already safe via 'with smtplib.SMTP(...)'. Adds four regression tests (one per guard), verified to fail without the fix.	2026-06-08 21:21:41 +02:00
Alex Little	a58f526992	fix(presets): scope expand-prompt model resolution to owner (#3477 ) * fix(presets): scope expand-prompt model resolution to owner /api/presets/expand resolved its model endpoint with no owner, so in a multi-user setup it could match another user's endpoint and use its URL and decrypted api_key. Pass effective_user(request) to _resolve_model so resolution is owner-scoped. Adds a regression test. * fix(presets): scope teacher and audit model resolution to owner Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Alex Little <alexwilliamlittle@gmail.com> Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com> Co-authored-by: Kenny Van de Maele <kenny@kvandemaele.be>	2026-06-08 21:12:02 +02:00
nopoz	ed6cc88974	ci: harden existing workflows for the security gate (#3498 ) Pin actions to commit SHAs, set persist-credentials: false on every checkout, and scope token permissions to the jobs that use them. Suppress the two findings that are safe by design: the description bot's pull_request_target trigger (no fork code runs) and an intentional word-split in the docker manifest step. Clears actionlint and zizmor against dev so the blocking gate from #1314 can pass once both land.	2026-06-08 20:58:59 +02:00
Mazen Tamer Salah	5198516979	fix(sessions): copy message metadata when forking a session (#3409 ) fork_session passed each source message's metadata dict by reference into the new session. add_message() -> _persist_message() stamps _db_id (and timestamp) onto that dict in place, so persisting the fork overwrote the SOURCE messages' _db_id with the forked rows' ids — silently breaking edit/delete-by-id on the original conversation. Copy the metadata dict per message so the fork and source no longer alias. Adds tests/test_fork_session_metadata.py asserting the source session's message metadata is unchanged after a fork.	2026-06-08 20:49:15 +02:00
Giuseppe Castelluccio	095c74b985	fix(security): fail closed in /api/models auth gate on unexpected errors (#3489 ) GET /api/models swallowed any non-HTTPException raised while checking whether the caller is authenticated (bare except Exception: pass), so a broken auth_manager or an exception from get_current_user silently granted the full model list to an anonymous caller instead of rejecting the request. Now any unexpected exception logs and returns HTTP 500. Split out of #2360 per reviewer request to keep the deny-list and the auth-gate fix as separate, single-purpose PRs. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-08 20:23:39 +02:00
CorVous	34a3f8637a	fix(memory): make auto-memory extraction reliable for reasoning models (#3190 ) * fix(memory): auto-memory extracted nothing — flatten window so the prompt ends on a user turn extract_and_store appended the recent window as raw alternating role messages after the system prompt. Since the window is the last N messages, the prompt usually ENDED on an assistant turn — and a chat model given a prompt ending on an assistant turn returns an empty completion (nothing to answer). The result was facts=[] → "Auto memory extraction ran: 0 candidates" on every run, so no memories were ever stored, while skill extraction (which flattens the transcript into a single user message) worked fine. Flatten the window into one user message ending with an explicit instruction, mirroring the skill extractor, so the model always responds. Also harden parsing for reasoning models, matching the audit path which already does this: - raise max_tokens 500 → 4096 (a reasoning model spends the budget on <think> before emitting JSON; 500 truncated it before any JSON appeared); - strip <think>/prose preambles via strip_think and slice the embedded JSON array before json.loads, instead of bombing on char 0. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * chore: tighten memory-extraction-empty-completion — clarify JSON-slice comment re prior strip steps Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * docs(memory): reframe the comment to the accurate root cause (raw-chat framing) The earlier comment leaned on "ends on an assistant turn -> empty completion", which is only one failure mode. The dominant cause, confirmed by a controlled repro (0/6 old vs 6/6 new on this model), is that passing the window as raw chat messages makes the model treat it as a conversation to continue rather than a transcript to analyze, so it returns [] even when durable facts are present. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> * test(memory): cover extraction JSON parsing + slice trailing commentary unconditionally Factor the strip/fence/slice/json.loads logic out of extract_and_store into a pure module-level helper _parse_extraction_json(raw) -> list and drop the 'text[0] != "["' guard so the array is sliced whenever both brackets exist (fixes trailing commentary like '[...] Done!' reaching json.loads). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-08 19:57:44 +02:00

1 2 3 4 5 ...

1077 Commits