mirror of https://github.com/pewdiepie-archdaemon/odysseus.git synced 2026-06-16 01:35:36 -04:00

Hriday Ranka 270b8570fc feat(email): add Google OAuth2 for Google Workspace / .edu IMAP & SMTP (#237 )

* feat(email): add Google OAuth2 for Google Workspace / .edu IMAP & SMTP

Google deprecated basic-auth (password) access for Google Workspace
accounts in May 2025. This means any .edu or org Google email account
could no longer connect via IMAP/SMTP with a username + password —
the email feature was silently broken for a large class of users.

This PR adds full OAuth2 (XOAUTH2) support for Google accounts so
Workspace / .edu emails work out of the box.

## What changed

### Backend
- `core/database.py`: add `oauth_provider`, `oauth_access_token`,
  `oauth_refresh_token`, `oauth_token_expiry`, and `display_name`
  columns to `EmailAccount` + idempotent migration
- `routes/email_helpers.py`: XOAUTH2 auth in `_imap_connect()` and
  `_send_smtp_message()`, automatic token refresh, OAuth fields in
  `_get_email_config()`
- `routes/email_routes.py`: OAuth authorize + callback routes,
  `_smtp_ready()` fix, OAuth fields through `_deliver()` closure,
  `display_name` in `From:` header

### Frontend
- `static/js/settings.js`: "Google Workspace / .edu" provider preset,
  "Connect with Google" button, success/error banner, display name field
- `static/js/document.js`: `_accountCanSend()` recognises OAuth accounts
  as SMTP-capable

* security: sign OAuth state, scope callback by owner, fix quotes & logs

Addresses reviewer feedback on the email OAuth2 PR:

- OAuth state is now HMAC-SHA256 signed (keyed with the app secret from
  secret_storage) encoding account_id + owner + a random nonce, and is
  verified with constant-time comparison in the callback before any
  token write. Replaces the bare account_id state, closing the CSRF /
  state-guessing gap.
- Callback extracts the owner from the verified state and re-checks it
  against EmailAccount.owner before writing tokens, matching the
  ownership guards used elsewhere in the email routes. Single-user mode
  (owner == "") still accepts any account, consistent with
  _assert_owns_account.
- Replaced curly/smart quotes in the Name/Email/Display Name input rows
  with plain ASCII so getElementById lookups and event wiring work.
- Stripped account name, SMTP host/user, owner, and raw provider error
  text from send-config and OAuth logs; failures now surface as generic
  error codes in the redirect instead of raw exception strings.
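
A minimal sketch of the signed-state idea described in the bullets above, under stated assumptions. The names `sign_state` and `check_state`, the JSON payload layout, and the `body.signature` wire format are hypothetical illustrations; they are not the project's `make_oauth_state` / `verify_oauth_state` code, and the real secret comes from `secret_storage` rather than a literal.

```python
import base64
import hashlib
import hmac
import json
import secrets


def sign_state(account_id, owner, secret: bytes) -> str:
    """Return an HMAC-SHA256 signed state string (hypothetical layout)."""
    payload = json.dumps(
        {"account_id": account_id, "owner": owner, "nonce": secrets.token_urlsafe(16)},
        separators=(",", ":"),
    ).encode()
    body = base64.urlsafe_b64encode(payload).decode().rstrip("=")
    sig = hmac.new(secret, body.encode(), hashlib.sha256).hexdigest()
    return f"{body}.{sig}"


def check_state(state, secret: bytes):
    """Return the decoded payload, or None if the state is invalid."""
    try:
        body, sig = state.rsplit(".", 1)
        expected = hmac.new(secret, body.encode(), hashlib.sha256).hexdigest()
        # Constant-time comparison, so timing does not leak how much matched.
        if not hmac.compare_digest(sig.encode(), expected.encode()):
            return None
        padded = body + "=" * (-len(body) % 4)
        return json.loads(base64.urlsafe_b64decode(padded))
    except (ValueError, AttributeError, TypeError):
        # Malformed, truncated, or non-string input is treated as invalid.
        return None
```

In a callback built on this sketch, the owner would be read from the verified payload and compared with the stored account owner before any token is written.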

* test(email): add OAuth2 state, _smtp_ready, and XOAUTH2 tests

Move the OAuth state sign/verify helpers out of the setup_email_routes
closure into module-level make_oauth_state/verify_oauth_state in
email_helpers.py so they can be unit-tested, then add tests/test_email_oauth.py:

- signed state round-trips account_id + owner, nonce is unique per call
- tampered account_id, forged signature, and garbage states are rejected
- _smtp_ready treats an OAuth account (no password) as send-capable, and
  still rejects host+user-only accounts with neither password nor OAuth
- _xoauth2_string / _xoauth2_bytes produce the correct SASL XOAUTH2 framing

14 new tests; existing test_security_regressions.py still passes (28).
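
The `_smtp_ready` behaviour covered by these tests could look roughly like the sketch below. This is an illustration only: the function name `smtp_ready`, the dict-shaped config, and the `smtp_host` / `smtp_user` / `smtp_password` keys are assumptions, and only `oauth_provider` and `oauth_refresh_token` are taken from the column names listed earlier in this message.

```python
def smtp_ready(cfg: dict) -> bool:
    """Return True when the account has enough credentials to send mail."""
    # Host and user are required for either authentication style.
    if not cfg.get("smtp_host") or not cfg.get("smtp_user"):
        return False
    has_password = bool(cfg.get("smtp_password"))
    # An OAuth account has no password but can still authenticate via XOAUTH2.
    has_oauth = bool(cfg.get("oauth_provider") and cfg.get("oauth_refresh_token"))
    return has_password or has_oauth
```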

* refactor(email): single XOAUTH2 frame helper, use RuntimeError

Polish from self-review before merge:

- Collapse the XOAUTH2 framing to one source of truth: _xoauth2_raw()
  returns the unencoded SASL string used by both the SMTP and IMAP auth
  callbacks (each library base64-encodes it), and _xoauth2_bytes() is
  just its .encode(). Removes the unused base64 _xoauth2_string helper
  and the duplicated inline frame in _send_smtp_message.
- Raise RuntimeError (not bare Exception) for the "OAuth token
  unavailable" path, matching the convention used across src/.
- Update tests accordingly.

All 14 OAuth tests + 28 security regressions pass; SMTP/IMAP XOAUTH2
verified live against a real Workspace account.
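
For reference, the SASL XOAUTH2 frame has a fixed shape, so a single-source helper pair along the lines described above might be sketched as follows. The function names here are hypothetical stand-ins for `_xoauth2_raw()` / `_xoauth2_bytes()`, not the project's code.

```python
def xoauth2_raw(user: str, access_token: str) -> str:
    """Unencoded SASL XOAUTH2 string; fields are separated by Ctrl-A (0x01)."""
    return f"user={user}\x01auth=Bearer {access_token}\x01\x01"


def xoauth2_bytes(user: str, access_token: str) -> bytes:
    """Bytes form of the same frame, for libraries whose callback returns bytes."""
    return xoauth2_raw(user, access_token).encode()


# Typical standard-library wiring (not executed here; both calls need a live
# connection, and each library base64-encodes the callback's return value):
#   imap_conn.authenticate("XOAUTH2", lambda _challenge: xoauth2_bytes(user, token))
#   smtp_conn.auth("XOAUTH2", lambda challenge=None: xoauth2_raw(user, token))
```

Keeping the frame unencoded avoids double base64 encoding, since `imaplib` and `smtplib` each encode the callback result themselves.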

* tests(email-oauth): cover the security-sensitive OAuth paths before merge

The previous tests only exercised pure helpers (state signing, _smtp_ready,
XOAUTH2 framing). This adds coverage for the actual token-custody and
ownership behaviour, pinning the real route handlers rather than
re-implementations of their logic.

Real OAuth callback route (pulled live from setup_email_routes()):
- missing code -> generic missing_code redirect, no account id / owner in URL
- provider error -> generic google_error redirect, raw error not echoed
- tampered/invalid state -> invalid_state redirect, auth code never leaked
- signed state with owner mismatch -> token write refused (ownership_error),
  DB row left untouched
- signed state with matching owner -> tokens written encrypted, and only to
  the intended account (a second account stays untouched)

Real accounts-list route:
- exposes oauth_provider status but never the access/refresh token values,
  encrypted or otherwise

Token storage / refresh helpers (isolated in-memory SQLite, mocked HTTP):
- refreshed access token stored encrypted; expiry is a timestamp, not a token
- fresh token uses cache (no refresh call); expired token triggers refresh
- refresh HTTP failure returns None silently, no exception or secret surfaced
- missing client credentials short-circuits to None
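
The cache-then-refresh behaviour these tests pin down can be sketched as below. Everything here is an assumption for illustration: `TokenRecord`, `get_access_token`, the injected `refresh` callable, and the 60-second skew are hypothetical, and the sketch leaves out the encryption at rest and the HTTP call that the real helpers perform.

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class TokenRecord:
    access_token: str
    refresh_token: str
    expiry: float  # Unix timestamp, never a token value


def get_access_token(
    record: TokenRecord,
    refresh: Callable[[str], Optional[Tuple[str, float]]],
    now: Optional[float] = None,
    skew: float = 60.0,
) -> Optional[str]:
    """Return a usable access token, refreshing only when the cached one is stale."""
    now = time.time() if now is None else now
    # Fresh token: serve it from the record and make no refresh call.
    if record.access_token and record.expiry - skew > now:
        return record.access_token
    try:
        result = refresh(record.refresh_token)
    except Exception:
        # Fail quietly: no exception text or secret leaves this function.
        return None
    if not result:
        return None
    token, lifetime = result
    record.access_token = token
    record.expiry = now + lifetime
    return token
```

Passing `refresh` and `now` as parameters is a design choice made for this sketch so the logic can be exercised without network access or real clocks, similar in spirit to the mocked HTTP used by the tests.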

Password-account regression:
- password IMAP accounts call conn.login(); OAuth accounts call XOAUTH2
  authenticate() and never login()

28 tests pass (14 prior + 14 new).

* fix(email-oauth): drop raw exception text from token-refresh log

Google token refresh failures now log the account id only, matching
the conservative logging used elsewhere on the OAuth path — no raw
provider/exception details surfacing in logs.

* fix(email-oauth): bring OAuth UI parity to the Integrations email form

The Google Workspace / .edu provider preset, Display Name field, and
Connect-with-Google flow were only wired into the Email-tab account
form. The Integrations-tab form (a separate code path for the same
account type) was missing all three, so the OAuth option was invisible
from that entry point. Mirrors the same PROVIDERS entry, OAuth section,
and connect handler so both forms behave identically.

---------

Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: Alexandre Teixeira <111787685+alteixeira20@users.noreply.github.com>

2026-06-15 17:02:58 +01:00

cli

test: move area_cli tests into cli directory (#3842 )

2026-06-11 17:01:14 +00:00

helpers

test: pilot core database stub helper (#3685 )

2026-06-09 22:23:33 +02:00

streaming

fix(chat): stop code-block button flicker during streaming (#3023 )

2026-06-06 04:08:54 -06:00

_taxonomy.py

test(taxonomy): auto-mark tests by area and sub-area (#3491 )

2026-06-09 01:13:28 +02:00

bombadil-spec.ts

…

conftest.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

LAYOUT_INVENTORY.md

docs(tests): inventory first low-risk test directory split (#3764 )

2026-06-11 19:24:06 +03:00

markdown_codefence_placeholder_regression.mjs

Render emoji shortcodes as icons in chat (#345 ) (#629 )

2026-06-05 02:28:42 +02:00

README.md

test: add report-only order-sensitivity runner (#3982 )

2026-06-15 15:49:47 +09:00

run_focus.py

test: add fast lane and duration visibility (#3659 )

2026-06-09 20:11:47 +02:00

run_order_report.py

test: add report-only order-sensitivity runner (#3982 )

2026-06-15 15:49:47 +09:00

test_action_intents_shell_verbs.py

Anchor shell-verb intent patterns to imperative or can-you position (#1664 )

2026-06-03 14:23:10 +09:00

test_action_intents.py

fix(agent): honor explicit web search requests

2026-06-15 15:02:10 +09:00

test_active_document_clear.py

refactor(tools): extract document tools to handle registry (#3666 )

2026-06-10 10:41:52 +02:00

test_admin_device_flow_static.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_admin_wipe_gallery.py

Admin: wipe gallery albums with images

2026-06-02 20:35:57 +09:00

test_agent_loop_tool_output_truncation.py

fix: use _truncate for tool output display limits in agent_loop (#3831 )

2026-06-11 17:05:13 +01:00

test_agent_loop.py

fix(agent): detect Polish web lookup intent (#4091 )

2026-06-15 15:19:03 +09:00

test_agent_migration_manifest.py

docs: add agent migration manifest helper (#3028 )

2026-06-15 15:57:33 +09:00

test_agent_rounds_exhausted.py

feat: round-limit handling — Continue affordance at the cap + configurable cap (#1999 )

2026-06-04 22:36:05 +02:00

test_agent_tools_truncate_nonstring.py

fix: agent_tools._truncate crashes on non-string input (#1624 )

2026-06-03 14:06:39 +09:00

test_ai_image_url_safety.py

fix(ai): validate generated image result URLs (#4289 )

2026-06-15 16:40:49 +02:00

test_ai_interaction_owner_scope.py

fix(ai): scope tool model resolution by owner

2026-06-04 00:37:28 +01:00

test_amd_gpu_check_args.py

Parse all AMD GPU check args (#1586 )

2026-06-03 08:56:48 +09:00

test_anthropic_response_parse.py

fix: Anthropic responses with multiple text blocks lose all but the first (#1255 )

2026-06-03 00:57:20 +09:00

test_api_chat_security.py

Reapply "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus"

2026-06-03 22:47:00 +09:00

test_api_key_file_permissions.py

fix(security): restrict API-key encryption key file to 0o600

2026-06-15 15:00:11 +09:00

test_api_key_manager_corrupt_load.py

fix: APIKeyManager.load crashes app startup on a corrupt/wrong-shape api_keys.json (#1565 )

2026-06-03 08:11:37 +09:00

test_api_key_manager_resilience.py

fix(api-keys): preserve encrypted keys when saving providers (#1920 )

2026-06-11 18:23:54 +01:00

test_api_token_routes.py

fix(tokens): owner check on update and delete routes (#3899 )

2026-06-11 15:34:44 +02:00

test_api_token_user_route_gate.py

fix(auth): gate api tokens from user routes (#2992 )

2026-06-07 12:55:01 +02:00

test_app_static_mime.py

fix: normalize JS static MIME types on Windows

2026-06-02 01:32:00 +02:00

test_app.py

…

test_archived_sessions_model_filter.py

fix(tests): make archived session filter test multipart-independent

2026-06-05 10:12:47 +01:00

test_ask_user_tool.py

Add ask_user tool: agent-posed multiple-choice questions (#2111 )

2026-06-05 11:49:11 +02:00

test_atomic_io.py

Add atomic IO durability tests (#1622 )

2026-06-03 14:14:16 +09:00

test_auth_config_lock_concurrency.py

fix(auth): fail closed when deleting user tokens fails (#3733 )

2026-06-10 16:24:27 +02:00

test_auth_event_loop.py

fix: avoid double bcrypt on login by using create_session_trusted (#3236 )

2026-06-07 15:10:53 +02:00

test_auth_regressions.py

fix(endpoint): import ModelEndpoint from core database

2026-06-04 11:51:47 +01:00

test_auth_require_privilege_nondict.py

fix: require_privilege 500s on a non-dict privileges blob from auth.json (#1693 )

2026-06-03 13:37:54 +09:00

test_auth_session_revocation.py

refactor(tests): reuse import-state helper in auth tests

2026-06-05 11:10:41 +01:00

test_aux_llm_owner_scope.py

fix(endpoint): scope secondary endpoint lookups by owner

2026-06-08 11:51:55 +01:00

test_backup_cli_security.py

refactor(tests): finish shared CLI loader adoption

2026-06-05 06:00:05 +01:00

test_backup_import_cross_user_dedup.py

fix: backup import drops a user's memory when its text matches another user's (#1743 )

2026-06-03 13:29:14 +09:00

test_backup_import_skills_dedup.py

fix: backup import dropping a user's skill on cross-tenant title/id collision (#2057 )

2026-06-09 08:04:22 +02:00

test_backup_import_skills.py

fix: restore backup import after skills migration (#2980 )

2026-06-06 21:46:32 +01:00

test_bg_jobs_store.py

Ignore invalid background job store rows (#1261 )

2026-06-03 14:07:14 +09:00

test_bg_monitor_stream.py

Ignore non-string background stream deltas (#1549 )

2026-06-03 14:11:45 +09:00

test_blind_compare_redaction.py

refactor(tests): add import-state isolation helper

2026-06-05 07:30:14 +01:00

test_budget_auto_sentinel.py

fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 )

2026-06-15 15:17:28 +09:00

test_build_user_content_pdf_marker.py

fix(documents): restore PDF library metadata and preview (#2483 )

2026-06-07 23:23:27 +02:00

test_builtin_actions_nonstring.py

fix: builtin_actions heuristics crash on a truthy non-string input (#1639 )

2026-06-03 08:59:16 +09:00

test_builtin_actions_owner_scope.py

fix(email): scope learned sender signatures by owner (#3724 )

2026-06-11 13:26:59 +02:00

test_builtin_mcp_npx_cache.py

fix(mcp): detect npx cache entries before probing (#4034 )

2026-06-15 15:14:48 +09:00

test_builtin_memory_consolidation.py

Scope memory consolidation by owner group

2026-06-02 12:40:28 +09:00

test_cache_affinity_local_only.py

fix(llm): stop sending llama.cpp slot-affinity fields to cloud providers (#3945 )

2026-06-11 17:51:03 +02:00

test_caldav_bidirectional_sync.py

Harden CalDAV write-back with retries (#1193 )

2026-06-15 15:59:31 +09:00

test_caldav_google_principal_url.py

fix(caldav): disable redirects on the sync/write-back DAVClient (SSRF) (#2663 )

2026-06-07 05:05:24 +01:00

test_caldav_prune_parse_failure.py

fix(caldav): skip the prune when any object fails to parse (#3454 )

2026-06-08 18:59:14 +02:00

test_caldav_redirect_hardening.py

fix(caldav): disable redirects on the sync/write-back DAVClient (SSRF) (#2663 )

2026-06-07 05:05:24 +01:00

test_caldav_sync_prune_local_events.py

fix(caldav): don't prune locally-created events on sync (#2706 )

2026-06-05 02:48:03 +02:00

test_caldav_sync_uid_scope.py

fix(calendar): scope CalDAV event lookup by calendar

2026-06-04 04:01:21 +01:00

test_caldav_url_hardening.py

fix(caldav): disable redirects on the sync/write-back DAVClient (SSRF) (#2663 )

2026-06-07 05:05:24 +01:00

test_caldav_url_nonstring.py

Harden DAV outbound URL validation (#2819 )

2026-06-05 13:22:21 +02:00

test_caldav_writeback_route.py

Harden CalDAV write-back with retries (#1193 )

2026-06-15 15:59:31 +09:00

test_caldav_writeback.py

Harden CalDAV write-back with retries (#1193 )

2026-06-15 15:59:31 +09:00

test_calendar_batch_events.py

fix: handle batch events format in manage_calendar tool (#3503 )

2026-06-10 19:13:08 +02:00

test_calendar_event_contrast.py

Improve calendar event text contrast (#1184 )

2026-06-02 23:14:52 +09:00

test_calendar_list_range_aliases.py

fix(calendar): accept list event range aliases

2026-06-06 03:47:18 -06:00

test_calendar_owner_scope.py

Harden CalDAV write-back with retries (#1193 )

2026-06-15 15:59:31 +09:00

test_calendar_parse_dt_naive.py

Strip tz in _parse_dt dateutil fallback (naive-datetime contract) (#2557 )

2026-06-05 08:18:26 +01:00

test_calendar_parse_dt_tonight.py

fix: _parse_dt does not understand 'tonight' so event start/end breaks (#1488 )

2026-06-03 14:14:41 +09:00

test_calendar_recurrence.py

fix(calendar): cap RRULE expansion (#2902 )

2026-06-05 16:05:14 +02:00

test_calendar_reminder_minutes_parsing.py

fix(calendar): parse "mins"/"hrs" reminder offsets in manage_calendar (#4266 )

2026-06-15 17:37:28 +02:00

test_calendar_rrule_until_utc.py

fix(calendar): keep recurring events with a UTC UNTIL from collapsing to one (#1383 )

2026-06-03 14:24:14 +09:00

test_calendar_rrule.py

refactor(tests): add temp sqlite helper (#2930 )

2026-06-07 23:44:16 +02:00

test_calendar_update_event_tz.py

refactor(tests): add temp sqlite helper (#2930 )

2026-06-07 23:44:16 +02:00

test_calendar_utils_dates_js.py

Ignore non-string calendar date inputs (#1649 )

2026-06-03 14:16:58 +09:00

test_carddav_password_encryption.py

fix(security): encrypt CardDAV password at rest in settings.json (#1741 )

2026-06-15 15:58:14 +09:00

test_censor_pref_js.py

Ignore censor preference storage errors (#1652 )

2026-06-03 14:16:55 +09:00

test_chat_attachment_picker.py

Chat attachments: allow picker to choose any file type

2026-06-02 20:55:30 +09:00

test_chat_cached_model_normalization.py

Chat: use cached endpoint model ids before probing

2026-06-02 21:00:58 +09:00

test_chat_helpers.py

fix(research): keep Discuss chats grounded on their report (#4006 )

2026-06-15 20:31:57 +09:00

test_chat_image_routing.py

Fix Windows Cookbook background tasks, exit statuses, and empty SSH logs wrapper (#1389 )

2026-06-05 14:41:07 +02:00

test_chat_metrics.py

fix(chat): show requested and actual reply models

2026-06-06 04:30:16 -06:00

test_chat_preprocess_tool_policy.py

fix(agent): enforce guide-only tool policy (#3088 )

2026-06-06 18:48:24 -06:00

test_chat_route_tool_policy.py

fix(agent): honor explicit web search requests

2026-06-15 15:02:10 +09:00

test_chat_stream_scope.py

…

test_chat_tool_screenshot_xss.py

Harden chat streaming DOM sinks (#2498 )

2026-06-04 20:49:37 +02:00

test_chat_upload_limit_config.py

fix(upload): configure chat attachment size limit (#2439 )

2026-06-07 22:42:24 +02:00

test_chatgpt_subscription_routes.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_check_outbound_url_nonstring.py

fix: check_outbound_url crashes on a truthy non-string URL (#1623 )

2026-06-03 08:59:49 +09:00

test_chroma_client.py

…

test_claim_ownerless_json.py

Skip invalid ownerless JSON rows (#1540 )

2026-06-03 14:06:57 +09:00

test_classify_events_memory_text.py

fix(tasks): read Memory.text in classify_events personal context (#3640 )

2026-06-10 19:03:45 +02:00

test_cleanup_owner_scope.py

tests: cover cleanup owner scope

2026-06-02 20:42:21 +09:00

test_cleanup_service_utcnow.py

Replace cleanup service datetime.utcnow calls (#1494 )

2026-06-03 14:14:27 +09:00

test_code_nav_tools.py

feat: add code-navigation tools (grep, glob, ls) + read_file line ranges (#1670 )

2026-06-04 18:37:32 +02:00

test_codex_ssh_host_validation.py

fix(cookbook): validate adopt host (#4282 )

2026-06-15 16:44:24 +02:00

test_compact_truncate_tool_call_args.py

fix(compactor): shrink oversized tool_calls arguments so trim_for_context can fit a tool-only turn (#2949 )

2026-06-05 20:23:38 +02:00

test_compaction_summary_failure.py

fix(test): tolerate owner kwarg in compaction summary resolve_endpoint mock (#3304 )

2026-06-07 17:23:06 +02:00

test_companion_pairing.py

refactor(constants): single source of truth for data dir (#3368 )

2026-06-08 09:58:52 +02:00

test_companion_readonly.py

Tests: companion model JSON resilience

2026-06-02 13:15:22 +09:00

test_compare_endpoint_owner_scope.py

fix(tests): isolate compare endpoint owner-scope test

2026-06-04 19:17:15 +01:00

test_compare_js.py

…

test_compare_stop_disconnect_poll.py

fix(compare): stream Compare panes directly to stop upstream promptly

2026-06-08 01:13:45 +01:00

test_composer_arrow_up_recall_js.py

feat(chat): recall last user message on empty composer ArrowUp (#1175 )

2026-06-08 13:06:05 +02:00

test_compute_next_run_monthly_clamp.py

fix: monthly tasks scheduled for day 29-31 skip every short month (#1668 )

2026-06-03 14:23:01 +09:00

test_consolidate_memory_explicit_drops.py

fix(memory): only delete memories the model explicitly drops in tidy (#3455 )

2026-06-08 18:54:45 +02:00

test_contacts_add_null_name.py

fix: POST /api/contacts/add crashes on JSON null name/email (None.strip()) (#1544 )

2026-06-03 14:23:34 +09:00

test_contacts_carddav_security.py

Harden DAV outbound URL validation (#2819 )

2026-06-05 13:22:21 +02:00

test_contacts_import_nonstring.py

fix(contacts): tolerate non-string body in /api/contacts/import (#3638 )

2026-06-10 17:50:22 +02:00

test_contacts_vcard_parse.py

fix(contacts): parse Apple/iCloud item-grouped vCard EMAIL/TEL properties (#1438 )

2026-06-03 14:24:04 +09:00

test_context_budget.py

fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 )

2026-06-15 15:17:28 +09:00

test_context_cache_per_endpoint.py

fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 )

2026-06-15 15:17:28 +09:00

test_context_compactor_nonstring.py

fix: context_compactor token helpers crash on non-string message text (#1634 )

2026-06-03 14:12:14 +09:00

test_context_compactor.py

fix(research): keep Discuss chats grounded on their report (#4006 )

2026-06-15 20:31:57 +09:00

test_cookbook_cpu_only_serve.py

fix(cookbook): allow local Windows Diffusers serving (#4077 )

2026-06-15 15:21:01 +09:00

test_cookbook_dead_download_status.py

fix(cookbook): report dead finished downloads as completed instead of stopped (#4025 )

2026-06-15 15:26:55 +09:00

test_cookbook_dependency_completion_regression.py

fix(cookbook): recover completed downloads from DOWNLOAD_OK in background reconciler (#4000 )

2026-06-15 15:36:39 +09:00

test_cookbook_diagnosis_js.py

fix(cookbook): diagnose sglang native deps (#4112 )

2026-06-15 15:14:37 +09:00

test_cookbook_diagnosis.py

fix(cookbook): diagnose sglang native deps (#4112 )

2026-06-15 15:14:37 +09:00

test_cookbook_download_toast_duration.py

Keep Cookbook download-failure toasts visible long enough to read (#1412 )

2026-06-03 03:48:25 +09:00

test_cookbook_endpoint_registration.py

Fix Cookbook container-local model endpoints (#1223 )

2026-06-03 00:09:48 +09:00

test_cookbook_error_feedback.py

fix(cookbook): surface backend diagnosis when serve fails in background (#1636 )

2026-06-05 09:52:07 +01:00

test_cookbook_error_tail_lines.py

fix: expand cookbook error output tail from 12 to 50 lines (#1538 )

2026-06-11 17:55:33 +01:00

test_cookbook_gemma4_thinking_template.py

feat(cookbook): add Gemma4 thinking chat template (#2955 )

2026-06-05 22:43:31 +02:00

test_cookbook_helpers.py

fix(cookbook): shim Windows Store python3 alias (#2610 )

2026-06-15 20:25:30 +09:00

test_cookbook_hf_token.py

fix: detect HuggingFace token when downloading cookbook models (#3459 )

2026-06-11 21:53:16 +01:00

test_cookbook_package_detection.py

fix(cookbook): allow local Windows Diffusers serving (#4077 )

2026-06-15 15:21:01 +09:00

test_cookbook_progress_signal_js.py

Don't falsely declare a dependency build stale (#1568 ) (#1768 )

2026-06-03 13:23:35 +09:00

test_cookbook_same_host_server_profiles_js.py

fix(cookbook): preserve same-host ssh profile selection (#3373 )

2026-06-09 00:36:10 +02:00

test_cookbook_serve_lifecycle.py

fix(cookbook): only persist successfully stopped scheduled serves (#4267 )

2026-06-15 17:30:18 +02:00

test_copilot_routes.py

feat(provider): add GitHub Copilot provider with device-flow auth (#1480 )

2026-06-04 21:13:14 +02:00

test_copilot.py

feat(provider): add GitHub Copilot provider with device-flow auth (#1480 )

2026-06-04 21:13:14 +02:00

test_copy_message_strips_thinking_js.py

fix(chat): copy only the displayed reply from the message copy buttons (#3731 )

2026-06-10 18:29:22 +02:00

test_cors_preflight.py

Fix: CORS preflight 401'd by AuthMiddleware before CORSMiddleware (#3262 )

2026-06-07 15:23:23 +02:00

test_database_utcnow.py

Replace core database utcnow defaults (#1457 )

2026-06-04 02:50:19 +01:00

test_db_stubs_helper.py

test: pilot core database stub helper (#3685 )

2026-06-09 22:23:33 +02:00

test_ddg_redirect_resolution.py

Match host, not substring, when resolving DuckDuckGo redirects (#886 )

2026-06-02 12:25:56 +09:00

test_deep_research_date_context.py

Inject current date into deep research planning and query prompts (#1347 )

2026-06-03 03:00:52 +09:00

test_deep_research_extraction_controls.py

fix(research): track analyzed URLs separately (#3125 )

2026-06-10 12:08:22 +01:00

test_deep_research_parse_json_array_echo.py

fix: deep research runs the prompt's example queries when the model echoes them (#1666 )

2026-06-03 14:23:07 +09:00

test_deep_research_search_error.py

Research: report empty search provider results clearly

2026-06-02 20:34:25 +09:00

test_deep_research_synthesis_resilience.py

Don't lose deep-research findings when synthesis times out (#1551 ) (#1562 )

2026-06-03 08:11:44 +09:00

test_delete_message_no_session.py

Let the output "x" delete work when no model/session exists (#1431 )

2026-06-03 04:20:48 +09:00

test_delete_user_invalidates_token_cache.py

fix(auth): fail closed when deleting user tokens fails (#3733 )

2026-06-10 16:24:27 +02:00

test_delete_user_revokes_api_tokens.py

fix(auth): fail closed when deleting user tokens fails (#3733 )

2026-06-10 16:24:27 +02:00

test_deleted_session_sidebar_regression.py

Fix stale deleted sessions in sidebar (#1203 )

2026-06-02 23:52:22 +09:00

test_derive_title_nonstring.py

fix: _derive_title crashes on non-string content instead of returning Untitled (#1751 )

2026-06-03 13:25:41 +09:00

test_device_flow_routes.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_diagnostics_logs.py

feat(ui): add real-time diagnostic logs console (#974 )

2026-06-15 10:32:51 +02:00

test_diagnostics_service_route.py

feat(diagnostics): add consolidated service health endpoint for degraded-state reporting (#964 )

2026-06-09 16:00:24 +01:00

test_dialog_aria.py

Add dialog accessibility semantics

2026-06-02 12:41:25 +09:00

test_diffusion_server_security.py

test(diffusion-server): exercise security middleware wiring (#3214 )

2026-06-07 23:42:11 +02:00

test_digest_windows.py

fix: calendar check-in digest drops events 7-8 days out (#1249 )

2026-06-03 01:03:58 +09:00

test_direct_upload_limits.py

refactor(uploads): centralize upload byte-limits in upload_limits.py (#3364 ) (#3518 )

2026-06-09 01:24:30 +02:00

test_doc_library_open_orphaned.py

Let orphaned documents be reopened from the library (#1602 ) (#1761 )

2026-06-03 13:28:31 +09:00

test_docs_no_orphan_images.py

Remove stray PR screenshots accidentally committed under docs/ (#1351 )

2026-06-03 03:31:09 +09:00

test_docs_query_nondict_rows.py

fix: docs RAG query crashes on a non-dict row from the index (#1706 )

2026-06-03 13:35:01 +09:00

test_document_actions_nonstring.py

fix: document_actions title/content helpers crash on non-string input (#1621 )

2026-06-03 08:59:55 +09:00

test_document_ai_preview_refresh_js.py

fix document preview refresh after AI edits (#2259 )

2026-06-07 22:33:01 +02:00

test_document_close_clears_active_route.py

refactor(tools): extract document tools to handle registry (#3666 )

2026-06-10 10:41:52 +02:00

test_document_deeplink.py

refactor(tools): extract document tools to handle registry (#3666 )

2026-06-10 10:41:52 +02:00

test_document_diff_discard_on_update_js.py

fix(documents): discard pending AI diff before switching active doc (#2484 )

2026-06-07 22:35:35 +02:00

test_document_editor_scroll.py

fix: drop thinking deltas from background agent loops

2026-06-15 15:03:09 +09:00

test_document_library_delete_counters.py

fix(documents): refresh library counters after removal (#1924 )

2026-06-04 04:42:23 +01:00

test_document_library_language_facet.py

fix: document library language facet undercounts text documents (#1758 )

2026-06-03 13:28:38 +09:00

test_document_library_pdf_metadata.py

fix(documents): restore PDF library metadata and preview (#2483 )

2026-06-07 23:23:27 +02:00

test_document_pdf_marker.py

Documents: strip PDF marker without corrupting text

2026-06-02 20:35:27 +09:00

test_document_processor_attachment_budget.py

Cap inline attachment context across files (#1498 )

2026-06-03 14:23:43 +09:00

test_document_session_owner_scope.py

Scope document session links by owner (#3005 )

2026-06-07 12:47:20 +02:00

test_document_tidy_null_timestamp.py

fix: document tidy crashes on a duplicate with NULL timestamps (#1772 )

2026-06-03 13:23:01 +09:00

test_document_tool_owner_scope.py

refactor(tools): extract document tools to handle registry (#3666 )

2026-06-10 10:41:52 +02:00

test_edit_file.py

refactor(tools): migrate execution logic to src/agent_tools/ package with handler registry (#3435 )

2026-06-09 14:35:36 +01:00

test_editor_draft_payload.py

Ignore invalid editor draft payloads (#1533 )

2026-06-03 14:07:03 +09:00

test_email_decode_header.py

fix(email): guard _decode_header against unknown MIME charset (#1354 )

2026-06-03 14:24:20 +09:00

test_email_envelope_recipients.py

fix: SMTP envelope recipients split on commas inside display names (#1464 )

2026-06-03 14:23:58 +09:00

test_email_fallback_reconnect.py

Reconnect after a failed SEARCH ALL so the email poller doesn't desync IMAP (#1613 ) (#1748 )

2026-06-03 13:28:53 +09:00

test_email_gmail_fetch_flags.py

fix(email): keep FETCH attributes Gmail sends after the header literal (all Gmail mail showed as unread) (#3785 )

2026-06-11 16:12:39 +02:00

test_email_helpers_decode_header_spaces.py

fix(email): decode headers without injected spaces (#2433 )

2026-06-07 16:56:20 +02:00

test_email_imap_timeout.py

Use shared IMAP timeout for account tests (#1088 )

2026-06-02 23:11:04 +09:00

test_email_library_bulk_actions.py

Email: persist bulk read state to provider

2026-06-02 20:28:01 +09:00

test_email_linkify_security_js.py

Harden email HTML URL sanitization (#2496 )

2026-06-04 20:47:47 +02:00

test_email_oauth.py

feat(email): add Google OAuth2 for Google Workspace / .edu IMAP & SMTP (#237 )

2026-06-15 17:02:58 +01:00

test_email_owner_scope.py

fix(email): scope learned sender signatures by owner (#3724 )

2026-06-11 13:26:59 +02:00

test_email_polly_imap_leak.py

fix(tests): allow multiple logout calls when IMAP fallback reconnects (#1976 )

2026-06-04 02:56:05 +01:00

test_email_smtp_security.py

Email: add explicit SMTP security mode

2026-06-02 13:15:06 +09:00

test_email_split_border_css.py

fix(ui): contain email split divider (#1194 )

2026-06-02 23:28:24 +09:00

test_email_thread_parser_nonstring.py

Ignore non-string email thread bodies (#1654 )

2026-06-03 14:06:31 +09:00

test_embedding_cache_confinement.py

Constrain embedding model cache paths (#2849 )

2026-06-05 10:46:48 +02:00

test_embedding_endpoint_config.py

Ignore non-object embedding endpoint config (#1260 )

2026-06-03 14:12:41 +09:00

test_embedding_lane_ndarray_restore.py

fix(embeddings): survive numpy embeddings when restoring a reset lane (#3410 )

2026-06-09 10:40:17 +02:00

test_embedding_lanes.py

fix: split Chroma embedding lanes (#3046 )

2026-06-06 03:17:19 -06:00

test_embeddings.py

Add support for EMBEDDING_API_KEY (#2691 )

2026-06-05 14:47:24 +02:00

test_emoji_shortcodes_js.py

Render emoji shortcodes as icons in chat (#345 ) (#629 )

2026-06-05 02:28:42 +02:00

test_emoji_svg_hardening.py

Harden emoji SVG proxy responses (#2842 )

2026-06-05 10:31:58 +02:00

test_endpoint_owner_scope_followup.py

fix(endpoint): scope secondary endpoint lookups by owner

2026-06-08 11:51:55 +01:00

test_endpoint_probing.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_endpoint_resolver.py

refactor(tests): replace local function copies in test_endpoint_resolver with real imports (#3359 )

2026-06-07 22:47:57 +02:00

test_esc_menu_stack_js.py

…

test_estimate_tokens_tool_calls.py

fix(model-context): count tool_calls in estimate_tokens so compaction sees real size (#2751 )

2026-06-05 15:56:54 +02:00

test_extract_quotes.py

fix: extract_quotes accepts mismatched opening/closing quotes (#1113 )

2026-06-02 22:34:52 +09:00

test_extract_skill_json_nonstring.py

fix: _extract_skill_json crashes on a truthy non-string teacher response (#1630 )

2026-06-03 08:59:36 +09:00

test_extract_statistics.py

fix: extract_statistics drops large numbers and trailing % signs (#1153 )

2026-06-02 22:35:30 +09:00

test_extract_urls.py

fix(chat): keep balanced trailing ')' when extracting URLs (#3406 )

2026-06-08 21:33:29 +02:00

test_fenced_example_not_executed_for_native_models.py

fix(agent): stop treating illustrative Markdown fences as tool calls for native function-calling models (#3356 )

2026-06-08 22:25:28 +02:00

test_fenced_invoke_no_raw_xml.py

fix: route misfenced web lookups to web tools

2026-06-06 03:46:31 -06:00

test_font_routes.py

Keep compact font family names together (#1263 )

2026-06-03 14:24:30 +09:00

test_fork_session_metadata.py

fix(sessions): copy message metadata when forking a session (#3409 )

2026-06-08 20:49:15 +02:00

test_form_markdown_roundtrip.py

fix(forms): keep PDF-form export from dropping values when the label has '*' (#1407 )

2026-06-03 14:24:07 +09:00

test_forwarded_message_divider.py

Email: recognize forwarded message dividers

2026-06-02 20:32:56 +09:00

test_function_call_non_object_args.py

test(tool_execution): stop two tests leaking src.tool_execution into the suite (#2686 )

2026-06-09 16:35:10 +01:00

test_gallery_album_owner_scope.py

fix(gallery): fail closed for null-user owner scope (#3613 )

2026-06-09 20:20:21 +02:00

test_gallery_delete_file_ordering.py

test(gallery): point delete-ordering tests at the tmp image dir (#4300 )

2026-06-15 14:07:49 +00:00

test_gallery_endpoint_matching.py

Scope gallery image endpoints by owner (#3001 )

2026-06-07 12:51:21 +02:00

test_gallery_endpoint_ssrf.py

fix: validate client-supplied image _endpoint to prevent SSRF (gallery proxies) (#1718 )

2026-06-03 13:34:17 +09:00

test_gallery_exif_orientation.py

fix: gallery records raw instead of display dimensions for EXIF-rotated photos (#1667 )

2026-06-03 14:23:04 +09:00

test_gallery_filename_confinement.py

fix(gallery): confine replacement image path (#4285 )

2026-06-15 16:42:41 +02:00

test_gallery_image_endpoint_owner_scope.py

Scope gallery image endpoints by owner (#3001 )

2026-06-07 12:51:21 +02:00

test_gallery_image_privileges.py

fix(endpoint): scope secondary endpoint lookups by owner

2026-06-08 11:51:55 +01:00

test_gallery_null_user_routes.py

fix(gallery): fail closed for null-user owner scope (#3613 )

2026-06-09 20:20:21 +02:00

test_gallery_owner_filter_single_user.py

fix(gallery): fail closed for null-user owner scope (#3613 )

2026-06-09 20:20:21 +02:00

test_gallery_result_image_ssrf.py

fix(gallery): validate upstream result image URLs

2026-06-15 15:01:28 +09:00

test_generated_image_confinement.py

Constrain generated-image paths to image root (#2837 )

2026-06-05 10:33:47 +02:00

test_gmail_quote_attribution_js.py

Parse standard Gmail quote attribution dates

2026-06-03 13:45:56 +09:00

test_gpu_compose_standalone.py

Reapply "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus"

2026-06-03 22:47:00 +09:00

test_group_chat_storage.py

Keep group chat session cache loading (#1418 )

2026-06-03 04:05:40 +09:00

test_helpers_import_state.py

refactor(tests): centralize fake endpoint resolver cleanup

2026-06-05 13:23:46 +01:00

test_hex_to_rgb_js.py

fix: theme color parsing breaks on #rgb shorthand hex (#1213 )

2026-06-03 00:30:03 +09:00

test_history_compact_tool_calls.py

Scope auxiliary LLM endpoints by owner (#2996 )

2026-06-07 14:47:44 +02:00

test_history_db_fallback_hidden.py

fix: history DB fallback returned hidden (compaction) messages to the client (#1726 )

2026-06-03 13:30:11 +09:00

test_history_order_by_timestamp_regression.py

Fix HTTP 500 in history routes: order ChatMessage by timestamp, not created_at (#1673 )

2026-06-03 14:22:51 +09:00

test_history_topics_owner_scope.py

fix(history): scope topic analysis to authenticated owner only (#744 )

2026-06-02 11:36:01 +09:00

test_hwfit_amd.py

Cookbook fit: steer consumer AMD to GGUF recommendations

2026-06-02 21:01:42 +09:00

test_hwfit_apple_bandwidth.py

test(hwfit): fix non-Apple guard to assert the Apple matcher (unblocks pytest gate) (#4303 )

2026-06-15 14:01:05 +00:00

test_hwfit_bandwidth_nonstring.py

fix: _lookup_bandwidth crashes on a truthy non-string gpu_name (#1641 )

2026-06-03 14:11:10 +09:00

test_hwfit_container_visibility_warning.py

feat(cookbook): surface Docker hardware visibility warnings (#3658 )

2026-06-15 15:51:04 +09:00

test_hwfit_gpu_count_nonnumeric.py

fix(hwfit): tolerate non-numeric gpu_count in /api/hwfit/models (#3639 )

2026-06-11 01:01:58 +02:00

test_hwfit_macos.py

fix(hwfit): distinguish Apple Silicon bandwidth variants (#2564 )

2026-06-15 15:13:03 +02:00

test_hwfit_manual_backend.py

fix(hwfit): honor manual "metal" backend in the hardware simulator (#1090 )

2026-06-02 23:12:34 +09:00

test_hwfit_native_quant_labels.py

fix: hwfit native quant labels miss the cost maps and over-estimate VRAM (#1690 )

2026-06-03 14:22:42 +09:00

test_hwfit_params_b_malformed.py

fix: params_b crashes the whole ranking on a malformed parameter_count (#1550 )

2026-06-03 14:23:30 +09:00

test_hwfit_quant_formats.py

Fix native Cookbook quant classification

2026-06-02 13:07:20 +09:00

test_hwfit_remote_validation.py

fix(hwfit): validate remote SSH detection targets (#3718 )

2026-06-11 00:43:49 +02:00

test_hwfit_unified_nvidia.py

fix(platform): Improve WSL SSH remote compatibility (#3316 )

2026-06-08 00:33:50 +02:00

test_hwfit_windows.py

fix(hwfit): filter non-GGUF models on Windows (#2530 )

2026-06-04 20:02:13 +02:00

test_icloud_imap_full_fetch.py

Fetch full messages with BODY.PEEK[] so read_email works on iCloud IMAP (#1961 ) (#1963 )

2026-06-04 03:53:14 +01:00

test_ics_escape.py

Sanitize calendar export filenames (#2840 )

2026-06-05 10:18:09 +02:00

test_ics_export_escaping.py

fix: ICS export — escape X-WR-CALNAME and honour is_utc on DTSTART/DTEND (#1174 )

2026-06-02 23:02:28 +09:00

test_ics_import_dedup_tz.py

fix: re-importing an ICS file duplicates every tz-aware timed event (#1683 )

2026-06-03 14:22:49 +09:00

test_image_models_nondict_system.py

fix: image model ranking crashes when system is not a dict (#1900 )

2026-06-04 03:23:59 +01:00

test_image_models_nonstring_search.py

fix: image model ranking crashes on a non-string search filter (#1898 )

2026-06-04 03:26:35 +01:00

test_imap_leak_fixes.py

fix(email): close IMAP socket when connect/login fails (#3174 ) (#3363 )

2026-06-08 21:21:41 +02:00

test_imap_mailbox_quoting.py

fix: quote IMAP mailbox arguments (#2170 )

2026-06-05 16:00:20 +02:00

test_inside_base_dir_nonstring.py

fix: inside_base_dir raises TypeError on a non-string path instead of failing closed (#1619 )

2026-06-03 09:00:04 +09:00

test_integrations_api_call_truncation.py

fix(integrations): truncate api_call JSON lists with sentinel instead of mid-string cut (#3540 )

2026-06-09 22:34:08 +01:00

test_integrations_store_shape.py

fix(integrations): prevent blank API integrations (#3840 )

2026-06-15 15:40:36 +09:00

test_internal_api_base.py

fix: route all agent loopback calls through internal_api_base() helper (#3322 )

2026-06-07 22:22:09 +01:00

test_is_youtube_url_nonstring_svc.py

fix: is_youtube_url (services) crashes on a non-string url (#1753 )

2026-06-03 13:24:24 +09:00

test_is_youtube_url_nonstring.py

fix: is_youtube_url crashes on a non-string url (#1752 )

2026-06-03 13:24:33 +09:00

test_keybind_altgr_js.py

Ignore AltGr keystrokes in Ctrl+Alt keyboard shortcuts (#825 )

2026-06-02 11:12:54 +09:00

test_kimi_code_hosts.py

fix(kimi): resolve Kimi Code API 403 errors and User-Agent restrictions (#3549 )

2026-06-15 15:56:54 +09:00

test_kimi_code_user_agent.py

fix(kimi): resolve Kimi Code API 403 errors and User-Agent restrictions (#3549 )

2026-06-15 15:56:54 +09:00

test_kv_cache_invalidation_2927.py

fix(chat): stabilize system prompt, sequence memory extraction, and send stable session id to preserve KV cache (#3360 )

2026-06-09 22:46:54 +01:00

test_lang_icon_null_opts_js.py

fix: langIcon throws on an explicit null opts argument (#1740 )

2026-06-03 13:29:21 +09:00

test_llama_server_models_url.py

fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 )

2026-06-15 15:17:28 +09:00

test_llm_core_anthropic_cache.py

Add Anthropic prompt caching to the agent loop (#812 )

2026-06-02 11:14:31 +09:00

test_llm_core_anthropic_temp_clamp.py

Clamp Anthropic temperature to [0.0, 1.0] in _build_anthropic_payload (#1737 )

2026-06-03 13:29:36 +09:00

test_llm_core_anthropic_temp_omit.py

fix: omit temperature for Opus 4.7+ on native Anthropic path (#3117 )

2026-06-11 16:27:40 +03:00

test_llm_core_concurrency.py

…

test_llm_core_connect_timeout.py

fix(llm): make connect timeout configurable

2026-06-15 15:11:38 +09:00

test_llm_core_fallback.py

Don't attempt the same (url, model) route twice in the fallback chains (#1733 )

2026-06-03 13:33:50 +09:00

test_llm_core_ollama_thinking.py

fix(llm): suppress thinking mode for qwen3/gemma4 on Ollama /v1 endpoint (#3228 )

2026-06-09 07:35:15 +02:00

test_llm_core_ollama.py

Ollama: pass discovered num_ctx in chat requests

2026-06-02 20:27:24 +09:00

test_llm_core_reasoning_content_fallback.py

fix: surface reasoning_content when content is empty (thinking models) (#1233 )

2026-06-03 01:41:24 +09:00

test_llm_core_reasoning.py

fix(llm): route harmony thinking streams (#2449 )

2026-06-05 15:22:08 +02:00

test_llm_core_sanitize_tool_calls.py

test: stabilize full test collection

2026-06-04 00:27:29 +01:00

test_llm_core_sse_no_space.py

fix: streaming drops providers that emit SSE data lines with no space (#1701 )

2026-06-03 13:37:14 +09:00

test_llm_core_streaming.py

fix(llm): guard against null arguments in streaming tool-call accumulator (#2923 )

2026-06-05 20:57:36 +02:00

test_llm_core_system_msg_missing_content.py

fix: degrade missing/None content key in system messages to empty string (#2570 )

2026-06-05 00:10:11 +02:00

test_llm_core_temperature.py

fix(llm): omit temperature for Kimi K2.5 and K2.6 (#3960 )

2026-06-15 20:29:22 +09:00

test_llm_core_usage_finish_delta.py

fix: SSE stream parser crashes with NoneType on providers sending null choice/usage/tc entries (#2389 )

2026-06-04 13:53:10 +01:00

test_lmstudio_discovery.py

Discover LM Studio via host/port scanning and native-API fingerprint (#1126 )

2026-06-02 23:04:58 +09:00

test_lmstudio_models_url.py

fix(models): probe /v1/models for path-less LM Studio endpoints

2026-06-15 15:09:50 +09:00

test_lmstudio_vision.py

Use LM Studio-reported vision capability for image passthrough (#1130 )

2026-06-02 23:01:04 +09:00

test_load_features_permission_error.py

fix(settings): degrade load_features to defaults on PermissionError

2026-06-11 21:20:10 +01:00

test_local_endpoint_api_key_js.py

Models: allow API keys for local endpoints

2026-06-02 20:36:54 +09:00

test_local_endpoint_js.py

fix: don't bill self-hosted models reached by a container/service hostname (#596 )

2026-06-02 11:47:58 +09:00

test_loop_breaker_runaway.py

fix(agent): don't abort legitimate tool batches as runaway loops (#3183 )

2026-06-07 16:16:17 +02:00

test_manage_memory_list.py

fix(memory): return complete memory lists (#3885 )

2026-06-15 20:28:25 +09:00

test_manage_notes_owner_gate.py

Tighten manage notes owner checks (#3002 )

2026-06-07 12:50:10 +02:00

test_manage_settings_token_budget.py

fix: agent_input_token_budget wrongly treated as a secret and unsettable from chat (#1294 )

2026-06-03 01:53:47 +09:00

test_markdown_dom_xss_helpers.py

Harden markdown raw HTML sanitization (#2497 )

2026-06-04 20:46:10 +02:00

test_markdown_rendering_js.py

fix(markdown): avoid autolinking dotted imports (#2295 )

2026-06-05 02:57:20 +02:00

test_markdown_table_row_js.py

Ignore non-string markdown table rows (#1648 )

2026-06-03 14:17:02 +09:00

test_markitdown_format_nonstring.py

fix: is_markitdown_format crashes on a non-string path (#1618 )

2026-06-03 09:00:10 +09:00

test_markitdown_runtime.py

Add optional markitdown extraction for Office/EPUB documents (#766 )

2026-06-02 11:28:52 +09:00

test_match_model_key_js.py

fix: model cost/info matches first substring key (gpt-4o-mini billed as gpt-4o) (#1439 )

2026-06-04 03:05:37 +01:00

test_mcp_cache_invalidation.py

fix(mcp): invalidate tool prompt cache on connect/disconnect/error (#1235 )

2026-06-03 00:49:29 +09:00

test_mcp_common_truncate.py

refactor(tools): consolidate duplicated _truncate and get_mcp_manager into src/tool_utils (#3478 )

2026-06-09 01:05:30 +02:00

test_mcp_email_decode_header_spaces.py

Decode email headers without injected spaces

2026-06-03 13:45:33 +09:00

test_mcp_manager.py

feat(mcp): add Streamable HTTP transport with OAuth 2.0 (#1033 )

2026-06-05 02:40:52 +02:00

test_mcp_oauth.py

feat(mcp): add Streamable HTTP transport with OAuth 2.0 (#1033 )

2026-06-05 02:40:52 +02:00

test_mcp_param_hint_hardening.py

fix(mcp): sanitize and cap rendered MCP tool param hints (#2682 )

2026-06-05 03:00:22 +02:00

test_mcp_reconnect_args.py

fix: MCP reconnect via tool passes only server_id to connect_server (#1385 )

2026-06-03 03:46:07 +09:00

test_mcp_tool_params_in_prompt.py

fix(mcp): expose MCP tool input parameters to the agent

2026-06-04 12:51:31 +00:00

test_memory_audit_timeout.py

fix(memory): exempt audits from request timeout (#3886 )

2026-06-15 20:27:46 +09:00

test_memory_bullet_extraction.py

Fix memory bullet extraction in service copy

2026-06-03 13:41:46 +09:00

test_memory_extract_chat_nondict.py

fix: chat memory extraction crashes on a non-dict message (#1749 )

2026-06-03 13:25:48 +09:00

test_memory_extraction_parse.py

fix(memory): make auto-memory extraction reliable for reasoning models (#3190 )

2026-06-08 19:57:44 +02:00

test_memory_extractor_rows.py

Skip invalid memory extractor rows (#1535 )

2026-06-03 14:07:00 +09:00

test_memory_extractor_vector_cross_tenant.py

Stub llm_core via monkeypatch.setitem so the cross-tenant test does not leak its fake into later test modules

2026-06-05 00:04:15 +01:00

test_memory_extractor_vector_degraded.py

Update degraded-vector dedup test for owner-scoped vector match

2026-06-04 23:45:13 +01:00

test_memory_fallback_dislike.py

fix(memory): record dislikes as dislikes, not preferences (#2435 )

2026-06-07 16:36:07 +02:00

test_memory_imports.py

refactor(memory): canonicalize memory imports (#50 )

2026-06-04 05:31:15 +01:00

test_memory_owner_isolation.py

test(memory): cover owner isolation for memory search

2026-06-11 22:21:30 +01:00

test_memory_provider.py

feat(memory): add provider interface (#72 )

2026-06-04 16:26:11 +01:00

test_memory_recall_nondict_rows.py

fix: memory recall crashes on a non-dict row from the vector store (#1705 )

2026-06-03 13:35:09 +09:00

test_memory_routes_session_owner.py

fix(memory): validate session owner on manual add (#3807 )

2026-06-11 15:44:10 +02:00

test_memory_validate_entries_nondict.py

fix: memory entry validation crashes on a non-dict row from memory.json (#1691 )

2026-06-03 13:38:02 +09:00

test_merge_last_assistant_rows.py

fix: merge-last-assistant deletes tool/system rows from the DB (history desync) (#1929 )

2026-06-04 19:47:08 +02:00

test_migrate_faiss_to_chroma.py

Skip invalid FAISS migration JSON (#1547 )

2026-06-03 14:11:49 +09:00

test_modal_dock_composer_clearance.py

fix(ui): keep minimized windows above composer (#1197 )

2026-06-02 23:31:09 +09:00

test_model_context.py

fix(agent): don't let a materialized default budget defeat context-window scaling (#4122 )

2026-06-15 15:17:28 +09:00

test_model_discovery_status.py

Reject invalid Tailscale discovery JSON (#1556 )

2026-06-03 14:11:31 +09:00

test_model_helper_owner_scope.py

Scope model helper endpoint resolution (#3007 )

2026-06-07 12:40:23 +02:00

test_model_name_tooltip.py

Add hover tooltips for clipped model names (#1982 ) (#1985 )

2026-06-07 19:23:44 +02:00

test_model_routes.py

fix(models): scope API-token model listing (#4292 )

2026-06-15 16:38:41 +02:00

test_model_sort_js.py

Ignore invalid model sort inputs (#1653 )

2026-06-03 14:16:52 +09:00

test_new_chat_clears_input.py

Clear the composer draft when entering the New Chat / welcome state (#1408 )

2026-06-03 04:07:31 +09:00

test_new_chat_model_preference.py

Chat: prefer active model for new desktop chats

2026-06-02 21:00:50 +09:00

test_nix_upload_text.py

fix: treat Nix files as readable uploads (#2249 )

2026-06-04 12:06:24 +02:00

test_note_reminder_fire_scope.py

Harden note reminder dispatch ownership (#2999 )

2026-06-07 12:52:27 +02:00

test_notes_dom_xss_helpers.py

Guard image and QR DOM attributes (#2500 )

2026-06-04 20:51:23 +02:00

test_notes_fail_closed_auth.py

fix(notes): fail closed when an unauthenticated request reaches owner-scoped routes (#4062 )

2026-06-15 17:43:28 +02:00

test_notes_search_reset_on_reopen_js.py

fix(notes): reset search filter on panel reopen so stale query doesn't hide notes (#2920 )

2026-06-15 11:55:46 +02:00

test_notes_select_esc_listener_js.py

fix(notes): track + remove the select-mode Esc keydown listener so it doesn't leak per open (#2792 )

2026-06-05 16:25:05 +02:00

test_notes_update_due_date.py

Notes: parse natural-language due dates on update

2026-06-02 20:51:16 +09:00

test_null_owner_gates.py

Harden CalDAV write-back with retries (#1193 )

2026-06-15 15:59:31 +09:00

test_odysseus_dispatcher.py

refactor(tests): reuse CLI loader in more tests (#2571 )

2026-06-05 02:42:10 +01:00

test_og_image_extraction.py

fix: source thumbnails dropped for http-only og:image URLs (#667 )

2026-06-02 11:41:33 +09:00

test_ollama_port_detection.py

Add Ollama port path detection regressions (#883 )

2026-06-02 12:24:18 +09:00

test_ordinal_suffix_js.py

fix: monthly schedule label shows 21th/22th/31th (ordinal suffix for days >20) (#1577 )

2026-06-03 08:57:47 +09:00

test_owned_document_query.py

refactor(tools): extract document tools to handle registry (#3666 )

2026-06-10 10:41:52 +02:00

test_parse_due_time_first.py

fix(notes): handle time-first due_date phrases in parse_due_for_user (#3319 )

2026-06-07 19:15:38 +02:00

test_pdf_runtime.py

…

test_personal_delete_file_confinement.py

fix(personal): resolve upload delete path (#4291 )

2026-06-15 16:38:37 +02:00

test_personal_dir_symlink_escape.py

fix: personal-docs path confinement used abspath, allowing symlink escape (#1728 )

2026-06-03 13:29:57 +09:00

test_personal_docs_exclusions.py

Docs: respect path boundary when clearing exclusions

2026-06-02 20:35:44 +09:00

test_personal_docs_keyword_nondict.py

Skip malformed personal keyword index rows

2026-06-03 13:42:05 +09:00

test_personal_docs_lists.py

Save only string personal doc paths (#1566 )

2026-06-03 08:37:29 +09:00

test_personal_docs_office_index.py

Add optional markitdown extraction for Office/EPUB documents (#766 )

2026-06-02 11:28:52 +09:00

test_personal_docs_pdf_index.py

…

test_personal_docs_state_store.py

Ignore invalid personal docs state (#1401 )

2026-06-03 04:02:16 +09:00

test_personal_remove_dir_confinement.py

fix(personal): confine remove_directory_from_rag to PERSONAL_DIR

2026-06-15 15:00:35 +09:00

test_personal_upload_isolation.py

…

test_personal_upload_privilege.py

fix(personal): require document privilege for rag upload (#2990 )

2026-06-07 12:56:53 +02:00

test_plan_mode.py

feat: Add plan mode to the chat agent (#638 )

2026-06-05 16:32:25 +02:00

test_platform_compat.py

fix(platform): read proc version with utf-8

2026-06-11 21:58:22 +01:00

test_popup_opener_isolation_js.py

Isolate HTML popup openers (#2501 )

2026-06-04 20:52:41 +02:00

test_pr_blocker_audit.py

tools: add read-only PR blocker audit helper

2026-06-04 12:51:48 +01:00

test_prefs_atomic_write.py

Persist user prefs atomically (#1840 )

2026-06-04 03:55:22 +01:00

test_prefs_routes.py

Ignore non-object prefs JSON (#1257 )

2026-06-03 14:12:45 +09:00

test_prefs_single_user_no_clobber.py

fix: disabling auth wipes all users' preferences on next pref save (#1764 )

2026-06-03 13:23:50 +09:00

test_preset_atomic_save.py

fix(presets): persist presets atomically to avoid corruption on crash (#2169 )

2026-06-08 19:16:37 +02:00

test_preset_expand_owner_scope.py

fix(presets): scope expand-prompt model resolution to owner (#3477 )

2026-06-08 21:12:02 +02:00

test_preset_fill_missing_defaults.py

Presets: fill missing built-in defaults on load

2026-06-02 20:32:08 +09:00

test_preset_local_storage_js.py

Keep presets loading with bad local state (#1417 )

2026-06-03 04:09:28 +09:00

test_preset_store_shape.py

Fall back from invalid preset stores (#1402 )

2026-06-03 14:12:31 +09:00

test_promote_image_fields.py

fix(images): render agent-generated images in chat (#2809 )

2026-06-05 13:04:33 +02:00

test_prompt_security.py

fix(security): harden untrusted_context_message against delimiter spoofing (#3086 )

2026-06-07 22:15:50 +01:00

test_provider_classification.py

feat(providers): add NVIDIA AI provider endpoint support (#3456 )

2026-06-09 11:06:12 +02:00

test_provider_detection.py

test(provider): align lookalike-host URL expectations with /models behavior

2026-06-15 23:21:49 +09:00

test_provider_device_flow_js.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_provider_endpoints.py

fix(models): probe /v1/models for path-less LM Studio endpoints

2026-06-15 15:09:50 +09:00

test_providers_mixtral_logo_js.py

fix: Mixtral and Ministral models render with no provider logo (#1640 )

2026-06-03 14:23:21 +09:00

test_public_blocked_tool_nonstring.py

fix: is_public_blocked_tool crashes on a truthy non-string tool name (#1620 )

2026-06-03 14:11:14 +09:00

test_question_type_detection.py

fix: research query misclassifies 'whatsapp'/'however' as questions (#1247 )

2026-06-03 01:10:06 +09:00

test_rag_keyword_fallback_owner.py

fix: RAG keyword fallback leaked owner-less documents across users (#1722 )

2026-06-03 13:31:33 +09:00

test_rag_manager_owner_compat.py

fix(rag): forward owner through manager wrapper (#2991 )

2026-06-07 12:56:57 +02:00

test_rag_remove_directory_scope.py

Fix RAG remove_directory wiping the entire shared collection (#1660 ) (#1734 )

2026-06-03 13:29:51 +09:00

test_rag_server_directory_nonstring.py

fix: rag_server add/remove_directory crashes on a non-string directory arg (#1614 )

2026-06-03 08:36:45 +09:00

test_rag_vector_id_stability.py

fix(tests): use current python for rag id stability (#1817 )

2026-06-04 03:49:59 +01:00

test_rate_limiter.py

…

test_readiness.py

feat: add /api/ready readiness probe (DB, data dir, local-first) (#1200 )

2026-06-02 23:33:22 +09:00

test_readme_ascii_fenced.py

test: align README presentation guards with the #4306 refresh (#4311 )

2026-06-15 16:25:38 +01:00

test_realesrgan_torchvision_compat.py

fix(image): patch realesrgan torchvision compatibility (#4110 )

2026-06-15 15:16:41 +09:00

test_rename_user_case_insensitive.py

refactor(tests): reuse import-state helper in auth tests

2026-06-05 11:10:41 +01:00

test_rename_user_owner_sync.py

fix(uploads): migrate upload ownership on rename (#3617 )

2026-06-11 16:01:04 +02:00

test_rename_user_token_cache.py

fix: renaming a user leaves their API tokens resolving to the old owner (#1932 )

2026-06-04 20:37:59 +02:00

test_replace_messages_multimodal.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

test_reply_all_cc_nonstring_js.py

fix: reply-all Cc builder crashes on a non-string To or Cc field (#1700 )

2026-06-03 13:37:22 +09:00

test_reply_recipients_js.py

fix: reply-all Cc's the user's own other addresses (multi-account) (#672 )

2026-06-02 11:42:20 +09:00

test_research_chat_stream_owner.py

fix: pass owner to start_research in chat stream path (#1265 )

2026-06-03 02:32:38 +09:00

test_research_endpoint_owner_scope.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_research_handler_analyzed_urls.py

fix(research): track analyzed URLs separately (#3125 )

2026-06-10 12:08:22 +01:00

test_research_handler_path_confinement.py

Constrain research handler JSON paths (#2846 )

2026-06-05 13:20:02 +02:00

test_research_handler_raw_nondict.py

fix: a non-dict finding silently drops all raw research findings (#1739 )

2026-06-03 13:29:29 +09:00

test_research_handler_sources_nondict.py

fix: research source extraction crashes on a non-dict finding (#1714 )

2026-06-03 13:34:40 +09:00

test_research_owner_scope_routes.py

refactor(constants): single source of truth for data dir (#3368 )

2026-06-08 09:58:52 +02:00

test_research_probe_errors.py

Surface deep research probe errors (#1086 )

2026-06-02 22:51:25 +09:00

test_research_query_fallback.py

Deep research: don't treat a bare 'yes' as the research topic (#858 )

2026-06-02 11:30:53 +09:00

test_research_report_read.py

Route "read that report" to manage_research instead of the HTML render (#1375 )

2026-06-03 03:24:09 +09:00

test_research_service.py

Skip invalid research service sources (#1583 )

2026-06-03 08:57:09 +09:00

test_research_session_id_validation.py

fix(research): validate session_id to block path traversal

2026-06-01 23:25:38 +01:00

test_research_source_link_xss.py

Whitelist research source links (#2499 )

2026-06-04 20:41:35 +02:00

test_research_status_avg_duration.py

fix(research): stop rescanning the research dir on every status poll (#3637 )

2026-06-10 17:40:44 +02:00

test_research_utils_low_quality_nonstring.py

Treat non-string research summaries as low quality

2026-06-03 13:42:24 +09:00

test_research_utils.py

…

test_resend_message_nondestructive.py

fix(chat): make resend message non-destructive

2026-06-15 15:02:48 +09:00

test_reserved_username_admin_escalation.py

fix(auth): drop reserved usernames loaded from auth config (#3727 )

2026-06-10 16:31:26 +02:00

test_resolve_endpoint_fallbacks.py

Add resolve_endpoint fallback chain regressions (#890 )

2026-06-02 12:24:50 +09:00

test_resolve_session_auth_chatgpt.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_resolve_upload_path_nondict.py

fix: _resolve_user_upload_path crashes on a non-dict resolve_upload result (#1715 )

2026-06-03 13:34:33 +09:00

test_review_regressions.py

fix(agent): honor auth-disabled tool access after setup

2026-06-15 15:01:48 +09:00

test_rewrite_persist_column.py

fix(tests): add endpoint URLs to remaining session fixtures

2026-06-04 03:14:43 +01:00

test_route_validators.py

fix(hwfit): validate remote SSH detection targets (#3718 )

2026-06-11 00:43:49 +02:00

test_run_focus.py

test: mark first slow tests from duration evidence (#3711 )

2026-06-10 01:07:38 +02:00

test_run_order_report.py

test: add report-only order-sensitivity runner (#3982 )

2026-06-15 15:49:47 +09:00

test_sanitize_multimodal_merge.py

fix: merging consecutive user messages corrupts multimodal (image) content (#1277 )

2026-06-03 01:21:57 +09:00

test_sanitize_preserves_reasoning.py

fix: preserve reasoning_content in sanitized messages for Moonshot/Kimi (#3152 )

2026-06-09 21:44:38 +01:00

test_schedule_email_offset_normalization.py

Normalize scheduled email offsets before storage

2026-06-03 13:44:18 +09:00

test_scheduler_restart_doublefire.py

Replace task scheduler utcnow calls (#1456 )

2026-06-03 14:14:30 +09:00

test_scheduler_scheduled_time_validation.py

fix(scheduler): fail closed on malformed scheduled_time instead of 500 (#1410 )

2026-06-03 14:12:07 +09:00

test_search_analytics_defaults.py

refactor(search): make src analytics a service shim (#2264 )

2026-06-04 18:57:24 +02:00

test_search_cache_invalidation.py

Fix invalidate_search_cache using a key that never matches stored entries (#852 )

2026-06-02 10:53:33 +09:00

test_search_config_no_key_leak.py

Stop GET /api/search/config from leaking the Brave API key (#1661 ) (#1750 )

2026-06-03 13:24:17 +09:00

test_search_config_provider_key.py

Report provider-specific search API keys correctly (#1202 )

2026-06-02 23:37:15 +09:00

test_search_content_block_source_index.py

fix: web search content blocks numbered by fetch completion order break citations (#1672 )

2026-06-03 14:22:55 +09:00

test_search_content_extraction_parity.py

fix(search): catch HTTPStatusError so 403/404 URLs degrade gracefully instead of 500 (#2203 )

2026-06-08 01:09:21 +01:00

test_search_content_url_guards.py

chore: deduplicate src/search modules (cache, content, query) into shims (#2506 )

2026-06-04 18:10:55 +02:00

test_search_module_consolidation.py

chore: deduplicate src/search modules (cache, content, query) into shims (#2506 )

2026-06-04 18:10:55 +02:00

test_search_provider_json.py

fix(search): degrade to empty results on non-JSON provider responses (#1129 ) (#1352 )

2026-06-03 14:24:23 +09:00

test_search_query_entities_nonstring.py

fix: _extract_entities crashes on a non-string query (#1724 )

2026-06-03 13:30:28 +09:00

test_search_query_nonstring.py

fix: search query helpers crash on a non-string query (#1604 )

2026-06-03 08:36:01 +09:00

test_search_query.py

…

test_search_ranking_recency.py

Reapply "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus"

2026-06-03 22:47:00 +09:00

test_search_ranking_sports_substring.py

fix: sports-hint ranking penalty fires on 'transport'/'passport' substrings (#1473 )

2026-06-03 14:23:52 +09:00

test_search_ranking_subject_substring.py

Word-boundary match for snippet and subject-term ranking (#1473 follow-up) (#2556 )

2026-06-05 08:04:31 +01:00

test_search_ranking.py

Reapply "Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus"

2026-06-03 22:47:00 +09:00

test_search_service_nondict_rows.py

fix(tests): update search service mock to match current API signature (#2334 )

2026-06-04 14:19:51 +01:00

test_searchservice_search_call.py

fix: SearchService.search() calls comprehensive_web_search incorrectly (broken public API) (#1720 )

2026-06-03 13:33:56 +09:00

test_searxng_image_pinned.py

Pin the SearXNG image so a broken :latest can't block startup (#1419 )

2026-06-03 03:56:54 +09:00

test_security_headers_middleware.py

fix(security): add HSTS and Permissions-Policy to SecurityHeadersMiddleware (#3081 )

2026-06-07 04:58:33 +01:00

test_security_headers_pdf_preview.py

fix(documents): restore PDF library metadata and preview (#2483 )

2026-06-07 23:23:27 +02:00

test_security_regressions.py

test: align README presentation guards with the #4306 refresh (#4311 )

2026-06-15 16:25:38 +01:00

test_select_dropdown_theme_css.py

Normalize native select option theming (#1178 )

2026-06-02 23:09:15 +09:00

test_sender_signature_skip_roles.py

fix: signature learning never skips support@/info@/admin@ senders (#1773 )

2026-06-03 13:22:52 +09:00

test_serve_profiles.py

fix(hwfit): serve profiles for sub-8192 context models

2026-06-15 15:02:22 +09:00

test_service_health.py

feat(diagnostics): add consolidated service health endpoint for degraded-state reporting (#964 )

2026-06-09 16:00:24 +01:00

test_service_search_provider_guards.py

chore: Switch duckduckgo-search to ddgs (#3143 )

2026-06-10 17:59:47 +02:00

test_services_research_low_quality_sources.py

fix: services research lists junk no-content pages as cited sources (#1669 )

2026-06-03 14:22:58 +09:00

test_services_search_analytics_defaults.py

Merge search analytics defaults in services copy

2026-06-03 13:45:07 +09:00

test_session_actions_cleanup.py

fix(sessions): keep fresh chats during auto tidy (#1871 )

2026-06-09 01:06:20 +01:00

test_session_concurrent.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

test_session_context_excludes_slash.py

fix: exclude slash-command/setup messages from LLM context (#2634 ) (#2640 )

2026-06-04 21:42:23 +02:00

test_session_endpoint_owner_scope.py

Harden session endpoint owner scope (#1308 )

2026-06-03 02:40:22 +09:00

test_session_export_filename.py

fix: _sanitize_export_filename crashes on a non-string session name (#1607 )

2026-06-03 08:35:47 +09:00

test_session_export_nonstring_content.py

Fix session export 500 on multimodal/None message content (#1984 )

2026-06-04 12:53:44 +01:00

test_session_ghost_delete.py

allow users who disable auth to use chat (#2548 )

2026-06-05 22:54:19 +02:00

test_session_list_owner_scope.py

fix(sessions): scope enrichment queries by owner, add LIMIT to auto_sort (#3350 )

2026-06-07 21:32:21 +02:00

test_session_manager_cleanup.py

Fix session cleanup cutoff timezone (#2488 )

2026-06-05 09:52:34 +02:00

test_session_manager_persist_guard.py

Guard session message persistence after delete (#1451 )

2026-06-03 14:24:01 +09:00

test_session_manager.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

test_session_mode_helpers.py

…

test_session_owner_attribution.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_session_search_batch_fetch.py

fix(search): batch FTS hit lookups into one query (N+1) (#3909 )

2026-06-11 16:31:54 +02:00

test_session_search.py

feat(search): unify session transcript search (#2877 )

2026-06-05 18:08:31 -06:00

test_set_admin.py

feat(auth): add per-user admin promote/demote toggle (#3078 )

2026-06-15 10:44:27 +00:00

test_settings_error_paths.py

fix(settings): catch PermissionError in load_settings + error-path tests (#1570 )

2026-06-03 14:23:27 +09:00

test_settings_scrub.py

fix(settings): scrub camelCase secret keys (#3707 )

2026-06-11 12:53:33 +02:00

test_settings_store_shape.py

Fall back from invalid settings stores (#1416 )

2026-06-03 03:53:05 +09:00

test_setup_admin_user.py

refactor(constants): single source of truth for data dir (#3368 )

2026-06-08 09:58:52 +02:00

test_setup_device_auth_static.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_shell_routes.py

fix(image): patch realesrgan torchvision compatibility (#4110 )

2026-06-15 15:16:41 +09:00

test_shell_service.py

fix: use running loop for shell stream deadlines (#1694 )

2026-06-03 13:37:46 +09:00

test_signature_fold_js.py

Ignore non-string signature fold metadata (#1655 )

2026-06-03 14:16:48 +09:00

test_signature_fold_self_closing_br_js.py

fix: signature delimiter fold misses self-closing <br/> breaks (#1774 )

2026-06-03 13:22:46 +09:00

test_signature_route_hardening.py

Constrain signature uploads to PNG data (#2844 )

2026-06-05 13:17:43 +02:00

test_signature_settings_dom_xss.py

Constrain signature uploads to PNG data (#2844 )

2026-06-05 13:17:43 +02:00

test_skill_edit_no_collapse_on_outside_click_js.py

fix(skills): keep edit mode open on outside-the-textarea click (#4011 )

2026-06-15 20:31:11 +09:00

test_skill_extractor_json.py

fix(memory): reject ambiguous multi-object outputs during skill extraction (#3985 )

2026-06-15 10:44:43 +00:00

test_skill_extractor_rows.py

Skip invalid skill extractor rows (#1546 )

2026-06-03 14:06:53 +09:00

test_skill_extractor_stray_brace.py

fix(memory): reject ambiguous multi-object outputs during skill extraction (#3985 )

2026-06-15 10:44:43 +00:00

test_skill_importer.py

feat(skills): import SKILL.md bundles from public GitHub URLs (#2576 )

2026-06-05 19:48:23 +02:00

test_skill_index_prompt_injection.py

fix(agent): scope skill index to owner (#2404 )

2026-06-09 09:51:29 +02:00

test_skill_index_toolset_gating.py

fix(agent): skill-prescribed tools never reach the model's schema list (#4008 )

2026-06-15 20:32:43 +09:00

test_skill_save_no_rename.py

fix(skills): markdown save must not rename the skill, so delete keeps working (#1333 ) (#1365 )

2026-06-03 03:16:11 +09:00

test_skills_delete_owner.py

Skills: delete owner-scoped skills with owner

2026-06-02 20:28:36 +09:00

test_skills_manager_owner_isolation.py

Scope skills usage by owner (#1312 )

2026-06-03 02:27:43 +09:00

test_skills_routes_nondict.py

fix: skill test-task / precision helpers crash on a non-dict skill (#1638 )

2026-06-03 08:59:24 +09:00

test_skills_routes_owner_update.py

Fix owner-scoped skill updates (#1240 )

2026-06-03 00:42:56 +09:00

test_skills_tag_token_match.py

fix: skill retrieval boosts on tag substrings (e.g. 'ai' tag for any 'email' query) (#1406 )

2026-06-03 14:24:11 +09:00

test_slash_autocomplete_static.py

feat: add ChatGPT Subscription provider (#2876 )

2026-06-08 10:19:18 +02:00

test_snap_other_layers_nonarray_js.py

fix: computeSnap throws when ctx.otherLayers is not an array (#1716 )

2026-06-03 13:34:25 +09:00

test_speech_service_toggles.py

Honor disabled speech service toggles (#814 )

2026-06-02 10:44:39 +09:00

test_split_chunks_no_duplicate_tail.py

fix(tests): use non-repeating split chunk fixture

2026-06-04 18:11:42 +01:00

test_sqlite_foreign_keys.py

refactor(tests): centralize fake database import-state cleanup

2026-06-05 12:27:44 +01:00

test_src_search_query_nonstring.py

chore: deduplicate src/search modules (cache, content, query) into shims (#2506 )

2026-06-04 18:10:55 +02:00

test_streaming_segmenter_js.py

fix(chat): stop code-block button flicker during streaming (#3023 )

2026-06-06 04:08:54 -06:00

test_strip_reasoning_prose_dataloss.py

fix: _strip_reasoning_prose discards the answer when reasoning trails it (#1643 )

2026-06-03 14:23:15 +09:00

test_strip_think.py

fix: normalize Gemma 4 thought-channel output (#2224 )

2026-06-04 19:26:58 +02:00

test_stt_leak.py

STT: clean temp audio files on transcription failure

2026-06-02 20:43:24 +09:00

test_task_chain_owner_scope.py

Enforce task chain owner scope (#3006 )

2026-06-07 12:43:43 +02:00

test_task_scheduler_cancel.py

Tasks: clean up queued cancellation state

2026-06-02 20:51:21 +09:00

test_task_scheduler_session_delivery.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

test_task_session_folder.py

feat(tasks): assign folder='Tasks' at creation + backfill migration (#2834 )

2026-06-07 15:33:17 +02:00

test_taxonomy.py

test(taxonomy): auto-mark tests by area and sub-area (#3491 )

2026-06-09 01:13:28 +02:00

test_teacher_audit_owner_scope.py

fix(presets): scope expand-prompt model resolution to owner (#3477 )

2026-06-08 21:12:02 +02:00

test_teacher_eval_nonstring_reply.py

fix: evaluate_turn_regex crashes on a non-string agent_reply (#1723 )

2026-06-03 13:31:26 +09:00

test_tile_manager_snap_zones_js.py

fix(ui): restore all-edge modal snap zones (#2260 )

2026-06-15 12:36:34 +02:00

test_tls_overrides_scope.py

Support extra CA bundle for private-CA LLM providers (#769 )

2026-06-04 13:18:50 +01:00

test_tool_index_keyword_boundaries.py

fix(tests): restore Python CI baseline regressions

2026-06-05 10:31:38 +01:00

test_tool_parsing_nonstring.py

fix: tool-block parsing crashes on a non-string input (#1628 )

2026-06-03 08:59:42 +09:00

test_tool_path_confinement.py

fix(tools): strict path confinement with sensitive-subpath deny list (#1072 )

2026-06-02 23:13:30 +09:00

test_tool_policy.py

refactor(tools): remove dead workspace-confinement plumbing (#3590 )

2026-06-09 08:30:50 +02:00

test_tool_rag_contacts_domain.py

fix(agent): add contacts domain to tool classifier

2026-06-15 15:03:19 +09:00

test_tool_rag_keyword_hints.py

fix(agent): honor explicit web search requests

2026-06-15 15:02:10 +09:00

test_tool_support_heuristic.py

fix(agent): keep gpt-oss on text tool mode

2026-06-15 15:11:52 +09:00

test_tool_utils_import_clean.py

refactor(tools): consolidate duplicated _truncate and get_mcp_manager into src/tool_utils (#3478 )

2026-06-09 01:05:30 +02:00

test_topic_analyzer.py

refactor(tests): centralize fake database import-state cleanup

2026-06-05 12:27:44 +01:00

test_totp_failclosed.py

fix: 2FA bypassed when enabled but TOTP secret is missing (fail-open) (#1286 )

2026-06-03 01:26:47 +09:00

test_truncate_message_count_regression.py

fix: session context drifting — messages leaking between chats (#135 ) (#267 )

2026-06-09 14:12:52 +01:00

test_tts_cache_stats.py

TTS: include mp3 files in cache stats

2026-06-02 20:43:29 +09:00

test_tts_speed_malformed.py

fix(tts): tolerate a malformed tts_speed instead of 500-ing (#1450 )

2026-06-03 14:12:03 +09:00

test_ui_control_rag_toggle.py

fix: ui_control rejects the advertised rag toggle (#1763 )

2026-06-03 13:24:00 +09:00

test_unknown_tool_calls.py

test(tool_execution): stop two tests leaking src.tool_execution into the suite (#2686 )

2026-06-09 16:35:10 +01:00

test_update_database_script.py

Remove duplicate update database body (#1584 )

2026-06-03 08:57:03 +09:00

test_update_plan_tool.py

feat: Add plan mode to the chat agent (#638 )

2026-06-05 16:32:25 +02:00

test_upload_error_surfaced.py

Surface upload failures instead of silently dropping the files (#1425 )

2026-06-03 04:12:23 +09:00

test_upload_handler_atomicity.py

Ignore stale duplicate upload rows (#1256 )

2026-06-03 00:59:01 +09:00

test_upload_handler_rename_owner.py

fix(uploads): migrate upload ownership on rename (#3617 )

2026-06-11 16:01:04 +02:00

test_upload_id_extension.py

fix: uploads with _ or - in the extension become permanently unreadable (#1756 )

2026-06-03 13:28:45 +09:00

test_upload_id_validation.py

fix: uploaded files with no extension become permanently unresolvable (#1275 )

2026-06-03 01:16:30 +09:00

test_upload_limits_centralized.py

refactor(uploads): centralize upload byte-limits in upload_limits.py (#3364 ) (#3518 )

2026-06-09 01:24:30 +02:00

test_upload_multifile.py

Fix multi-file uploads tripping the per-IP concurrency guard (#1346 ) (#1362 )

2026-06-03 04:04:19 +09:00

test_upload_routes_owner_scope.py

Constrain upload paths to upload root (#2825 )

2026-06-05 13:15:23 +02:00

test_url_safety.py

fix: SSRF hardening for the custom embedding endpoint URL (#132 ) (#1206 )

2026-06-02 23:46:33 +09:00

test_user_time.py

fix(chat): stabilize system prompt, sequence memory extraction, and send stable session id to preserve KV cache (#3360 )

2026-06-09 22:46:54 +01:00

test_vault_password_not_in_argv.py

Keep Bitwarden unlock password off argv (#1311 )

2026-06-03 02:13:51 +09:00

test_venice_hosts.py

Treat Venice as a tool-capable SOTA cloud provider (#1173 )

2026-06-02 23:03:46 +09:00

test_vision_model_detection.py

Recognize gemma3/llama4/mistral-small3.1+/multimodal as vision models (#1430 )

2026-06-03 04:17:40 +09:00

test_vision_owner_scope.py

Scope vision model resolution by owner (#3009 )

2026-06-07 12:39:02 +02:00

test_visual_report_icon_url.py

fix: visual report drops photos whose URL slug contains icon or logo (#1685 )

2026-06-03 14:22:45 +09:00

test_visual_report_nonstring.py

fix: visual_report markdown helpers crash on a non-string input (#1633 )

2026-06-03 14:06:35 +09:00

test_visual_report.py

…

test_warmup_ping_urls.py

fix(startup): ping real endpoints in warmup/keepalive (#3641 )

2026-06-10 19:21:45 +02:00

test_web_fetch_plaintext.py

fix(search): read plain-text, Markdown, and JSON URLs in fetch_webpage_content (#3809 )

2026-06-11 14:24:53 +00:00

test_web_search_raw_json_tool_call.py

fix(agent): parse raw json web search calls (#4088 )

2026-06-15 15:19:38 +09:00

test_web_search_time_filter.py

fix(tool-schemas): preserve web_search time_filter through native tool-call conversion (#2757 )

2026-06-05 08:00:59 +01:00

test_web_search_tool_icon_js.py

fix(ui): raw SVG markup displayed instead of search icon for web_search tool label (#3601 )

2026-06-10 16:50:43 +02:00

test_webhook_sanitize_error_ipv6.py

fix(webhooks): redact IPv6 addresses in sanitized error messages (#3038 )

2026-06-07 04:55:33 +01:00

test_webhook_ssrf_resilience.py

fix(tests): make webhook SSRF test clean-worktree deterministic

2026-06-05 08:16:28 +01:00

test_webhook_task_refs.py

fix(tests): isolate webhook task reference imports

2026-06-15 14:57:47 +09:00

test_webhook_trigger_auth_exempt.py

Exempt task webhook trigger from session auth (#784 )

2026-06-02 11:23:40 +09:00

test_windows_update_script.py

Windows: add Docker update script

2026-06-02 20:45:32 +09:00

test_workspace_confine.py

feat(agent): confine agent file/shell tools to a selectable workspace (#3665 )

2026-06-11 18:17:54 +02:00

test_youtube_comments_timeout.py

YouTube: enforce comment fetch timeout while waiting

2026-06-02 20:44:24 +09:00

test_youtube_extract_id_nonstring.py

fix: extract_youtube_id crashes on a non-string url instead of returning None (#1689 )

2026-06-03 13:38:11 +09:00

test_youtube_handler_consolidation.py

fix(youtube): consolidate duplicate handler

2026-06-15 15:03:41 +09:00

test_youtube_svc_comments_nondict.py

fix: youtube (services) comment formatter crashes on a non-dict comment (#1746 )

2026-06-03 13:29:01 +09:00

test_youtube_transcript_seg_nondict.py

fix: youtube transcript formatter crashes on a non-dict segment (#1745 )

2026-06-03 13:29:08 +09:00

TESTING_STANDARD.md

test: move area_cli tests into cli directory (#3842 )

2026-06-11 17:01:14 +00:00

README.md

Test Suite Notes

Purpose

This file documents the shared test helpers and the review expectations that go with them. The suite is being refactored incrementally, so this is a working reference for that effort - not a claim that the suite is already fully organized. Read it before adding a new helper or before reviewing a PR that touches tests/helpers/.

For the broader rules - test taxonomy, determinism/isolation rules, the behavioral-vs-source-text policy, and helper/factory extraction rules - see TESTING_STANDARD.md. This file is the concrete helper reference; that file is the standard the refactor works toward.

Running focused subsets (taxonomy markers)

tests/conftest.py tags every test at collection time with two markers derived from its filename by tests/_taxonomy.py: an area_* marker (e.g. area_security) and a finer sub_* marker (e.g. sub_owner_scope). This adds markers only - it moves no files and changes no test behavior. Use them to run a focused slice:

python3 -m pytest -m area_security
python3 -m pytest -m "area_services and sub_cookbook"

Areas are security, routes, services, cli, js, helpers, unit, and uncategorized. Classification is conservative and token-based: a file that matches no area keyword falls back to area_uncategorized with its filename as the sub-area. The area_* names are registered in pyproject.toml; the dynamic sub_* names are registered before collection by pytest_configure in tests/conftest.py, so unknown-mark warnings still flag genuine typos.
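As a rough illustration of the conservative, token-based classification described above, the sketch below derives an area and sub-area marker from a test filename. The keyword table, the function name classify, and the rule for choosing the sub-area are assumptions made for this example; the real logic lives in tests/_taxonomy.py and may differ.

```python
from pathlib import Path

# Hypothetical keyword table; the real one in tests/_taxonomy.py may differ.
AREA_KEYWORDS = {
    "security": ("owner_scope", "ssrf", "xss", "totp"),
    "services": ("service", "youtube", "tts"),
    "js": ("js",),
}


def classify(filename: str) -> tuple[str, str]:
    """Return (area marker, sub-area marker) for a test file name."""
    name = Path(filename).stem.removeprefix("test_")
    tokens = name.split("_")
    for area, keywords in AREA_KEYWORDS.items():
        for keyword in keywords:
            # Match whole tokens only, so "js" does not match "json".
            kw_tokens = keyword.split("_")
            n = len(kw_tokens)
            if any(tokens[i:i + n] == kw_tokens for i in range(len(tokens) - n + 1)):
                return f"area_{area}", f"sub_{keyword}"
    # Conservative fallback: no keyword matched.
    return "area_uncategorized", f"sub_{name}"
```

Matching on whole tokens rather than substrings is what keeps the classification conservative: a file such as test_skill_extractor_json.py does not land in the js area just because its name contains those letters.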

For common focused runs, use tests/run_focus.py. It validates area and sub-area names, accepts sub-areas with or without the sub_ prefix, and passes extra pytest arguments after --:

python3 tests/run_focus.py --area security
python3 tests/run_focus.py --area services --sub-area cookbook
python3 tests/run_focus.py --sub-area sub_cookbook
python3 tests/run_focus.py --keyword taxonomy
python3 tests/run_focus.py --last-failed
python3 tests/run_focus.py --dry-run --area services --sub-area cookbook
python3 tests/run_focus.py --area services -- --maxfail=1 -q

Fast lane and duration visibility

--fast runs the fast lane: the tests that are not marked slow (it adds the marker expression not slow). It composes with --area/--sub-area using and. Because there may not be any tests marked slow yet, --fast can initially match the full focused selection; it becomes a real speed-up as slow marks are added from duration evidence. Use it for quick local or reviewer feedback; it does not replace broader focused or full-suite validation before merge.

--durations N and --durations-min FLOAT add pytest's slowest-test reporting so you can see where time goes. They are reporting only and do not count as a focus selector, so --durations must be combined with a real selector (--area, --sub-area, --keyword, --last-failed, or --fast).

Activate or otherwise use the project Python environment before running these commands. The examples use python3 intentionally to avoid hard-coding a local venv path.

python3 tests/run_focus.py --fast
python3 tests/run_focus.py --area services --fast
python3 tests/run_focus.py --area services --durations 25
python3 tests/run_focus.py --area services --fast --durations 25 --durations-min 0.05
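The way the selectors combine can be sketched as a small function that builds a single pytest -m expression. This is an assumption about the shape of the logic, not the actual code in tests/run_focus.py; the function name build_marker_expression and its parameters are hypothetical.

```python
def build_marker_expression(area=None, sub_area=None, fast=False):
    """Combine focus selectors into one pytest -m expression (illustrative)."""
    parts = []
    if area:
        parts.append(f"area_{area}")
    if sub_area:
        # Accept the sub-area with or without the sub_ prefix.
        name = sub_area if sub_area.startswith("sub_") else f"sub_{sub_area}"
        parts.append(name)
    if fast:
        # The fast lane is everything that is not marked slow.
        parts.append("not slow")
    return " and ".join(parts)


expression = build_marker_expression(area="services", sub_area="cookbook", fast=True)
```

The resulting string is what would be passed to pytest, for example python3 -m pytest -m "area_services and sub_cookbook and not slow".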

The slow marker is opt-in. Mark a test slow only with duration evidence (from --durations), not by guessing; see the fast-lane policy in TESTING_STANDARD.md. A slow mark only excludes a test from the fast lane; the test stays runnable directly, e.g.:

python3 -m pytest tests/test_auth_config_lock_concurrency.py
python3 -m pytest -m slow

Order-sensitivity reporting (report-only)

tests/run_order_report.py runs pytest with the collected test items shuffled by a seeded RNG, to surface order-sensitive tests (hidden coupling through shared import state, module caches, databases, etc.). It is report-only: it is not wired into CI, adds no gate, and changes no normal pytest collection or ordering - the shuffle exists only inside this runner. The seed is always printed, and pytest targets/options go after a literal --:

python3 tests/run_order_report.py --seed 123 -- tests/cli/ -q
python3 tests/run_order_report.py -- tests/cli/ -q   # generates and prints a seed

The same seed reproduces the same order when the reported working directory, pytest target arguments, and test environment are also the same. The runner prints all command arguments with shell-safe POSIX quoting and uses the invoking Python interpreter.
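The reproducibility claim rests on a seeded random number generator. The following is a minimal sketch of that idea using the standard library, assuming the runner shuffles a list of collected test IDs; the function name shuffled_order and the sample IDs are made up for this example and are not taken from tests/run_order_report.py.

```python
import random


def shuffled_order(test_ids, seed):
    """Return a new list with test_ids shuffled by a seeded RNG."""
    rng = random.Random(seed)  # private RNG: the global random state is untouched
    order = list(test_ids)
    rng.shuffle(order)
    return order


ids = ["test_a", "test_b", "test_c", "test_d", "test_e"]
first = shuffled_order(ids, seed=284734921)
second = shuffled_order(ids, seed=284734921)
```

With the same seed and the same input list, first and second are identical. A different set of collected tests (for example, different pytest targets) changes the input list and therefore the order, which is why the targets and environment must match as well.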

A generated-seed run starts with output like:

[order-report] working directory: /path/to/odysseus
[order-report] shuffling test order with seed 284734921
[order-report] reproduce from this working directory with the same test environment:
[order-report] reproduce with: /path/to/odysseus/.venv/bin/python /path/to/odysseus/tests/run_order_report.py --seed 284734921 -- tests/cli/ -q

Run the printed command from the reported working directory to reproduce the same fixed-seed order:

[order-report] working directory: /path/to/odysseus
[order-report] shuffling test order with seed 284734921
[order-report] reproduce from this working directory with the same test environment:
[order-report] reproduce with: /path/to/odysseus/.venv/bin/python /path/to/odysseus/tests/run_order_report.py --seed 284734921 -- tests/cli/ -q

Pytest output remains visible between the report header and footer. A failing run ends with pytest's normal failure report followed by:

FAILED tests/example_test.py::test_example - AssertionError
[order-report] seed 284734921: pytest exit code 1 (report-only; fix order-sensitive failures in separate scoped PRs)

Failures discovered this way are real isolation bugs: fix them in separate scoped PRs - do not silence them with skip/xfail, and do not "fix" them by depending on a particular order.

The runner propagates pytest's exit code, so it composes with normal local workflows; "report-only" means it is not a CI gate, not that failures are swallowed.
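Exit-code propagation and the shell-safe reproduce line can be sketched with the standard library as follows. This is an assumption about how such a runner might be structured, not the actual implementation; run_and_report is a hypothetical name, and a short child Python process stands in for a failing pytest run.

```python
import shlex
import subprocess
import sys


def run_and_report(args, seed):
    """Run a child Python command, print a footer, and return its exit code."""
    cmd = [sys.executable, *args]  # use the invoking interpreter
    print(f"[order-report] reproduce with: {shlex.join(cmd)}")  # POSIX-safe quoting
    result = subprocess.run(cmd)
    print(f"[order-report] seed {seed}: exit code {result.returncode} (report-only)")
    return result.returncode


# Stand-in for a failing pytest run: a child process that exits with 1.
code = run_and_report(["-c", "import sys; sys.exit(1)"], seed=123)
```

A real runner would finish with sys.exit(code) so that the calling shell or script sees the same status pytest returned.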

Core principles

Keep PRs small and homogeneous: one kind of change per PR.
Prefer explicit local setup over hidden global fixtures.
Avoid expanding the root conftest.py unless absolutely necessary.
Do not mix file moves with logic changes in the same PR.
Do not weaken tests with skip/xfail just to make CI pass.
Validate the focused files you changed, plus any neighboring or order-sensitive groups they interact with.

Helper conventions

The helpers below live under tests/helpers/. They exist to remove repeated boilerplate that already appeared across multiple tests. Reach for one only when your test matches its intended use; do not stretch a helper to cover a new case.

`tests.helpers.cli_loader.load_script`

Use when a test needs to import a script under scripts/ without repeating SourceFileLoader / importlib.util boilerplate.

Intended for script/CLI tests that load a single file from scripts/.
Not for arbitrary package imports - use a normal import for those.
When migrating an existing test to it, keep the existing stubs and assertions unchanged. Any sys.modules stubs the script needs at import time must still be injected (e.g. via monkeypatch) before calling load_script.
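For context, the boilerplate this helper replaces looks roughly like the sketch below, written with importlib.util. The name load_script_sketch and its signature are assumptions for illustration; the real load_script may take different arguments or behave differently (for example, in whether it registers the module in sys.modules).

```python
import importlib.util
import tempfile
from pathlib import Path


def load_script_sketch(path, module_name):
    """Import a single file as a module (not registered in sys.modules here)."""
    spec = importlib.util.spec_from_file_location(module_name, path)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)  # runs the script's top-level code
    return module


# Demonstrate with a throwaway script instead of a real file under scripts/.
with tempfile.TemporaryDirectory() as tmp:
    script = Path(tmp) / "demo_script.py"
    script.write_text("def main():\n    return 'ok'\n")
    module = load_script_sketch(script, "demo_script")
    result = module.main()
```

Because exec_module runs the script's top-level code, any stubs the script imports at load time have to be in place before the call, which is the reason for the ordering rule above.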

`tests.helpers.import_state.clear_module`

Use when a test must drop one cached module and its parent-package attribute before a fresh import.

Clears sys.modules[name].
Clears the parent-package attribute when present.
Good replacement for local sys.modules.pop(...) + delattr(parent, child) blocks.
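The behavior listed above can be sketched as follows. This is an illustrative reimplementation under the stated description, not the helper's actual source; clear_module_sketch and the demo_pkg names are hypothetical.

```python
import sys
import types


def clear_module_sketch(name):
    """Drop sys.modules[name] and the matching attribute on its parent package."""
    sys.modules.pop(name, None)
    parent_name, _, child = name.rpartition(".")
    parent = sys.modules.get(parent_name) if parent_name else None
    if parent is not None and hasattr(parent, child):
        delattr(parent, child)


# Build a throwaway package/module pair to show the effect.
pkg = types.ModuleType("demo_pkg")
mod = types.ModuleType("demo_pkg.child")
pkg.child = mod
sys.modules["demo_pkg"] = pkg
sys.modules["demo_pkg.child"] = mod

clear_module_sketch("demo_pkg.child")
```

Clearing the parent attribute matters because import demo_pkg.child followed by attribute access on the package would otherwise still reach the stale module object even after the sys.modules entry is gone.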

`tests.helpers.import_state.preserve_import_state`

Use when a test temporarily installs stubs into sys.modules and needs deterministic cleanup afterward.

Context manager: restores both sys.modules entries and parent-package attributes on exit (normal or exception).
Useful around module-level stubs or temporary imports.
Prefer narrow, explicit module names over broad ones.
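A minimal sketch of such a context manager, assuming it takes explicit module names, is shown below. The name preserve_import_state_sketch, its signature, and the demo module name are assumptions; the real helper may snapshot state differently.

```python
import contextlib
import sys
import types

_MISSING = object()


@contextlib.contextmanager
def preserve_import_state_sketch(*names):
    """Snapshot sys.modules entries and parent attributes; restore them on exit."""
    saved = {}
    for name in names:
        parent_name, _, child = name.rpartition(".")
        parent = sys.modules.get(parent_name) if parent_name else None
        attr = getattr(parent, child, _MISSING) if parent is not None else _MISSING
        saved[name] = (sys.modules.get(name, _MISSING), parent, child, attr)
    try:
        yield
    finally:  # runs on normal exit and when the body raises
        for name, (module, parent, child, attr) in saved.items():
            if module is _MISSING:
                sys.modules.pop(name, None)
            else:
                sys.modules[name] = module
            if parent is not None:
                if attr is _MISSING:
                    if hasattr(parent, child):
                        delattr(parent, child)
                else:
                    setattr(parent, child, attr)


with preserve_import_state_sketch("demo_stub_module"):
    sys.modules["demo_stub_module"] = types.ModuleType("demo_stub_module")
    inside = "demo_stub_module" in sys.modules
```

After the with block, the stub installed inside it is gone again. Listing narrow, explicit names keeps the snapshot small and makes it obvious which modules a test is allowed to disturb.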

`tests.helpers.import_state.clear_fake_database_modules`

Use only for the guarded fake/stub database cleanup pattern.

Preserves a real-looking core.database (one with a string __file__).
Removes a fake/stub core.database and the related src.database state.
Do not use as a general database reset fixture.
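The guard can be illustrated with the sketch below. It assumes that "related src.database state" means the cached sys.modules entry, which is a simplification; the function names are hypothetical and the real helper may clear more than this.

```python
import sys
import types


def looks_real(module):
    """A module counts as real-looking when its __file__ is a string."""
    return isinstance(getattr(module, "__file__", None), str)


def clear_fake_database_modules_sketch():
    """Evict core.database and src.database only when core.database is a stub."""
    module = sys.modules.get("core.database")
    if module is None or looks_real(module):
        return False  # nothing cached, or a real module: leave it alone
    for name in ("core.database", "src.database"):
        sys.modules.pop(name, None)
    return True


# A bare ModuleType has no __file__, so it is treated as a fake/stub.
sys.modules["core.database"] = types.ModuleType("core.database")
evicted = clear_fake_database_modules_sketch()
```

The guard is the point of the pattern: a real core.database that another test legitimately imported is never evicted, so the cleanup cannot be used to reset a real database module.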

`tests.helpers.import_state.clear_fake_endpoint_resolver_modules`

Use only for the guarded fake/stub src.endpoint_resolver cleanup pattern.

Preserves real resolver modules (those with a truthy __file__).
Evicts fake/stub resolver modules and the dependent route modules that were cached against them.
Accepts explicit extra dependent module names to evict alongside the defaults.

`tests.helpers.sqlite_db.make_temp_sqlite`

Use for the repeated file-backed temp sqlite setup in tests.

Only constructs (SessionLocal, engine, tmpfile) from the repeated block.
Does not patch modules and does not clean up the temp file.
The caller must bind SessionLocal explicitly onto whatever module the code under test reads, and must keep the returned objects alive.
Do not use it as a general DB fixture framework.

`tests.helpers.db_stubs.make_core_db_stub`

Use for small import-time core.database stubs with a placeholder SessionLocal.

Pass model names via models when MagicMock attributes are sufficient.
Pass attributes when an import needs exact placeholder values.
Set install_core_package=True only when the test also needs a fake parent core module stub.
Keep custom fake sessions and route-specific database behavior local.
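The options above suggest a helper shaped roughly like the sketch below. The signature, the returned (stub, modules) pair, and the DB_PATH attribute are assumptions for illustration; the real make_core_db_stub may install modules itself or return something different.

```python
import types
from unittest.mock import MagicMock


def make_core_db_stub_sketch(models=(), attributes=None, install_core_package=False):
    """Build a placeholder core.database module for import-time use only."""
    stub = types.ModuleType("core.database")
    stub.SessionLocal = MagicMock(name="SessionLocal")  # placeholder, not a real session
    for model in models:
        setattr(stub, model, MagicMock(name=model))  # MagicMock attributes suffice
    for key, value in (attributes or {}).items():
        setattr(stub, key, value)  # exact placeholder values
    modules = {"core.database": stub}
    if install_core_package:
        core = types.ModuleType("core")  # fake parent package
        core.database = stub
        modules["core"] = core
    return stub, modules


stub, modules = make_core_db_stub_sketch(
    models=["EmailAccount"], attributes={"DB_PATH": ":memory:"}
)
```

In this sketch the caller installs the returned modules explicitly, for example with monkeypatch.setitem(sys.modules, name, module) for each entry, so cleanup stays visible in the test that needs the stub.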

What not to abstract yet

Some remaining patterns should stay as-is for now rather than being forced into helpers:

Large mixed files such as security/review regression files.
Broad setup-oriented sys.modules stub installers.
One-off custom module patching.
Custom DB session, route, and app setup.

Validation expectations

Run validation locally before opening or approving a PR. Practical checks:

git diff --check - catch whitespace and conflict-marker errors.
python3 -m py_compile <changed files> - confirm changed files compile.
Focused pytest on the changed test files.
pytest on neighboring or order-sensitive test groups that share import state with the changed files.
grep for the old boilerplate when replacing it, to confirm no stragglers remain.
A fresh audit worktree when changing the helpers themselves, so stale __pycache__ or import state cannot mask a regression.

Current roadmap

Import-state cleanup - complete.
Document helper conventions (this file).
Pilot the repeated import-time core.database stub helper.
Add further tiny helpers only when the repeated semantics are clear.
Start low-risk file moves only after helper conventions are documented.
Avoid moving high-risk security/route regression files first.