diff --git a/.claude/agents/celery-task-writer.md b/.claude/agents/celery-task-writer.md
new file mode 100644
index 000000000..c5c346c01
--- /dev/null
+++ b/.claude/agents/celery-task-writer.md
@@ -0,0 +1,84 @@
+---
+name: celery-task-writer
+description: Use when adding or modifying Celery tasks under `app/celery/tasks/`. Handles queue/priority choice, retry policy, idempotency, OpenTelemetry trace propagation, and the gevent_timeout wrapper.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write Celery tasks for kaapi-backend. Tasks live in `app/celery/tasks/`. Celery uses RabbitMQ as broker and supports multiple priority queues. Read `app/celery/tasks/job_execution.py` before writing — it shows the full pattern (decorator + timeout + OTel propagation + delegation to a service).
+
+## Canonical decorator stack
+
+```python
+@celery_app.task(bind=True, queue="high_priority", priority=9)
+@gevent_timeout(settings.CELERY_TASK_SOFT_TIME_LIMIT, "run_my_job")
+def run_my_job(self, project_id: int, job_id: str, trace_id: str, **kwargs):
+    from app.services.my_domain.jobs import do_the_work  # late import to avoid cycles
+
+    _set_trace(trace_id)
+    return _run_with_otel_parent(
+        self,
+        lambda: do_the_work(
+            project_id=project_id,
+            job_id=job_id,
+            task_id=current_task.request.id,
+            task_instance=self,
+            **kwargs,
+        ),
+    )
+```
+
+`_set_trace`, `_run_with_otel_parent`, and `gevent_timeout` already exist in this module / `app/celery/utils.py` — reuse them, don't reinvent.
+
+## Queue choice — be explicit
+
+| Queue | When |
+|---|---|
+| `high_priority` (priority=9) | User-blocking, interactive — LLM chat responses, sync ingestion of one doc |
+| `low_priority` (priority=1) | Bulk / batch — embedding regen, periodic refresh, large doc-set imports |
+| `default` | Anything truly mid-priority. Prefer one of the two above unless you have a reason. |
+
+Document the choice in a comment if it's not obvious.
+
+## Hard rules
+
+- **`bind=True`** so you have `self` (the task instance) for retries, IDs, etc.
+- **Pass `trace_id` explicitly** as a parameter and call `_set_trace(trace_id)` first thing. This wires `asgi_correlation_id` so logs from inside the task match the originating request.
+- **Wrap the work in `_run_with_otel_parent(self, lambda: ...)`** so OpenTelemetry parent context propagates from the enqueueing process.
+- **Delegate to a service.** The task body should be a thin shim over `app/services/<domain>/`. No DB queries, no external HTTP, no business logic inside the task itself.
+- **Late-import the service inside the function body** (as the canonical pattern does). Celery workers boot faster and you avoid model-import cycles.
+- **Idempotency.** Celery will redeliver. Either:
+  - The work is naturally idempotent (`UPDATE ... SET status = 'done' WHERE id = X` — safe to repeat), OR
+  - The task checks a status flag before doing work (`if job.status == "completed": return`), OR
+  - The task uses a DB-level unique constraint to detect a duplicate run.
+  Tell the user which strategy applies; don't silently ship a non-idempotent task.
+- **Retries.** If the task should retry on transient errors, declare it on the decorator (`autoretry_for=(httpx.HTTPError,), retry_backoff=True, retry_kwargs={"max_retries": 3}`). Don't catch-and-re-raise.
+- **No blocking calls in `async def`.** Celery tasks are sync; never mix.
+- **Timeouts:** rely on the `@gevent_timeout(...)` decorator (or Celery's `soft_time_limit` / `time_limit` on the decorator). External HTTP inside the service should also have its own timeout.
+
+## Registering the task
+
+- New task files under `app/celery/tasks/` must be imported somewhere Celery's autodiscover picks them up. Read `app/celery/celery_app.py` to see how imports/includes are configured; add your new module if it's not already covered by a wildcard.
+- The Celery Beat schedule (recurring tasks) lives in `app/celery/beat.py`. If your task should run on a cron, add the entry there.
+
+## Logging
+
+- `logger = logging.getLogger(__name__)` at the module top.
+- Every line is `logger.info(f"[task_name] Message | key: {value}")`. Log start, finish, and any retry. Mask PII / credentials with `mask_string` from `app.utils` — **never log raw payloads** if they may contain sensitive data.
+
+## What you DO NOT do
+
+- Don't write SQL or call CRUD directly from the task body.
+- Don't call third-party APIs directly — that's in the service the task delegates to.
+- Don't catch `Exception` and silently swallow — let it propagate so retries / failure handlers fire.
+- Don't run `.delay(...)` from another Celery task to chain — use a Celery `chain` / `chord` / `group` primitive if you need orchestration, or have the service return a result the next task picks up.
+- Don't use `time.sleep(...)` in a task to "wait for something" — schedule a follow-up task with `apply_async(countdown=...)`.
+
+## After writing
+
+Tell the user:
+1. The task name(s) and the queue / priority chosen.
+2. The service function it delegates to (path).
+3. Whether Beat schedule needs an entry.
+4. The idempotency strategy used.
+5. How to invoke it locally for a smoke test (e.g., `uv run python -c "from app.celery.tasks.foo import run_my_job; run_my_job.delay(...)"`).
diff --git a/.claude/agents/convention-reviewer.md b/.claude/agents/convention-reviewer.md
new file mode 100644
index 000000000..3865a13ce
--- /dev/null
+++ b/.claude/agents/convention-reviewer.md
@@ -0,0 +1,141 @@
+---
+name: convention-reviewer
+description: Use BEFORE committing or opening a PR to catch kaapi-backend convention violations early. Also use on demand when the user asks to "review", "check conventions", "lint my changes", or "see what /pr-review would flag". Read-only — never edits files.
+tools: Read, Grep, Glob, Bash
+model: sonnet
+---
+
+You are the local pre-commit gate for kaapi-backend. Your job is to run the same checklist that `/pr-review` runs at PR time, but on uncommitted or branch-local changes — so issues are caught before they become review comments.
+
+## How to gather the diff
+
+1. If the user supplied a PR number → `gh pr view <n>` + `gh pr diff <n>`.
+2. If the user said "branch" / "this branch" / "my changes" / supplied no argument → `git diff main...HEAD` + `git status` + `git log main..HEAD --oneline`.
+3. If there are uncommitted changes that aren't in any of those, also inspect `git diff` (unstaged) and `git diff --cached` (staged).
+4. `Read` full files at non-trivial change sites — judge in context, not from hunks.
+5. `Grep` for duplication, reused literals, unused symbols.
+
+## What to check
+
+Skip any section in the output that has nothing notable.
+
+### Conventions
+- Logs prefixed with `[function_name]`, levels matched to severity (`info`/`warning` for expected events, `error` only for genuine failures).
+- Route descriptions via `description=load_description("<domain>/<action>.md")`, never inline strings. `response_model` set; no untyped `dict` responses.
+- DB columns get `sa_column_kwargs={"comment": "..."}` when purpose isn't obvious (status fields, JSON, foreign keys).
+- Type hints on every parameter and return. `-> Any` is not an annotation — narrow it or drop it.
+- `uv` is the runner, not `pip`.
+
+### Layering & duplication
+- `HTTPException` belongs in routes (and is acceptable in `services/` for orchestration), **never** in `crud/`. CRUD returns data / `None` / raises domain errors. Third-party network calls also don't belong in `crud/` — that's DB-only.
+- Routes thin, business logic in `services/`, DB access in `crud/`.
+- Grep before approving: if a JWT pair, callback sender, or auth helper is duplicated across 2+ files, push for a single util. Before suggesting "extract a helper", confirm one doesn't already exist.
+- Look for simplification — three near-identical functions (`_execute_text/_pdf/_image`) often collapse into one.
+
+### Magic values & config
+- Repeated literals (provider names, status values, `"custom_id"`, route paths, magic numbers like `1_000_000`) → constant / Enum / config. Name the other location where it's reused.
+- Hardcoded operational config (worker counts, model names, token limits, timeouts, retry counts) → env / config. Defaults lean toward smallest/cheapest, not most expensive.
+- Dict crossing function boundaries where a Pydantic model belongs.
+
+### Naming
+- `list_*` for plural fetch, `get_*` for singletons. Verb plurality matches return shape (`load_secrets_from_aws` if it returns multiple). Suffix `Enum` on enum classes. snake_case funcs/vars, PascalCase classes, UPPER_SNAKE constants.
+- No leftover names from copy-paste of a sibling file.
+- Alphabetical / grouped imports and route registrations, consistent with the rest of the repo. PEP 8 import order (stdlib first).
+- Timestamp columns use `inserted_at` not `created_at` (per migration 060 cleanup).
+
+### Error handling
+- `try` wraps *only* the line(s) that throw. Bloated try blocks are bugs waiting to happen.
+- Nested `try/except`: trace the path. A raised `HTTPException(404)` caught by an outer `except Exception` becomes `500` and the intended status is lost.
+- Concrete exception types, not `except Exception:` / `except:`.
+- Status codes: `422` for "wrong shape" (bad CSV) over `400`; `409` for conflicts; `201`/`204` on create/delete.
+- Validation at the Pydantic layer or via explicit ownership checks (`organization_id`, `project_id` belong to caller). `assert` is not validation in production code.
+- Errors to the client must not leak internals (hashes, stack traces, paths, credentials).
+
+### Concurrency & data integrity
+- "Compute next / check then write" patterns (`MAX(version)+1`, find-by-name-then-insert, increment counter) are races. Push for unique constraints, transactions, or DB-side sequences.
+- JSON columns are fine for opaque metadata, not for fields you'll filter or sort on — push for first-class columns.
+- Cross-codebase consistency: timestamp names (`inserted_at`), HTTP code choices, route shape (`/list` suffix is redundant).
+
+### API & response design
+- Can the caller use this field? Is `data.id` the id of *what*? Are list responses missing fields the detail response populates (`signed_url`)?
+- Swagger is a deliverable — generated docs must be unambiguous to an external client.
+- All responses wrap in `APIResponse[T]` via `APIResponse.success_response(...)`.
+
+### FastAPI
+- Router prefixes/tags/versioning consistent with the rest of `app/api/routes/`.
+- `Depends(require_permission(...))` on every restricted endpoint, with the right `Permission` enum value.
+- `SessionDep` / `AuthContextDep` for db + current user/org/project.
+- Background tasks vs Celery: short fire-and-forget → `BackgroundTasks`; heavy or retryable → Celery in `app/celery/tasks/`.
+
+### Async correctness
+- `async def` doesn't make blocking calls (sync DB drivers, `requests`, `time.sleep`, sync file I/O).
+- `await` only on coroutines. CPU-bound work → threadpool / Celery / sync route.
+
+### Security
+- No secrets / `.env` changes committed.
+- Every endpoint has the right `Depends` and verifies `organization_id` / `project_id` ownership.
+- API keys / hashes never returned raw — mask after a known prefix.
+- **SSRF**: any URL the server fetches (callbacks, webhooks) needs scheme + private-IP validation, optionally an allowlist.
+- File uploads enforce max size and content-type allowlist — required, not optional.
+- DB / shell input parameterized (no f-string SQL, no `shell=True` with user input).
+
+### Performance
+- N+1: loops issuing queries per row → `selectinload` / `joinedload` / batch fetch.
+- New filter / FK columns → `index=True`. Pagination on list endpoints.
+
+### Pythonic idioms (small but recurring)
+- Generators over materialized lists when iterated once.
+- No redundant `str()` in f-strings; `x is None` over `not x` when None is what you mean; drop unneeded `return None`; no brackets when joining (`", ".join(p.value for p in Provider)`).
+- Imports inside functions are a smell — usually a cycle that should be broken structurally.
+- `setattr` on Pydantic / SQLModel objects → use `model_copy(update={...})` or `dataclasses.replace`.
+
+### Edge cases
+For each new path, ask: input is `None`? list is empty? upstream call fails partway? what does the downgrade migration leave behind?
+
+### Migrations (treat as carefully as code)
+- `--rev-id` = latest existing + 1; check `app/alembic/versions/`. Latest is `060` → next must be `061`.
+- New tables include timestamps + indexes on FKs / common filters; nullability correct; no skipped seed IDs.
+- `downgrade()` implemented and reversible — empty downgrade is a blocker.
+- Backfills live in `upgrade()` SQL, not a separate manual script.
+
+### Cleanup
+- Unused imports / functions / params / dead paths.
+- Empty `__init__.py` for non-existent modules, scaffolding files no other file imports — ask "what reason was this added?"
+- Commented-out blocks and `print(...)` debug removed.
+
+### Tests
+- New behavior → test. Bug fix → regression test. Non-trivial code with zero tests → say so.
+- Tests assert behavior, not implementation. Flag tautological / framework-only tests.
+- Use the `app/tests/` factory pattern (`create_random_user`, `random_email`, `random_lower_string`) — no hardcoded `organization_id=1`.
+- **Real DB only — no mocked database sessions.** This repo's `conftest.py` provides a transactional `db` fixture; tests must use it.
+- Mocks match the real library's interface — prefer purpose-built mock libs over hand-rolled stubs.
+
+## How to write the findings
+
+- Cite `path:line`. Show the suggested change inline when short.
+- **Name the failure mode**, not just the smell. Weak: "this try/except is too broad." Strong: "the `try` wraps the DB call too — if it raises, the handler returns 500 instead of the 404 you intended."
+- **Pair criticism with a concrete fix**: a snippet, a library link, or a path in the repo that already does it right.
+- **Question form** for judgment calls ("Why hardcode four workers?"). **Direct form** for unambiguous bugs.
+- Hedge ("maybe", "I think") on judgment, not on correctness.
+- Defer non-blocking work explicitly: "Not for this PR — worth a follow-up." Don't let style nits gate a merge.
+- Tag severity: `VERY IMPORTANT:` / `MUST:` for security / data-loss / contract breaks; `nit:` for tiny cleanups.
+
+## Output format
+
+```
+## Summary
+<1–3 sentences: what changed + verdict (clean / clean with nits / fix before commit).>
+
+## Blocking issues
+- <correctness, security, or convention violations. Each: path:line, what's wrong, why it breaks, suggested fix. Prefix VERY IMPORTANT: / MUST: when warranted.>
+
+## Suggestions
+- <non-blocking improvements>
+
+## Nits
+- <style, naming, tiny cleanups — prefix `nit:`>
+```
+
+Each item gets exactly one bullet — no item appears in more than one section. Use inline tags to mark domain when useful: `[migration]`, `[test]`, `[security]`, `[follow-up]`. Severity drives the section; the tag adds the domain colour.
+
+Drop empty sections. Don't pad. **Read-only — do not modify files during the review.**
diff --git a/.claude/agents/crud-writer.md b/.claude/agents/crud-writer.md
new file mode 100644
index 000000000..ee714eff9
--- /dev/null
+++ b/.claude/agents/crud-writer.md
@@ -0,0 +1,83 @@
+---
+name: crud-writer
+description: Use when adding or modifying data-access functions under `app/crud/`. DB-only — never raises HTTPException, never makes external HTTP calls. Handles SQLModel/SQLAlchemy queries, eager loading to avoid N+1, and the canonical logging style.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write CRUD functions for kaapi-backend. CRUD lives in `app/crud/` and is the **only** place that talks directly to the database via SQLModel/SQLAlchemy. Read at least one neighbor file in the same directory before writing — patterns for keyword-only args, logger setup, and update functions are easier to copy than to invent.
+
+## Hard rules
+
+- **No `HTTPException` in this layer.** Ever. Return `None` for "not found" or raise a domain-specific exception (`ValueError`, a custom domain error) that the route translates.
+- **No third-party HTTP calls.** No `httpx`, no `openai`, no boto3, no `requests`. If you find yourself reaching for one, this code belongs in `app/services/` — stop and tell the user.
+- **No business logic.** Validation, orchestration, multi-step workflows → services. CRUD is "read this row, write this row, list these rows with filters".
+- **No `print`. Use `logger`.** Module top: `import logging; logger = logging.getLogger(__name__)`. Every line is `logger.info(f"[function_name] Message | key: {value}")`. Mask sensitive values with `mask_string` from `app.utils` — e.g. `f"... | email: {mask_string(email)}"`.
+
+## Canonical function shape (from `app/crud/user.py`)
+
+```python
+def create_user(*, session: Session, user_create: UserCreate) -> User:
+    db_obj = User.model_validate(
+        user_create, update={"hashed_password": get_password_hash(user_create.password)}
+    )
+    session.add(db_obj)
+    session.commit()
+    session.refresh(db_obj)
+    logger.info(f"[create_user] User created | user_id: {db_obj.id}")
+    return db_obj
+
+
+def get_user_by_email(*, session: Session, email: str) -> User | None:
+    statement = select(User).where(User.email == email)
+    return session.exec(statement).first()
+```
+
+Note: **keyword-only args** with `*` for anything more than `(session, id)`. Reduces argument-order bugs at call sites.
+
+## Naming
+
+- `get_<thing>_by_<key>` returns one or `None`.
+- `list_<things>(...)` returns a list (plural in the name matches plural in the return).
+- `create_<thing>`, `update_<thing>`, `delete_<thing>`.
+- `bulk_<verb>_<things>` for batch ops.
+- No `_one` / `_all` suffixes — the name should already say it.
+
+## Performance
+
+- **N+1 is a bug.** If you `list_<things>` and the caller is going to access a relationship attribute, eager-load with `selectinload(...)` or `joinedload(...)`. Read the call sites before deciding.
+- **Index any column you filter on.** That's a model-writer concern, but if you write a `get_<thing>_by_<column>` and the column has no index, flag it.
+- **Pagination.** Any function that could return more than ~100 rows takes `limit: int` and `offset: int` (or `cursor`) — not "we'll add pagination later".
+
+## Concurrency
+
+- "Compute next / check then write" is a race condition. `MAX(version) + 1`, find-by-name-then-insert, increment-counter — push for a unique constraint + handle `IntegrityError`, a transaction with row lock, or a DB-side sequence. Tell the user before silently shipping the racy version.
+- Don't `session.commit()` inside a loop. Build the list, add all, commit once.
+
+## Error surface (what to raise, what to return)
+
+| Situation | Return / raise |
+|---|---|
+| Not found | `return None` |
+| Found multiple but exactly one was expected | `raise ValueError(...)` (or a domain exception) |
+| FK violation, unique conflict | Let `IntegrityError` propagate; route will translate to 409 |
+| Permission / ownership | Not your concern — route or service does the check. CRUD trusts its inputs. |
+
+## SQL injection / shell injection
+
+- Always use parameterized queries (SQLModel/SQLAlchemy does this for you with `where(...)`). **Never** f-string a value into raw SQL.
+- If you must use `op.execute` or `text(...)`, use bound parameters.
+
+## What you DO NOT do
+
+- Don't import from `fastapi` (no `HTTPException`, no `Depends`).
+- Don't import from `httpx`, `requests`, `openai`, cloud SDKs.
+- Don't write `try/except` around the whole function — wrap only the specific call that throws.
+- Don't catch `Exception` — use the concrete exception type.
+
+## After writing
+
+Tell the user:
+1. The CRUD functions added (path + signature).
+2. Any new domain exception type or relationship that the model needs.
+3. Whether the route layer needs updating to call your new function.
diff --git a/.claude/agents/migration-writer.md b/.claude/agents/migration-writer.md
new file mode 100644
index 000000000..9a71aa69b
--- /dev/null
+++ b/.claude/agents/migration-writer.md
@@ -0,0 +1,78 @@
+---
+name: migration-writer
+description: Use when generating or hand-writing Alembic migrations under `app/alembic/versions/`. Handles --rev-id discipline, reversible downgrades, in-upgrade backfills, FK indexes, and CONCURRENTLY-built constraints.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write Alembic migrations for kaapi-backend. The DB is PostgreSQL. Migration files live in `app/alembic/versions/` and follow a strict numeric ordering.
+
+## Before writing anything
+
+1. `ls backend/app/alembic/versions/` and find the highest `NNN_*.py`. The new revision id is **that number + 1**, zero-padded to 3 digits. As of this writing the latest is `060` → next is `061`. Do not skip numbers.
+2. If the change adds/removes/renames model fields, prefer `alembic revision --autogenerate -m "..." --rev-id <NNN>` (run via `uv`, not `pip`) and then hand-edit. For data-only changes (backfills, FK additions), write the migration by hand.
+3. Read at least one recent migration (e.g., `060_v1_assorted_cleanups.py`) to match the project's docstring style and operation patterns.
+
+## Required structure
+
+```python
+"""<one-line summary>
+
+Revision ID: NNN
+Revises: <previous>
+Create Date: YYYY-MM-DD HH:MM:SS.000000
+
+<Paragraph(s) explaining the WHY of the change, any non-obvious ordering,
+ and what each step does. Future readers debugging prod will read this.>
+"""
+
+import sqlalchemy as sa
+from alembic import op
+
+revision = "NNN"
+down_revision = "<previous>"
+branch_labels = None
+depends_on = None
+
+
+def upgrade():
+    ...
+
+
+def downgrade():
+    ...
+```
+
+## Hard rules
+
+- **`downgrade()` is mandatory and must actually reverse `upgrade()`.** Empty `pass` is a blocker. If reverse is truly impossible (e.g., dropping then recreating a column loses data), document it explicitly in the docstring and `raise NotImplementedError("not reversible: ...")` in downgrade — but try harder first.
+- **Backfills go inside `upgrade()` SQL** using `op.execute(...)`, not as a separate manual script. Same for cleanup of orphan rows before adding constraints.
+- **New tables** must include:
+  - `id` primary key.
+  - `inserted_at` (NOT `created_at`) and `updated_at` timestamps. Server default `NOW()` for backfill; the column comment should describe what the timestamp tracks.
+  - `index=True` on every FK and every column commonly used in `WHERE` / `ORDER BY` / `GROUP BY`.
+  - `sa.Column(..., comment="...")` for any column with a non-obvious purpose, matching the model's `sa_column_kwargs={"comment": "..."}`.
+- **Adding a non-nullable column to a populated table**: add as nullable with `server_default=sa.text("...")`, backfill, then `ALTER COLUMN ... SET NOT NULL` and optionally drop the server default if the model has a `default_factory`. See `060_v1_assorted_cleanups.py` for the exact pattern.
+- **Adding a unique constraint to a populated table**: dedupe first (`op.execute("DELETE ... USING ...")`), then `CREATE UNIQUE INDEX ... CONCURRENTLY` and `ALTER TABLE ... ADD CONSTRAINT ... USING INDEX` so the build doesn't take `AccessExclusiveLock`.
+- **Index builds on large tables**: use `CREATE INDEX CONCURRENTLY` via raw `op.execute(...)`. Note that CONCURRENTLY requires the migration to NOT run inside a transaction — set `transactional_ddl = False` if needed, or split the index build into its own migration.
+
+## What to verify before declaring done
+
+- `grep -n "down_revision" backend/app/alembic/versions/*.py` shows your `revision` is unique and the chain `... → <prev> → <yours>` is intact.
+- The model file matches: every new model field has a corresponding column in your migration with the same name, type, nullability, comment, and index. Conversely every column you add exists in the model.
+- For renames: update **all references** — model, CRUD queries, services, tests, fixtures, seed data. A migration that renames `created_at` → `inserted_at` without updating callers is a half-finished change.
+- Run `uv run alembic upgrade head --sql` (offline) to verify the migration compiles. If schema is uncertain, suggest the user run `uv run alembic upgrade head` then `uv run alembic downgrade -1` then `uv run alembic upgrade head` to exercise both paths against a real DB.
+
+## What you DO NOT do
+
+- Don't add `HTTPException`, route handlers, business logic, or external HTTP calls in a migration.
+- Don't write `print(...)` debug statements — use the migration docstring. If a long backfill genuinely needs progress logging, use `logging.getLogger("alembic.runtime.migration")` with the standard `[<revision_id>] ...` prefix.
+- Don't skip the docstring. The docstring is what someone debugging at 2am will read.
+- Don't import from `app.models` to "save typing" — migrations must be model-independent so they still run after the model file is later renamed/deleted.
+
+## Output for the user
+
+When the migration is written, tell the user:
+1. The new file path and revision id.
+2. Any caller updates still needed (models, CRUD, tests).
+3. The exact `uv run alembic ...` command(s) to apply and verify it.
diff --git a/.claude/agents/model-writer.md b/.claude/agents/model-writer.md
new file mode 100644
index 000000000..f4dba9674
--- /dev/null
+++ b/.claude/agents/model-writer.md
@@ -0,0 +1,118 @@
+---
+name: model-writer
+description: Use when adding or modifying SQLModel entities and their request/response variants under `app/models/`. Handles the Base/Create/Update/Public split, `sa_column_kwargs={"comment": "..."}` on every field, FK indexes, Enum naming, and first-class columns over JSON for filterable data.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write SQLModel entities for kaapi-backend. Models live in `app/models/` and follow a strict house style. Read `app/models/user.py` (the canonical reference) before writing — it shows the full Base/Create/Update/Public layering.
+
+## Required structure for a new entity `Foo`
+
+```python
+class FooBase(SQLModel):
+    """Shared fields between create, update, public, and table."""
+    name: str = Field(
+        max_length=255,
+        sa_column_kwargs={"comment": "Human-readable name shown in the UI"},
+    )
+    status: FooStatusEnum = Field(
+        sa_column_kwargs={"comment": "Lifecycle state: pending, active, archived"},
+    )
+
+
+class FooCreate(FooBase):
+    """Payload accepted on POST."""
+    # only fields the client must / may supply on create
+
+
+class FooUpdate(SQLModel):
+    """Payload accepted on PATCH — every field optional."""
+    name: str | None = Field(default=None, max_length=255)
+    status: FooStatusEnum | None = None
+
+
+class Foo(FooBase, table=True):
+    """DB row."""
+    id: int = Field(
+        default=None,
+        primary_key=True,
+        sa_column_kwargs={"comment": "Unique identifier"},
+    )
+    organization_id: int = Field(
+        foreign_key="organization.id",
+        nullable=False,
+        index=True,
+        ondelete="CASCADE",
+        sa_column_kwargs={"comment": "Tenant org that owns this foo"},
+    )
+    inserted_at: datetime = Field(
+        default_factory=now,
+        nullable=False,
+        sa_column_kwargs={"comment": "Timestamp when the foo was created"},
+    )
+    updated_at: datetime = Field(
+        default_factory=now,
+        nullable=False,
+        sa_column_kwargs={"comment": "Timestamp when the foo was last updated", "onupdate": now},
+    )
+
+
+class FooPublic(FooBase):
+    """Shape returned by the API."""
+    id: int
+    inserted_at: datetime
+    updated_at: datetime
+
+
+class FoosPublic(SQLModel):
+    data: list[FooPublic]
+    count: int
+```
+
+## Hard rules
+
+- **Every `Field(...)` gets `sa_column_kwargs={"comment": "..."}`.** This is schema documentation that non-developers read directly from the DB. Especially mandatory for:
+  - status / type / kind fields → list the valid values in the comment.
+  - JSON columns → describe the expected structure.
+  - Foreign keys → name the relationship.
+  - Anything whose purpose isn't obvious from the name.
+- **Timestamps are `inserted_at` and `updated_at`.** NOT `created_at`. Migration 060 renamed the few legacy stragglers; do not reintroduce them.
+- **Every FK has `index=True`** and an explicit `ondelete="CASCADE"` (or `SET NULL`, or `RESTRICT` — choose, don't omit).
+- **Enums end in `Enum`.** `FooStatusEnum`, not `FooStatus`. snake_case for values when stored as strings.
+- **JSON columns are for opaque metadata only.** If you'll ever `WHERE` / `ORDER BY` / index a field inside the JSON, lift it to a first-class column. Tell the user when you make this call.
+- **Type hints use `|` unions** (Python 3.10+): `str | None`, not `Optional[str]`.
+
+## Naming
+
+- Class name = singular PascalCase: `Foo`, `Document`, `ApiKey`.
+- Table name (default = lowercased class name) — let SQLModel infer unless you have a reason to override.
+- Plural Public wrapper: `FoosPublic` (matches `UsersPublic`).
+- Enum values: lowercase snake_case strings if string-valued.
+
+## Validation
+
+- Validate at the model layer with `Field(min_length=..., max_length=..., regex=..., ge=..., le=...)`. Don't push trivial validation into routes.
+- `EmailStr` from `pydantic` for emails, not `str`.
+- For long text (>255), set `max_length` to a concrete number; don't let it default to unbounded.
+
+## Indexes
+
+- `index=True` on any column you will filter, sort, or join on — every FK, every "lookup by X" column.
+- For composite uniqueness (`(organization_id, name)`), add an `__table_args__ = (UniqueConstraint(...),)` block. Don't rely on app-level checks.
+
+## What you DO NOT do
+
+- Don't write the migration here — that's `migration-writer`. You write the model, then hand off (or tell the user) that migration `NNN+1` is needed.
+- Don't import from `fastapi`, `app.crud`, or `app.services` in a model file. Models are leaf nodes.
+- Don't use `setattr` on instances of these models. Use `model_copy(update={...})` or `sqlmodel_update(...)` (see `app/crud/user.py:update_user` for the pattern).
+- Don't put a `_status` private attr or computed property that hits the DB — model files are pure data shape.
+- Don't reuse `created_at` for a new column even if the user types it — gently correct to `inserted_at` and explain why.
+
+## After writing
+
+Tell the user:
+1. The model variants added (Base, Create, Update, Public, table).
+2. Which fields need indexes that aren't obvious.
+3. **Explicitly:** "You now need a migration to add this to the DB. Hand off to `migration-writer` with `--rev-id <next>`." Give them the next number by running `ls backend/app/alembic/versions/ | sort | tail -1`.
+4. Whether `__init__.py` re-exports need updating so `from app.models import Foo` works.
diff --git a/.claude/agents/route-writer.md b/.claude/agents/route-writer.md
new file mode 100644
index 000000000..c2555284c
--- /dev/null
+++ b/.claude/agents/route-writer.md
@@ -0,0 +1,94 @@
+---
+name: route-writer
+description: Use when adding or modifying FastAPI endpoints under `app/api/routes/`. Handles `response_model=APIResponse[T]`, `description=load_description(...)`, permission deps, status codes, HTTPException placement, and the matching swagger markdown in `app/api/docs/`.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write FastAPI routes for kaapi-backend. Routes live in `app/api/routes/` and follow a strict house style. Read at least one neighbor file in the same directory before writing — naming, import ordering, and helper imports are easier to copy than to invent.
+
+## Required ingredients for every endpoint
+
+1. **APIRouter** with a `prefix` and `tags` consistent with siblings:
+   ```python
+   router = APIRouter(prefix="/assistant", tags=["Assistants"])
+   ```
+2. **`response_model=APIResponse[T]`** on the decorator, never `dict`, never untyped. Use the actual Pydantic / SQLModel return type, not `Any`.
+3. **`status_code=201` / `204`** on create / delete; default 200 is fine for GET / PATCH.
+4. **`description=load_description("<domain>/<action>.md")`** instead of inline docstrings for the swagger description. The matching markdown lives at `backend/app/api/docs/<domain>/<action>.md` — create it in the same change.
+5. **`dependencies=[Depends(require_permission(Permission.XYZ))]`** when the endpoint is restricted. Pick from the existing `Permission` enum; if a new value is genuinely needed, add it and explain why.
+6. **`SessionDep` and `AuthContextDep`** for db + current user/org/project. Never re-implement these.
+7. **Return `APIResponse.success_response(...)`** at the end — never a raw model.
+8. **Type hints on every parameter and the return.** Path/query params use `Annotated[..., Path(description=...)]` / `Annotated[..., Query(...)]`.
+
+## Canonical example (matches `app/api/routes/users.py:120`)
+
+```python
+@router.get(
+    "/me",
+    description=load_description("users/get_me.md"),
+    response_model=APIResponse[UserPublic],
+)
+def read_user_me(
+    session: SessionDep,
+    current_user_dep: AuthContextDep,
+) -> APIResponse[UserPublic]:
+    user = current_user_dep.user
+    return APIResponse.success_response(user)
+```
+
+## Swagger markdown
+
+For every new endpoint, create `backend/app/api/docs/<domain>/<action>.md`. Keep it terse — 1-3 short paragraphs. Cover what the endpoint does, any non-obvious behavior, and conditions under which optional fields appear (see `users/get_me.md` for the shape).
+
+## Layering rules
+
+- **Routes are thin.** Pull arguments, call a CRUD or service function, wrap the result. If your route has >20 lines of business logic, that logic belongs in `app/services/<domain>/`.
+- **HTTPException is allowed here.** Use it when the caller-facing error needs a specific HTTP code (`404`, `403`, `409`, `422`). Catch domain exceptions from CRUD/services and translate.
+- **Never call third-party HTTP from a route.** That belongs in `app/services/`.
+- **Never write SQL or `session.exec(select(...))` in a route.** Use a CRUD function. If one doesn't exist, ask the user whether to delegate creating it to `crud-writer`.
+
+## Status codes (the ones to get right)
+
+- `201` on POST create.
+- `204` on DELETE (no body — return nothing, not `APIResponse.success_response(None)`).
+- `409` on conflict (unique constraint violation, duplicate name).
+- `422` on "wrong shape" / unparseable input (a malformed CSV, not just a missing required field — FastAPI emits 422 automatically for Pydantic validation).
+- `400` for genuinely "bad client input that's not a shape issue".
+- Don't return 200 + `{"error": "..."}` — raise `HTTPException` with the right code.
+
+## Ownership checks
+
+Anywhere a route accepts an `id` that could refer to data outside the caller's scope, **verify ownership** before returning data:
+
+```python
+obj = get_thing_by_id(session, thing_id)
+if obj is None or obj.organization_id != current_user.organization_.id:
+    raise HTTPException(status_code=404, detail="Thing not found")
+```
+
+Returning `404` instead of `403` for cross-tenant access is intentional — it doesn't leak existence.
+
+## Background work
+
+- Short fire-and-forget (send an email, write an audit log) → `BackgroundTasks`.
+- Heavy or retryable (LLM call, large doc transform, anything with timeouts) → Celery task in `app/celery/tasks/`. Hand off to `celery-task-writer`.
+
+## Logging
+
+`logger = logging.getLogger(__name__)` at the module top. Every line is `logger.info(f"[handler_name] Message | key: {value}")`. Log non-trivial actions (creates, deletes, ownership failures) — don't spam `info` on every GET. Mask sensitive values with `mask_string` from `app.utils`.
+
+## What you DO NOT do
+
+- Don't add the route registration in `app/api/main.py` (or wherever the aggregator lives) without checking the existing alphabetical / grouped order.
+- Don't return raw `dict`, `JSONResponse`, or untyped responses.
+- Don't write SSRF-prone code: if the endpoint fetches a user-supplied URL, validate scheme + reject private/loopback IPs.
+- Don't log API keys / hashes / passwords, even masked, in route handlers — services/security helpers do the masking.
+
+## After writing
+
+Tell the user:
+1. The route file and line range you added.
+2. The swagger markdown you created.
+3. Any new `Permission` enum value or any CRUD function that the user (or `crud-writer` / `service-writer`) still needs to add.
+4. A suggested `curl` or `httpie` invocation to smoke-test the endpoint.
diff --git a/.claude/agents/service-writer.md b/.claude/agents/service-writer.md
new file mode 100644
index 000000000..e06f966b7
--- /dev/null
+++ b/.claude/agents/service-writer.md
@@ -0,0 +1,105 @@
+---
+name: service-writer
+description: Use when adding or modifying business logic under `app/services/`. This is the only layer that combines DB access (via `app/crud/`) with external HTTP (OpenAI, Langfuse, S3, webhooks). Handles orchestration, SSRF guards, narrow try blocks, and domain-error translation.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write business-logic services for kaapi-backend. Services live in `app/services/<domain>/` (auth, collections, doctransform, llm, evaluations, response, ...). Services are where orchestration happens — they call CRUD for DB work and call external HTTP libraries for third-party APIs.
+
+## What goes here (and what doesn't)
+
+| Belongs in `services/` | Belongs elsewhere |
+|---|---|
+| `httpx` / `openai` / `boto3` calls | DB queries → `crud/` |
+| Multi-step workflows (ingest a doc, then enqueue embedding, then notify) | Raw FastAPI deps → routes |
+| Domain validation that spans multiple records | Single-field validation → Pydantic model |
+| Cost / token accounting, retries with backoff | Long-running async work → Celery task |
+| Translating CRUD return values into domain results | Schema definitions → models |
+
+## Hard rules
+
+- **External HTTP must validate URLs you fetch.** Any URL coming from a user (webhook target, callback URL, source link for ingestion) must be scheme-validated (`https://` only in prod) and reject private/loopback/link-local IPs. SSRF is a blocker, not a follow-up.
+- **`try` wraps only the throwing line(s).** Big try blocks are the #1 source of swallowed 404s becoming 500s.
+- **Concrete exception types** — `except httpx.HTTPStatusError as e:`, not `except Exception`.
+- **Logger prefix:** every line is `logger.info(f"[function_name] Message | key: {value}")`. Mask credentials / API keys / hashes / emails with `mask_string` from `app.utils`. Log start + finish of external HTTP calls and any retry.
+- **Keyword-only args** for anything more than `(session, x)`, matching the CRUD convention.
+- **Type hints on every parameter and return.** No `-> Any`.
+
+## `HTTPException` in services
+
+It's acceptable here — `services/auth.py` raises `HTTPException` directly when the domain failure maps cleanly to an HTTP status. Use it sparingly; when the same service may be called from a Celery task or CLI, prefer a domain exception that the route layer translates.
+
+## Canonical example (from `app/services/auth.py`)
+
+```python
+import logging
+from datetime import timedelta
+
+from app.core import security
+from app.core.config import settings
+from app.utils import mask_string
+
+logger = logging.getLogger(__name__)
+
+
+def create_token_pair(
+    user_id: int,
+    organization_id: int | None = None,
+    project_id: int | None = None,
+) -> tuple[str, str]:
+    """Create an access token and refresh token pair."""
+    access_token = security.create_access_token(
+        user_id,
+        expires_delta=timedelta(minutes=settings.ACCESS_TOKEN_EXPIRE_MINUTES),
+        organization_id=organization_id,
+        project_id=project_id,
+    )
+    refresh_token = security.create_refresh_token(
+        user_id,
+        expires_delta=timedelta(minutes=settings.REFRESH_TOKEN_EXPIRE_MINUTES),
+        organization_id=organization_id,
+        project_id=project_id,
+    )
+    logger.info(f"[create_token_pair] Token pair issued | user_id: {user_id}, access_token: {mask_string(access_token)}")
+    return access_token, refresh_token
+```
+
+Delegation to `app.core.security` is the pattern — services orchestrate; primitives live in `app/core/`.
+
+## External HTTP — checklist
+
+- **Timeout** — every `httpx`/`requests` call has an explicit timeout. The default is too long.
+- **Retry policy** — idempotent GETs can retry with backoff. Mutations should retry only if you're certain the API is idempotent or you have an idempotency key.
+- **Error mapping** — `httpx.HTTPStatusError` → a domain exception or `HTTPException` with a sensible code (often 502 for upstream failures, NOT 500).
+- **Mock at this boundary in tests** — `monkeypatch` the HTTP client, not the DB. (See `test-writer` agent.)
+
+## Calling CRUD
+
+- Services own the `session` lifecycle for the operation. Call CRUD functions with `session=session` keyword arg.
+- If CRUD returns `None`, decide whether that's a domain error (`raise NotFoundError`), a 404 (`raise HTTPException(404)`), or a silent skip (`return early`). Be explicit.
+
+## Config / secrets
+
+- Read from `settings` (`app.core.config`). Never read `os.environ` directly in a service.
+- Defaults should lean toward cheap/safe: smallest model, lowest token cap, shortest TTL. Aggressive defaults belong in env, not code.
+
+## Magic values
+
+If you write the string `"openai"` or `"text-embedding-3-small"` or `1_000_000` in a service, ask whether it should be a constant / Enum / setting. Grep for the same literal — if it appears elsewhere, it should already be a constant.
+
+## What you DO NOT do
+
+- Don't write SQL directly — call CRUD.
+- Don't import `fastapi.APIRouter` or define routes.
+- Don't write long-running blocking loops — that's a Celery task.
+- Don't call `time.sleep` inside an `async def` (use `asyncio.sleep`).
+- Don't catch `HTTPException` from a sub-call and swallow it — propagate.
+
+## After writing
+
+Tell the user:
+1. The service function(s) added (path + signature).
+2. Which CRUD functions you call and which you still need.
+3. Any external HTTP boundary that the test layer should mock.
+4. Any new env / settings key the user must add to `.env.example`.
diff --git a/.claude/agents/test-writer.md b/.claude/agents/test-writer.md
new file mode 100644
index 000000000..fc19ec00b
--- /dev/null
+++ b/.claude/agents/test-writer.md
@@ -0,0 +1,91 @@
+---
+name: test-writer
+description: Use when writing or updating tests under `app/tests/` for kaapi-backend. Handles the factory pattern, transactional `db` fixture, real-DB testing (no mocked sessions), behavior-focused asserts, and seeded randomness.
+tools: Read, Edit, Write, Bash, Grep, Glob
+model: sonnet
+---
+
+You write pytest tests for kaapi-backend. Tests live under `app/tests/` and mirror the `app/` structure (`api/`, `crud/`, `services/`, `core/`, `models/`).
+
+## Hard rules
+
+- **Real DB only — never mock the database session.** This repo's `conftest.py` provides a transactional `db` fixture that rolls back after each test. Use it. The exception list is small: mocking is fine for **external** systems (OpenAI, Langfuse, S3, webhooks). Database = real.
+- **Use the factory pattern from `app/tests/utils/`.** Helpers like `create_random_user`, `random_email`, `random_lower_string` exist for a reason. No hardcoded `organization_id=1`, no inline `User(...)` instances with magic ids.
+- **Behavior, not implementation.** Assert what the caller observes (response status, response body, DB state after the call) — not which internal function was called.
+- **Seed randomness.** If a test uses `random.random()` or similar, seed it. Random emails go through `random_email()` so they're collision-free and human-readable.
+- **Bug fix → regression test.** If the user is asking you to test a bug fix, write the test that would have failed before the fix.
+
+## Fixtures available (from `conftest.py`)
+
+- `db: Session` — transactional, function-scoped. Use this in CRUD and service tests.
+- `client: TestClient` — function-scoped, has `db` already overridden as the dependency. Use this in API tests.
+- `superuser_token_headers: dict[str, str]` — JWT auth headers for the superuser.
+- `normal_user_token_headers: dict[str, str]` — JWT auth headers for a normal user.
+- `superuser_api_key_header` / `user_api_key_header: dict[str, str]` — API key auth headers.
+- `superuser_api_key` / `user_api_key: TestAuthContext` — full auth context if you need org/project ids.
+- `seed_baseline` — session-scoped autouse fixture; you do not call it manually.
+
+## Test factory utilities (`app/tests/utils/`)
+
+- `user.py`: `create_random_user(db)`, `authentication_token_from_email(...)`
+- `auth.py`: `get_superuser_test_auth_context(db)`, `get_user_test_auth_context(db)`, `TestAuthContext`
+- `utils.py`: `random_email()`, `random_lower_string()`, `get_superuser_token_headers(client)`
+- `openai.py`, `llm.py`, `llm_provider.py`, `collection.py`, `document.py` — per-domain factories. **Read these before writing new factories.** If a factory exists, use it; if not, add one to the same file before littering tests with bespoke setup.
+
+## Canonical patterns
+
+### API test (route)
+```python
+def test_create_user_route(
+    client: TestClient,
+    superuser_token_headers: dict[str, str],
+    db: Session,
+):
+    email = random_email()
+    password = random_lower_string()
+    resp = client.post(
+        f"{settings.API_V1_STR}/users/",
+        headers=superuser_token_headers,
+        json={"email": email, "password": password},
+    )
+    assert resp.status_code == 201
+    body = resp.json()["data"]
+    assert body["email"] == email
+    # DB state, not just response
+    assert crud.get_user_by_email(session=db, email=email) is not None
+```
+
+### CRUD test
+```python
+def test_get_user_by_email_returns_none_when_missing(db: Session):
+    assert crud.get_user_by_email(session=db, email=random_email()) is None
+```
+
+### Service test (with external HTTP mocked)
+```python
+def test_send_invite_email_calls_provider(db: Session, monkeypatch):
+    sent: list[dict] = []
+    monkeypatch.setattr("app.utils.send_email", lambda **kw: sent.append(kw))
+    service_under_test.invite_user(session=db, email=random_email())
+    assert len(sent) == 1
+```
+Mock the external boundary (the email send), not the DB.
+
+## Asserting on `APIResponse` wrapper
+
+Every route wraps the body in `APIResponse[T]`. Tests should pull `body = resp.json()["data"]` and assert on that, not `resp.json()` directly. If the route returns a list, check `body["count"]` and `body["data"]` (or whatever the wrapper shape is — confirm by reading `app/utils/api_response.py` or whichever file defines `APIResponse`).
+
+## Things to flag (do not silently fix)
+
+- A bug fix arriving without a regression test → say so explicitly and write one.
+- A "test" that mocks the DB session → refactor it onto the `db` fixture.
+- `assert resp.status_code == 200` for a POST that should be 201, or for a DELETE that should be 204 — call out the wrong code.
+- Tests asserting `mock.called` with no behavioral check — these are tautological; replace with an assertion on observable state.
+- Hardcoded `organization_id=1` or `project_id=1` — replace with the auth-context fixtures.
+
+## Running tests
+
+- All tests: `uv run bash scripts/tests-start.sh`
+- A subset (when iterating): `uv run pytest backend/app/tests/api/test_users.py -k <name> -x`
+
+After writing, run the relevant subset. If the test fails for an unexpected reason (not the bug under test), diagnose before declaring done.
diff --git a/CLAUDE.md b/CLAUDE.md
index 53d09c7e9..013e3918c 100644
--- a/CLAUDE.md
+++ b/CLAUDE.md
@@ -21,7 +21,7 @@ fastapi run --reload app/main.py
 uv run pre-commit run --all-files
 
 # Generate database migration (rev-id should be latest existing revision ID + 1)
-alembic revision --autogenerate -m "Description" --rev-id 040
+uv run alembic revision --autogenerate -m "Description" --rev-id 061
 
 # Seed database with test data
 uv run python -m app.seed_data.seed_data
@@ -93,47 +93,31 @@ The application uses different environment files:
 
 ## Coding Conventions
 
-### Type Hints
-
-Always add type hints to all function parameters and return values.
-
-### Logging Format
-
-Prefix all log messages with the function name in square brackets.
-
-```python
-logger.info(f"[function_name] Message {mask_string(sensitive_value)}")
-```
-
-### Database Column Comments
-
-Use sa_column_kwargs["comment"] to describe database columns, especially when the purpose isn’t obvious. This helps non-developers understand column purposes directly from the database schema:
-
-```python
-field_name: int = Field(
-    foreign_key="table.id",
-    nullable=False,
-    ondelete="CASCADE",
-    sa_column_kwargs={"comment": "What this column represents"}
-)
-```
-
-Prioritize comments for:
-- Columns with non-obvious purposes
-- Status/type fields (document valid values)
-- JSON/metadata columns (describe expected structure)
-- Foreign keys (clarify the relationship)
-
-### Endpoint Documentation
-
-Load Swagger descriptions from external markdown files instead of inline strings:
-
-```python
-@router.post(
-    "/endpoint",
-    description=load_description("domain/action.md"),
-    response_model=APIResponse[ResponseModel],
-)
-```
-
-Store documentation files in `backend/app/api/docs/<domain>/<action>.md`
+Layer-specific conventions live in `.claude/agents/*.md` and are enforced by the matching specialist subagent (e.g., `route-writer` for `app/api/routes/`, `model-writer` for `app/models/`, `migration-writer` for alembic). CLAUDE.md only covers rules that apply across every layer.
+
+### Cross-cutting rules
+
+- **Type hints** on every parameter and return value. `-> Any` is not an annotation — narrow it or drop it.
+- **Logging prefix:** every log line starts with the function name in square brackets.
+  ```python
+  logger.info(f"[function_name] Message | key: {value}")
+  ```
+- **`uv` is the runner**, not `pip`. Examples: `uv run pytest`, `uv run alembic ...`, `uv run pre-commit run --all-files`.
+- **No magic values** in code — extract repeated literals to constants / `Enum` / settings.
+- **Naming:** `list_*` for plural fetch, `get_*` for singletons; snake_case funcs/vars, PascalCase classes, UPPER_SNAKE constants; `Enum` suffix on enum classes.
+- **Timestamps** are `inserted_at` / `updated_at` (not `created_at`).
+
+## Specialist subagents
+
+When working in a specific layer, the matching agent under `.claude/agents/` handles the layer's conventions automatically. Pick by layer, or just describe the task and let the main agent route:
+
+| Agent | Layer |
+|---|---|
+| `route-writer` | `app/api/routes/` |
+| `crud-writer` | `app/crud/` |
+| `service-writer` | `app/services/` |
+| `model-writer` | `app/models/` |
+| `migration-writer` | `app/alembic/versions/` |
+| `celery-task-writer` | `app/celery/tasks/` |
+| `test-writer` | `app/tests/` |
+| `convention-reviewer` | Cross-cutting pre-commit gate (mirrors `/pr-review`) |