docs: launch-readiness pass for the local inference feature#241

Open

quiet-node wants to merge 22 commits into

mainfrom

docs/local-inference-launch

quiet-node commented Jun 22, 2026

Owner

Overview

Aligns every user-facing word in the repo with the shipped local inference feature for the v0.15.0 release. The bundled built-in engine is the zero-setup default, with Ollama as the optional provider. This pass corrects stale copy, fills the documentation gap, and refreshes the in-app tips.

What changed

In-app copy

The model reasoning capability is named "Reasoning" consistently: the capability badge, the live streaming block ("Reasoning..." then "Reasoning"), the /think description, and the chat export.
Residency wording is unified on "Keep Warm" and "in memory" (Apple Silicon shares unified memory, so there is no separate VRAM).
The OCR-failure hint no longer says "via Ollama", the no-models cue drops "LLM", and onboarding no longer calls a model a "brain".

README

Leads with the built-in zero-setup story and names the Settings > Models surfaces: Library, Discover (Staff picks and Browse all), and Providers, plus Keep Warm.
Corrects the model-download pointer to Discover and the TypeScript badge.

Docs

New guides: Models and providers, Privacy, and Troubleshooting, linked from the README.
configurations.md, SECURITY.md, and the tuning and OCR guides are corrected for accuracy and made engine-agnostic. configurations.md drops the phantom MAX_IMAGES_PER_MESSAGE constant and documents the real, tunable image cap.
SECURITY.md documents the local inference posture: a localhost-only sidecar with the web UI disabled, bounded GGUF parsing, and model provenance from pinned repo revisions.

Tips

Stale Ollama tips are generalized to the built-in engine, with new tips for the model library, providers, vision, and reasoning.

Provider scope

The OpenAI-compatible provider (its UI is behind a dev flag) is intentionally not mentioned in user-facing copy for this release. The loader still accepts an existing kind = "openai" config for back-compat.

Testing

bun run test:all:coverage (frontend 100%, 2075 tests; backend 100% lines) and bun run validate-build both pass.

Notes

The README's demo videos and logo show older UI and need a human recapture (visuals cannot be verified headless). The CHANGELOG's hand-written Unreleased block carries the BREAKING migration notes, so only its factual contradiction was corrected here; how release-please folds it into the 0.15.0 section is a release-engineering choice left to the release.

quiet-node added 22 commits

June 21, 2026 19:19


          fix(copy): drop the stale via-Ollama hint from the OCR-failure message

b67f39f

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): name the model reasoning capability Reasoning and drop the…

8e93c4c

… LLM label

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): unify Providers residency wording on Keep Warm and in memory

ca9fca4

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs(readme): lead with the built-in engine and name the Models surfaces

31613f0

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs: correct the registry description to the Staff Picks catalog

1089c30

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): generalize tips to the built-in engine and unify residency…

a3b5c82

… wording

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): rename the live reasoning block and /think copy to Reasoning

509dbdf

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs(configurations): drop the OpenAI provider, fix the image cap, de…

670f4b9

…-Ollama shared limits

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs(security): document the local inference threat-model posture

bb78c6b

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs: correct stale release notes and the pre-major bump rule

dcd64a7

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs: make the tuning and OCR guides engine-agnostic

10ee785

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): replace arch jargon in the unsupported-model error

9c00d07

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs: add Models and Providers, Privacy, and Troubleshooting guides

e1cc509

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs(tips): add tips for the model library, providers, vision, and re…

e451845

…asoning

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): replace brain slang in onboarding and scrub OpenAI from th…

d52e766

…e search guide

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): rename the Always thinks badge to Always reasons

f8dc75c

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          refactor: rename the reasoning UI identifiers from thinking to reasoning

15a3306

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          style: prettier-format the renamed reasoning test file

342599d

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix(copy): drop the parenthetical from the built-in provider label

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          fix: heal the built-in provider label on load so existing configs update

f458ecc

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs(configurations): present the built-in engine as the default and …

750edd4

…drop migration history

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>


          docs: trim CLAUDE.md and keep the built-in engine framing front and c…

ac56fd4

…enter

Signed-off-by: Logan Nguyen <lg.131.dev@gmail.com>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet