Muser

Accessible, local-first image search — with features pointed at asset generation and taste / curation for model training.

Muser indexes a folder of images, lets you search it by natural language, and treats that index as the substrate for two adjacent jobs: (1) finding source assets for generative work — image-to-image search, the selection cart, reverse-image lookup, NSFW / C2PA-provenance filters — and (2) curating training sets for personal LoRAs and aesthetic models — cluster exploration, aesthetic ranking via LAION-V2 and PickScore, selection/export. The core runs entirely on-device — no API keys, no cloud round-trip, no telemetry; the optional generate and caption paths are the only network reach (see below). The same SigLIP embeddings power semantic search and feed downstream scoring/curation so there's no second model pass for any of it.

Search with a text query like "a dog on a beach at sunset" and get back the closest-matching photos.

Three surfaces over one core (a warm local service that owns the model + index):

muser — a CLI to index, search, score, caption, cluster, benchmark.
muser serve — the embedded service + a browser search UI at http://127.0.0.1:7777.
muser-mcp — an MCP server (an MCP App) that searches and renders matches in an interactive gallery inside the host (e.g. Claude Desktop).

What's local vs. what's cloud

The core is genuinely local, offline, and $0 — embedding, indexing, semantic search, color search, dedup, AI-likelihood scoring, on-disk captions read-back. No keys, no network, no telemetry.

Two things reach the network and are opt-in:

Captioning (muser caption) calls OpenAI GPT-4o-mini (~$0.001–0.005/image).
Generate (the cart → image/3D/LoRA pipeline) calls fal.ai / OpenAI image models and costs money. Needs FAL_KEY / OPENAI_API_KEY.

The 3D viewer and the Explore point cloud load three.js / <model-viewer> from a CDN, so they need internet to render — both degrade gracefully offline (3D → download link, Explore viz → hidden; the data itself is still computed locally).

Features

Semantic search — text→image and image→image (drag/drop or picker) in one shared SigLIP space; folder-scoped, multi-concept comma queries (a, b, -c) with blend or intersection matching, lowercased for the case-sensitive tokenizer.
Color search — a separate local LAB-palette index; query by one or more hex swatches (all/any mode). No model load, $0.
AI-origin signals — two independent surfaces: a hard C2PA verdict (reads signed Content Credentials via optional c2patool; positive-only, a lower bound) and a soft AI-likelihood % (Community Forensics pixel model). Filter [All | Hide AI | Only AI].
Curation — aesthetic scoring (LAION-V2 / PickScore / HPS), cluster exploration, a selection cart, and zip export — pointed at building LoRA / aesthetic training sets.
Generate (cloud add-on) — cart → fal.ai image generation (Nano Banana / gpt-image / both / FLUX LoRA), BiRefNet cutout, prompt expansion, and image→3D (Meshy 6, or Hunyuan3D multiview). Live per-stage progress on a Jobs page.
Explore — 3D PCA point cloud of the library, colored by HDBSCAN cluster; SigLIP or opt-in DINOv2 space (finer, named clusters).
Reverse-image lookup — one-click, $0: copies the image to the clipboard and opens Google Lens (source) or TinEye (provenance); nothing leaves the machine until you paste.

Screenshots

Explore — clustered map	AI — C2PA-flagged origins

How it works

Images are embedded with the model's vision encoder at index time; text queries with its text encoder at search time. Both land in one shared space, so a description retrieves visually matching images (cosine over L2-normalized vectors). State lives in a single LanceDB at ~/.muser/db, one table per model. Default model: siglip2-b (Apache-2.0, strong quality/speed). See CLAUDE.md for architecture and REQUIREMENTS.md for scope.

Requirements

Python 3.12 and uv

Install

git clone https://github.com/connerkward/Muser
cd Muser
uv pip install -e .
# Apple Silicon only, optional MLX backend: uv pip install -e '.[mac]'

First index/search downloads the model weights once (siglip2-b ≈ 1 GB).

CLI usage

# Index a folder (recursive). Re-running is incremental (mtime-based; prunes deleted files).
# Indexing also auto-runs the color, C2PA, and AI-likelihood scans over the new subtree.
uv run muser index ~/Pictures/screenshots

# Search the whole index, or scope to a folder. Comma = multi-concept; leading - subtracts.
uv run muser search "a login screen with a blue button"
uv run muser search "sunset over water" --in ~/Pictures -k 5
uv run muser search "a car, -people" --match all

# Color search (local, $0). No --query → (re)build the palette index.
uv run muser color --query "#1e90ff" --in ~/Pictures

# Curation: aesthetic-score the index, explore clusters
uv run muser score
uv run muser cluster

# AI-origin: C2PA scan (needs c2patool) and soft AI-likelihood scan (torch+timm)
uv run muser detect --in ~/Pictures
uv run muser aiscore --in ~/Pictures        # or: aiscore -q <one-image>

# Captions for LoRA training sets (OpenAI, opt-in, costs money) → ~/.muser/captions.jsonl
uv run muser caption ~/Pictures/best

# List models / run the retrieval-eval benchmark
uv run muser models
uv run muser bench

# Launch the embedded service + web UI (http://127.0.0.1:7777), or the eval Gradio UI
uv run muser serve
uv run muser web

Run uv run muser --help for the full command list (also: reindex-metadata, uid). Paths accept ~ (expands per-OS) and forward slashes on every platform.

MCP usage

Add to your MCP host (e.g. Claude Desktop) config — the server is a console script, so no path or runtime prefix is needed:

{ "mcpServers": { "muser": { "command": "muser-mcp" } } }

search_images renders the interactive gallery as an MCP App (blend/sort slider, dedup badges, folder scope, reverse-image) — the host shows it inline in the chat and lets you expand it into the side panel:

Inline in the chat	Expanded into the side panel

_{Rendered via the @modelcontextprotocol/ext-apps host bridge in a Claude-style preview shell — the same MCP App component Claude Desktop loads.}

Tool	Purpose
`index_folder`	Index / re-index a folder of images.
`index_info`	Report index stats.
`search_images`	Search by description and open the gallery UI (an MCP App).

The MCP server is a thin HTTP client of muser serve and auto-spawns it if it's not already running. Run over HTTP instead of stdio with muser-mcp --http (http://127.0.0.1:8000/mcp).

Platform support

Core (index, search, serve, web, CLI, MCP) is cross-platform (macOS, Windows, Linux). The default install is portable; the mac extra (MLX backend) is Apple-Silicon -only and auto-skips elsewhere via environment markers. Hardware acceleration: CUDA on NVIDIA, Apple MPS on Apple Silicon, CPU otherwise (chosen automatically).

OS-integration niceties degrade gracefully when the platform tool is missing:

Feature	macOS	Windows	Linux
Reveal in file manager	`open -R`	`explorer /select,`	FileManager1 D-Bus → `xdg-open`
Copy image to clipboard (Lens button)	`osascript`	PowerShell (WinForms)	`wl-copy` / `xclip`
Native folder picker	`osascript`	`FolderBrowserDialog`	`zenity` / `kdialog`

On Linux, install xclip (X11) or wl-clipboard (Wayland) for the clipboard button, and zenity or kdialog for the native folder picker; otherwise type a path into the scope box.

Testing

A GitHub Actions matrix (.github/workflows/ci.yml) runs the test suite on ubuntu / windows / macos on every push: package install (incl. the Apple-only extra resolving as a no-op off Apple Silicon), per-platform MLX gating, the server-side clipboard branch (Linux runs under xvfb + xclip, so the real X11 path executes), _reveal, the CLI entrypoints, and a CPU end-to-end index+search. Run locally:

uv run --with pytest --with pytest-timeout python -m pytest tests/ -v

Known limitation

Folder-scoped search compares paths case-sensitively, but NTFS and APFS are case-insensitive — so scoping with different casing than what was indexed (e.g. C:\Users\Me\Photos vs a stored ...\me\photos) can return no results. The fix needs a normalized path-key column (a schema change + re-index); exact-case scoping works today.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 233 Commits
.changeset		.changeset
.github/workflows		.github/workflows
docs		docs
eval		eval
mcp-ts		mcp-ts
muser		muser
reports		reports
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
REQUIREMENTS.md		REQUIREMENTS.md
TODO.md		TODO.md
bun.lock		bun.lock
grid1.png		grid1.png
grid2.png		grid2.png
grid_vp.png		grid_vp.png
package.json		package.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Muser

What's local vs. what's cloud

Features

Screenshots

How it works

Requirements

Install

CLI usage

MCP usage

Platform support

Testing

Known limitation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Muser

What's local vs. what's cloud

Features

Screenshots

How it works

Requirements

Install

CLI usage

MCP usage

Platform support

Testing

Known limitation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages