OpenGradient Veil

Drop-in confidential, self-verifying inference for AI agents.

Point any OpenAI SDK at og-veil with one env var. Your prompts are encrypted via Oblivious HTTP end-to-end to an attested TEE enclave, and every response is cryptographically verified before it reaches your agent, keeping every prompt private and verifiable. You trust math - not us, the host, or the network. Your agent's code doesn't change.

Private & unlinkable - Oblivious HTTP splits who you are from what you ask across two parties that never share both. The relay sees your identity (IP & account) but only ciphertext - never your prompt. The enclave sees your prompt but only the relay's IP - never you. So no one, including OpenGradient, can tie a user to a prompt (unless the relay and enclave collude).
Verified - each response is signed inside the enclave and checked on your machine, proving it ran in known, reproducible code and wasn't tampered with. Nothing unverified ever reaches your agent.

Quickstart

Requirements: a chat.opengradient.ai account - the first run logs you in through it, and the relay bills inference against that account (no wallet or key lives here). Prompts run on OpenGradient's decentralized network of attested TEE gateways and OHTTP proxy; og-veil just discovers, encrypts to, and verifies them locally.

# install (needs Python 3.11+; uv grabs one for you)
uv tool install opengradient-veil        # or: pipx install opengradient-veil

# run — logs you in (browser) the first time, then serves in the background
og-veil

# check it end-to-end — sends a one-off prompt through the verified TEE path
og-veil test "Explain TEE attestation in one line."

og-veil test posts to the same local endpoint your agent uses and prints the reply plus its tee_id, so it's the quickest way to confirm the whole path (login → encrypt → enclave → verify) works before wiring up your agent.

Point your agent at it:

export OPENAI_BASE_URL=http://127.0.0.1:11434/v1 
export OPENAI_API_KEY=og-veil            # ignored; your Chat login authenticates

from openai import OpenAI

client = OpenAI()  # picks up the env vars above
r = client.chat.completions.create(
    model="claude-sonnet-4-6",
    messages=[{"role": "user", "content": "Explain TEE attestation in one line."}],
)
print(r.choices[0].message.content)

That's it. Every response is verified before you see it — check the X-OpenGradient-Verified: true header (and the opengradient_verification block on the body). Streaming works too; it's verified before the first token replays.

Useful commands: og-veil test (send a one-off prompt to check the path), og-veil stop, og-veil restart (after an update), og-veil status, og-veil env (re-prints the env vars), og-veil models (list available models), og-veil update, og-veil logout.

Use it with Hermes Agent

Hermes Agent speaks OpenAI out of the box, so pointing it at og-veil routes every call through the verified TEE path. With og-veil running, set a custom endpoint — either via the CLI:

hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:11434/v1
hermes config set OPENAI_API_KEY og-veil      # ignored; your Chat login authenticates
hermes config set model.default claude-sonnet-4-6

…or by editing ~/.hermes/config.yaml directly:

model:
  default: claude-sonnet-4-6
  provider: custom
  base_url: http://127.0.0.1:11434/v1
  api_key: og-veil

Now hermes runs against attested, end-to-end-encrypted inference with no other changes. Confirm it's flowing through the enclave with og-veil status.

How it works

  your agent ──OpenAI SDK──▶ og-veil ──HPKE-encrypted──▶ relay ──▶ TEE gateway
                                 ▲         (sees only ciphertext)     (runs the LLM,
                                 │                                     signs in-enclave)
                                 └──── verifies the enclave's signature, then replies

Discover — picks a TEE from the on-chain registry (endpoint, encryption key, signing key, pcrHash).
Encrypt — HPKE/Oblivious-HTTP-encrypts the request; the relay (which pays per call against your Chat account) only sees ciphertext.
Verify — checks the enclave's RSA-PSS signature over the request/response hashes before handing anything back.

Why Oblivious HTTP?

Plain TLS to the gateway would hide your prompt from the network, but the gateway itself still sees both your IP and your prompt — it can build a profile of you. OHTTP fixes that by interposing a relay and splitting knowledge between two parties that never share both halves:

	sees your identity (IP/account)	sees your prompt
Relay (chat-api)	✅	❌ (ciphertext only)
TEE enclave	❌ (only the relay's IP)	✅

Your request is HPKE-sealed to the enclave's key before it leaves your machine, so the relay can route and bill it without ever decrypting it; the enclave decrypts and runs it but only ever talks to the relay, so it can't see who you are. Linking a person to a prompt would require the relay and the enclave to collude — and the enclave's code is attested and reproducible, so it provably doesn't log or phone home. (The relay still sees timing/size; that's inherent to any proxy.)

Trust chain: reproducible build → PCRs → on-chain registry (pcrHash + signing key) → per-response signature. The registry only admits a TEE whose Nitro attestation matches a known-good build. Pin it tighter with --expected-pcr <hash> to refuse any gateway whose pcrHash differs.

The protocol (registry discovery, OHTTP, verification) lives in the OpenGradient SDK (OhttpRelayClient, TEERegistry, verify_response), so this process and the web client share one implementation. This repo adds login + the local OpenAI-compatible server.

Commands

Command	What it does
`og-veil`	Set up on first run, then serve (detached). The one command you need.
`og-veil stop`	Stop the background server.
`og-veil restart`	Stop and start the background server — e.g. after `og-veil update`.
`og-veil status`	Login + network config + whether the server is running.
`og-veil test ["prompt"]`	Send a one-off prompt to the running server and print the verified reply.
`og-veil update`	Update og-veil to the latest version.
`og-veil login`	Authorize / re-authorize this device.
`og-veil setup`	Re-run the setup wizard.
`og-veil serve -f`	Run blocking in the foreground (for systemd/Docker).
`og-veil logout`	Remove the saved session.

Lifecycle

Background by default. Setup/login runs in the foreground, then it detaches and frees your terminal. Logs: ~/.opengradient/local/server.log. Use --foreground to block instead.
Stays signed in. The access token auto-refreshes. If you sign out in the Chat app, the next request tells you to run og-veil login.
Survives a dead node. If the chosen TEE goes offline, it reselects another from the registry and retries once.

Configuration

Session + prefs live in ~/.opengradient/local/ (override with OG_VEIL_HOME).

Env var	Flag	Default	Purpose
`OG_VEIL_PORT`	`--port`	`11434`	Bind port.
`OG_VEIL_HOST`	`--host`	`127.0.0.1`	Bind host.
`OG_VEIL_TEE_ID`	`--tee-id`	—	Pin a specific registry TEE.
`OG_VEIL_EXPECTED_PCR_HASH`	`--expected-pcr`	—	Refuse any TEE whose `pcrHash` differs.
`OG_VEIL_APP_URL`	`--app-url`	`https://chat.opengradient.ai`	Chat app origin for login.
`OG_VEIL_PII_SCRUB`	`--pii-scrub`	off	Redact high-impact PII from prompts locally before they leave the machine.

Local PII redaction (opt-in)

OHTTP unlinks who you are from what you ask — but only if the prompt itself doesn't name you. With --pii-scrub on, concrete identifiers are replaced with [REDACTED_*] tags locally before the prompt is encrypted, so they never leave your machine. Install the optional extra (one step — no model download) and turn it on:

uv tool install 'opengradient-veil[pii]'   # or: pipx install 'opengradient-veil[pii]'
og-veil --pii-scrub        # or: export OG_VEIL_PII_SCRUB=1

Redacts email, phone, US SSN, credit cards, IBANs, US bank numbers, and street addresses via Microsoft Presidio's pattern/checksum recognizers. Names, cities/countries, and dates are left in — detecting them needs statistical NER that over-redacts the third-party names real prompts are full of and mislabels uncommon ones. So this is a backstop for the hard data, not a substitute for your own discretion. Redaction is irreversible (the TEE's signed output_hash covers exactly what it ran); if the extra isn't installed, the server refuses to start rather than send PII.

Notes & limitations

OpenAI-compatible only (/v1/chat/completions, /v1/models); an Anthropic /v1/messages shim is a planned follow-up.
Verify-before-emit trades a little first-token latency for the guarantee that no unverified token leaves the machine.
Payment via your Chat account (the relay settles x402 server-side) — no wallet or key lives here. For a self-custodial wallet path, see the SDK's og.LLM.

Development

git clone https://github.com/OpenGradient/local && cd local
uv sync --all-groups
uv run pytest
uv run ruff check . && uv run mypy veil

Protocol-level crypto is tested in the SDK repo against the real tee-gateway recipient code, guaranteeing wire compatibility.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
scripts		scripts
tests		tests
veil		veil
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenGradient Veil

Quickstart

Use it with Hermes Agent

How it works

Why Oblivious HTTP?

Commands

Lifecycle

Configuration

Local PII redaction (opt-in)

Notes & limitations

Development

About

Uh oh!

Releases 13

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenGradient Veil

Quickstart

Use it with Hermes Agent

How it works

Why Oblivious HTTP?

Commands

Lifecycle

Configuration

Local PII redaction (opt-in)

Notes & limitations

Development

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 13

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages