Private inference · attested Apple Silicon

Run the models nobody else will host.

Uncensored models and community fine-tunes, served from verified Macs — where the machine's owner can't read your prompts, and you can prove it.

9 models live · uncensored + fine-tunesVerify attestation →
100%
Hardware-attested
Every response on a real SE-backed Mac
Models live
registry · uncensored + community fine-tunes
Active providers
Apple Silicon, attested via Secure Enclave
~51%
Vs hosted APIs
Same models, ~half the price

Two sides of the eclipse

01 / duality
For developers

One base_url swap.

Keep your OpenAI client. Reach the long tail — uncensored models, rare fine-tunes — that no hosted API will touch. Your prompts stay private.

# point your existing client at Umbra
client = OpenAI(
  base_url="https://api.umbra.dev/v1",
  api_key="umbra-…")
For Mac owners

Your Mac, idle 18 hours a day.

Host the models you choose — pull them with your own Hugging Face key — and earn per token. The work runs where you can't be watched, and neither can the prompts.

# install the provider, start earning
curl -fsSL umbra.dev/install | sh
umbra serve › earning · 1,179 tok/s

How the dark holds

02 / attestation
01 · identity

Secure Enclave

P-256 keypair on the chip. Private key never leaves the Enclave, even to RAM.

02 · enrollment

MDM profile

Apple MDM SecurityInfo cross-checks the SE serial. Bad posture → out.

03 · policy

Apple MDA

Full X.509 chain to the Apple Enterprise Attestation Root CA.

04 · code identity

APNs

Binary SHA-256 pinned. The provider is the audited build, not a fork.

05 · model digest

GGUF SHA-256

You got the real model — verified at request time, every request.

You don't have to trust Umbra. The chain is in every response receipt. Verify in your browser, against Apple's published root store.verified locally · no telemetery · no Umbra round-trip
Models with
no other home.

The long tail incumbents won't host — uncensored and community fine-tunes — served from Macs you can verify, at prices the cloud can't match. Browse the catalog, pick one, point your client at it.

What we believe

The machine owner can't read your prompt — and you can prove it.

Private-prompt inference isn't a feature we promise. It's a property the hardware has. Every other promise in this market is built on top of that property — and on top of the assumption that you, the buyer, will trust the seller's word for it. We don't think you should have to.

Your idle Mac is a 1,200 tok/s inference server.

Most Apple Silicon machines sit unused 18 hours a day. Umbra turns that into a verified per-token earn — and gives the long-tail model market the hosts it can't get anywhere else.