Hardware encrypted inference.

Zero logs. Zero limits.

An encrypted AI gateway for every model.

Integration

Secure computation, end-to-end

Inference leading LLMs run inside verifiably secure runtimes, powered by Intel TDX and NVIDIA Confidential Computing architectures.

inference.ts

1import OpenAI from "openai"
2
3const zima = new OpenAI({
4  baseURL: "https://www.zima.chat/api/v1",
5  apiKey: process.env.ZIMA_KEY,
6})
7
8const response = await zima.chat.completions.create({
9  model: "qwen3-next-80B-a3B-instruct",
10  messages: [{ role: "user", content: "..." }],
11})
12
13// encrypted in hardware. zero data retained.

www.zima.chat

POST /api/v1/chat/completions

host www.zima.chat

authorization zima_sk_•••dk8f

x-zima-attest true

Sealed payload

{ "model": "qwen3-next-80B-a3B-instruct",
  "messages": [{ "role": "user", "content": "..." }] }

✓ 200 OK · 138ms

  "usage": { "prompt_tokens": 12, "total_tokens": 21 },
  "zima_proof": {
    "enclave": "tdx-v5",
    "enclave_attestation": "sha384:e7a1b3c8d2f0...",
    "zima_protected": true
  }

Zima Secured

Verified enclave execution

Enclave: Intel TDX v5
Attestation: sha384:e7a1b3c8d2f0...
Retention: Disabled
Memory: Scrubbed

Zima Secured ensures your data is wiped from memory.

Quickstart

Seamless migration to secure compute

Use your existing OpenAI client. Swap the base URL and API key to start encrypting every inference request today.

Install

Drop in the OpenAI SDK. No proprietary client, no vendor lock. One dependency you already know.

Point to Zima

Swap the base URL. Every request now routes through our secure enclave — encrypted in hardware, zero data retained.

Ship to Production

Same models, same latency, hardware-level encryption. Your compliance team will thank you.

From SDK request to protected output, without storing prompts or responses.

Architecture

Verified inference path

01Authenticate

API key verified, team resolved, usage policy enforced before entering the secure network.

02Encrypt

Payload sealed inside a hardware-backed TDX enclave. Data encrypted in CPU/GPU memory.

03Route

Forwarded to any of 100+ models. Provider credentials isolated, never exposed to the host.

04Deliver

Response returned, usage metered. Memory wiped per request - nothing persists.

Infrastructure

Secure by design

From key exchange to model output, every step runs inside hardware-attested enclaves.

Hardware encryption

Intel TDX and NVIDIA confidential compute enclaves protect every request made through Zima.

Zero data retention

No prompts, outputs, or metadata survive past completion. GPU memory gets wiped after every request.

OpenAI SDK compatible

Point your existing OpenAI client at Zima's base URL. Fully encrypted inference with a single line swap.

100+ model catalog

OpenAI, Anthropic, Mistral and more. Access the models you already use, through one secure endpoint.

Model	Provider	Input / 1M	Output / 1M
GPT-4o	OpenAI	$2.50	$10.00
Claude Sonnet 4	Anthropic	$3.00	$15.00
Gemini 2.5 Pro	Google	$1.25	$5.00
Llama 3.1 70B	Meta	$0.50	$0.75
Mistral Large	Mistral	$2.00	$6.00

Models

Zero markup

Provider pricing with zero intermediary markup. Hardware-grade encryption at no extra cost.

Full catalog

Secure Enclave

Zero bytes retained

Every request executes inside a hardware-isolated enclave. No prompts, outputs, or metadata survive past completion.

Zima Enclave

Intel TDX v5 · NVIDIA CC

0bytes retained

No prompts, outputs, or metadata survive past completion

AES-256-GCM

Memory Encryption

Data sealed in silicon. Encrypted at rest, in transit, and in use.

Attestation

Cryptographic proof before any key is released.

“We evaluated six AI gateways. Zima was the only one where our security team signed off without a single objection.”

Sarah ChenVP Engineering, Meridian Health

“Switched from direct OpenAI calls in an afternoon. Same SDK, same format — just encrypted now.”

Marcus RiveraCTO, Lattice Finance

FAQ

Frequently asked questions

Security, compatibility, and pricing.

No. Requests exist only for the duration of execution inside the enclave. No prompts, outputs, metadata, or logs are persisted. This is enforced by hardware, not policy.

$10 in free credits. Start encrypting your inference today.

Point the OpenAI SDK at Zima, keep model choice and pricing visibility, and move sensitive traffic into attested hardware.

Create Key