01
Select and provision TEE-capable hardware
Start with hardware that supports your required isolation level. For standard on-premise: any modern GPU server (NVIDIA A100, H100, or Blackwell) with Linux. For confidential computing: AMD EPYC processors with SEV-SNP (3rd gen Milan or later) or Intel Xeon 4th gen with TDX. For GPU TEE: NVIDIA H100 in confidential computing mode. Provision with a hardened Linux base — no unnecessary services, minimal attack surface.
02
Deploy the AI platform inside your perimeter
Install Cube AI on your provisioned hardware. Cube AI runs as a containerized stack — inference runtime (vLLM or Ollama), retrieval engine, guardrails engine (NeMo Guardrails + Presidio PII detection), the Cube Proxy API gateway, and the governance dashboard. For TEE deployments, Cocos AI provisions and manages the enclave automatically. For air-gapped: load all container images and model weights via offline transfer before disconnecting.
03
Configure network isolation and egress controls
Block all outbound traffic from the AI inference nodes by default. Define explicit allow-list rules for: (a) internal clients accessing the Cube API; (b) model weight update channels, if any, on a separate isolated path; (c) audit log export destinations inside your network. Verify with network monitoring that no prompt data or model output traverses the perimeter. Document the network architecture for regulatory audit.
04
Set up governance: RBAC, domains, and guardrails
Configure Cube AI's multi-domain structure — each team or use case gets its own isolated workspace with its own identity and policies. Define role-based access control: who can register models, who can modify guardrails, who can view audit logs, who can access which domains. Author guardrail policies for your specific context: PII redaction via Presidio, prompt-injection defense, content filtering, and domain-specific rules.
05
Verify with remote attestation
For TEE deployments: run the Cocos AI attestation verification flow. Cocos AI generates an attestation report from the hardware, verifies it against AMD's or Intel's certificate chain, and confirms that the expected Cube AI software is running in an unmodified enclave. Document this verification. For regulated environments, attestation reports are audit artifacts that demonstrate the infrastructure guarantee to regulators.