Threat Model and Security Guarantees

Trust Boundaries

Understanding what you must trust — and what’s protected by hardware or blockchain — is the foundation of evaluating dstack-cloud’s security. This page maps out trust boundaries, threat categories, and the guarantees the system provides.

Untrusted

Entity	Assumption
Cloud platform (GCP / AWS)	May attempt to read workload memory, inspect traffic, or modify configurations. TEE hardware prevents memory access.
Host machine (EC2 instance on Nitro)	Has root access to the host OS. Cannot access Enclave memory or modify Enclave code.
Network attackers	May intercept, modify, or replay network traffic. Defended by TLS / RA-TLS.
RPC providers	May return stale or malicious blockchain state. KMS should use multiple RPC sources.

Protected by Hardware (TEE)

Entity	Protection
dstack CVM (GCP workload)	Memory encrypted by TDX. Host and cloud platform cannot read or modify it. Guest Agent handles attestation and key management.
Nitro Enclave (AWS workload)	Memory encrypted by Nitro. Host and cloud platform cannot read or modify it. `dstack-util` handles attestation and key retrieval.
dstack-kms (KMS)	Runs in its own TEE. Keys are generated and stored inside; never exposed outside.

Protected by Blockchain Consensus

Entity	Protection
On-chain contracts (DstackKms, DstackApp)	Immutable unless governance process is followed. Changes require multisig + timelock.

Partially Trusted

Entity	Risk
Multisig signers	Can collude to push through unauthorized changes. Impact is limited by the signature threshold and timelock delay.

Threat Categories

T1: Malicious Cloud Platform Operator or Compromised Host OS

Attack: Cloud provider or host OS administrator attempts to read workload memory or extract keys.
Impact: Data breach, key compromise.
Mitigation: TEE hardware encryption prevents memory access on both GCP (TDX) and AWS (Nitro Enclave). Attestation proves hardware authenticity.
Residual risk: Microarchitectural side-channel attacks (speculative execution, etc.). See Residual Risks.

T3: Malicious or Compromised Workload

Attack: An attacker gains control of a workload container inside the CVM or Enclave.
Impact: Data within that container is compromised. The attacker may try to escalate to the Guest Agent (GCP) or dstack-util (Nitro).
Mitigation: Container isolation within the CVM/Enclave. The Guest Agent (GCP) or dstack-util (Nitro) validates attestation before delivering keys.
Residual risk: If the attacker can modify the CVM/Enclave image itself, the measurements change and KMS will refuse to deliver keys. On Nitro, since encryption strategy is user-controlled, a compromised workload may misuse any keys it has already obtained.

T4: Man-in-the-Middle / Network Attack

Attack: Attacker intercepts communication between CVM and KMS, or between CVM and external services.
Impact: Key interception, data theft, configuration tampering.
Mitigation: All communication uses TLS or RA-TLS. RA-TLS additionally verifies both parties’ attestation.
Residual risk: TLS implementation vulnerabilities, certificate authority compromise.

T5: Compromised RPC Provider

Attack: Attacker operates a malicious RPC node that returns false blockchain state.
Impact: KMS may accept unauthorized measurements or reject authorized ones.
Mitigation: Use multiple independent RPC providers. KMS should verify blockchain state across sources.
Residual risk: If all RPC providers are colluding or compromised.

T6: Compromised or Colluding Multisig Signers

Attack: Multiple signers collude to push through unauthorized governance changes (e.g., register malicious measurements).
Impact: Unauthorized workloads receive keys from KMS.
Mitigation: Signature threshold (≥ 2/3) limits the number of signers that must be compromised. Timelock provides a window for detection.
Residual risk: If enough signers collude to meet the threshold, the system is compromised.

T7: Covert Deployer Attack

Attack: A workload deployer secretly modifies the application code after deployment.
Impact: The workload behaves differently from what was approved.
Mitigation: On-chain measurement registration. Any code change produces new measurements. KMS refuses to deliver keys to unregistered measurements.
Residual risk: If the attacker can register the new measurements through governance without being detected.

Security Guarantees

Guarantee	Mechanism
Keys never leave verified TEE	KMS runs in its own TEE. Keys are generated, stored, and dispatched entirely within TEE. The cloud provider cannot access them.
Only approved code receives keys	Workload measurements must be registered on-chain. KMS verifies measurements before dispatching keys.
Governance changes are auditable	All governance actions go through Multisig + Timelock and are recorded on-chain. Anyone can verify the history.
Memory is encrypted	TEE hardware encrypts all memory. The host OS and cloud platform cannot read CVM (GCP) or Enclave (Nitro) memory.
Code integrity is verifiable	Attestation proves the exact code and configuration running in the TEE. External parties can independently verify.

Residual Risks

These are risks that the current architecture does not fully mitigate:

Risk	Description	Mitigation
Hardware side-channels	TEE hardware may be vulnerable to microarchitectural side-channel attacks (e.g., Spectre, Meltdown variants).	Keep TCB (Trusted Computing Base) firmware updated. Monitor Intel / AWS security advisories.
Smart contract vulnerabilities	Bugs in DstackKms, DstackApp, or governance contracts could lead to unauthorized access.	Conduct formal smart contract audits. Use well-tested contract libraries (Safe, Timelock).
KMS root key	The KMS root key is currently a single point of trust within the KMS TEE.	Future plans include MPC (Multi-Party Computation) to distribute root key generation.
Denial of service	The cloud provider or host operator can shut down CVMs or Enclaves, denying service.	Use cross-region, cross-provider redundancy for high-availability deployments.

Security Checklist for Deployments

Before going to production, verify: TEE and Attestation:

dstack OS image is built from audited source code
All measurements (RTMR / OS_IMAGE_HASH) are registered on-chain
TLS certificates are valid and properly configured

Governance:

Multisig signers are using hardware wallets
Signature threshold is ≥ 2/3
Timelock delay is appropriate for your risk profile

Operations:

Multiple independent RPC providers are configured
Monitoring and alerting are set up for attestation failures and governance events
Runbook exists for common failure scenarios

Next Steps

Glossary — Definitions of security-related terms
Runbook — Troubleshooting security-related issues
dstack Security Model — Official security model document

Concepts

How-to Guides

Operations

Reference

Appendix

Threat Model and Security Guarantees

Threat Model and Security Guarantees

Trust Boundaries

Untrusted

Protected by Hardware (TEE)

Protected by Blockchain Consensus

Partially Trusted

Threat Categories

T1: Malicious Cloud Platform Operator or Compromised Host OS

T3: Malicious or Compromised Workload

T4: Man-in-the-Middle / Network Attack

T5: Compromised RPC Provider

T6: Compromised or Colluding Multisig Signers

T7: Covert Deployer Attack

Security Guarantees

Residual Risks

Security Checklist for Deployments

Next Steps

​Threat Model and Security Guarantees

​Trust Boundaries

​Untrusted

​Protected by Hardware (TEE)

​Protected by Blockchain Consensus

​Partially Trusted

​Threat Categories

​T1: Malicious Cloud Platform Operator or Compromised Host OS

​T3: Malicious or Compromised Workload

​T4: Man-in-the-Middle / Network Attack

​T5: Compromised RPC Provider

​T6: Compromised or Colluding Multisig Signers

​T7: Covert Deployer Attack

​Security Guarantees

​Residual Risks

​Security Checklist for Deployments

​Next Steps

Threat Model and Security Guarantees

Trust Boundaries

Untrusted

Protected by Hardware (TEE)

Protected by Blockchain Consensus

Partially Trusted

Threat Categories

T1: Malicious Cloud Platform Operator or Compromised Host OS

T3: Malicious or Compromised Workload

T4: Man-in-the-Middle / Network Attack

T5: Compromised RPC Provider

T6: Compromised or Colluding Multisig Signers

T7: Covert Deployer Attack

Security Guarantees

Residual Risks

Security Checklist for Deployments

Next Steps