Endpoints

GET https://inference.phala.com/v1/models
Use the live model catalog before hardcoding model IDs. The catalog returns model IDs, context windows, pricing, serving metadata, modalities, supported parameters, and whether a model can be served confidentially.

Examples

curl https://inference.phala.com/v1/models \
  -H "Authorization: Bearer <API_KEY>"

Response

{
  "data": [
    {
      "id": "phala/qwen3.5-27b",
      "name": "Qwen3.5 27B",
      "created": 1677652288,
      "is_tee": true,
      "description": "Qwen model running through Phala GPU TEE infrastructure",
      "context_length": 262144,
      "max_output_length": 262144,
      "pricing": {
        "prompt": "0.00000030",
        "completion": "0.00000240"
      },
      "providers": ["phala"],
      "input_modalities": ["text"],
      "output_modalities": ["text"],
      "supported_parameters": ["max_tokens", "temperature", "tools", "tool_choice", "response_format"],
      "metadata": {}
    }
  ]
}
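As a minimal sketch of consulting the catalog before relying on a model ID, the Python below fetches the endpoint with the standard library and looks up an ID in the `data` array. The helper names `fetch_catalog` and `find_model` are illustrative assumptions, not part of the API, and the lookup is demonstrated offline on a trimmed copy of the sample response.

```python
import json
import urllib.request
from typing import Optional

CATALOG_URL = "https://inference.phala.com/v1/models"


def fetch_catalog(api_key: str, url: str = CATALOG_URL) -> dict:
    """Fetch the live model catalog and return the parsed JSON body."""
    request = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {api_key}"}
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def find_model(catalog: dict, model_id: str) -> Optional[dict]:
    """Return the model object whose id matches, or None if it is not listed."""
    for model in catalog.get("data", []):
        if model.get("id") == model_id:
            return model
    return None


# Offline demonstration on a trimmed copy of the sample response.
# For live data, use: catalog = fetch_catalog("<API_KEY>")
sample_catalog = {
    "data": [
        {"id": "phala/qwen3.5-27b", "is_tee": True, "context_length": 262144}
    ]
}

model = find_model(sample_catalog, "phala/qwen3.5-27b")
if model is None:
    print("model not listed in catalog")
else:
    print(model["id"], model["context_length"])
```

Returning `None` for an unknown ID lets the caller decide whether to fail or fall back to another model, instead of sending a request with a stale identifier.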

Model Object Fields

| Field | Description |
| --- | --- |
| id | Model identifier for API calls |
| name | Human-readable model name |
| is_tee | true if the model can be served confidentially by a verified TEE provider |
| description | Model or provider description |
| context_length | Maximum context window |
| max_output_length | Maximum output length |
| pricing.prompt | Input token price per token; multiply by 1,000,000 for per-million-token pricing |
| pricing.completion | Output token price per token; multiply by 1,000,000 for per-million-token pricing |
| providers | Serving routes available for the model |
| input_modalities | Supported input types, such as text or image |
| output_modalities | Supported output types, such as text or embeddings |
| supported_parameters | Request parameters accepted by the model |
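The pricing fields are per-token prices encoded as strings, so the per-million conversion is a single multiplication. The sketch below uses `Decimal` to avoid floating-point rounding; the function names `price_per_million` and `estimate_cost` are hypothetical helpers, and the values come from the sample response.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def price_per_million(per_token_price: str) -> Decimal:
    """Convert a per-token price string into a per-million-token price."""
    return Decimal(per_token_price) * TOKENS_PER_MILLION


def estimate_cost(pricing: dict, prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the cost of one request from the pricing object and token counts."""
    return (
        Decimal(pricing["prompt"]) * prompt_tokens
        + Decimal(pricing["completion"]) * completion_tokens
    )


# Pricing values copied from the sample response.
pricing = {"prompt": "0.00000030", "completion": "0.00000240"}

print("prompt per 1M tokens:", price_per_million(pricing["prompt"]))
print("completion per 1M tokens:", price_per_million(pricing["completion"]))
print("1,000 prompt + 500 completion tokens:", estimate_cost(pricing, 1000, 500))
```

For the sample model this works out to 0.30 per million input tokens and 2.40 per million output tokens, in whatever currency unit the catalog prices are quoted in.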

Find Verifiable TEE Models

Filter for models that can be served confidentially:
curl -s https://inference.phala.com/v1/models \
  -H "Authorization: Bearer <API_KEY>" | \
  jq -r '.data[] | select(.is_tee == true) | .id'
is_tee: true means the model can be served confidentially; it does not by itself prove that a particular response was. The receipt remains the per-response proof: read the x-receipt-id header, then verify it with Attestation Report and Get Receipt.
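The same filter can be applied in application code and combined with other catalog fields. The following is a sketch that assumes the catalog has already been parsed into a dict: `select_models` is a hypothetical helper, and the second entry in the sample data is invented purely to show a model being filtered out.

```python
def select_models(catalog, require_tee=True, required_params=()):
    """Return IDs of models that match the TEE flag and support every listed parameter."""
    selected = []
    for model in catalog.get("data", []):
        # Skip models that cannot be served confidentially when TEE is required.
        if require_tee and model.get("is_tee") is not True:
            continue
        # Skip models that lack any of the requested request parameters.
        supported = set(model.get("supported_parameters", []))
        if not set(required_params) <= supported:
            continue
        selected.append(model["id"])
    return selected


catalog = {
    "data": [
        {
            "id": "phala/qwen3.5-27b",
            "is_tee": True,
            "supported_parameters": ["max_tokens", "temperature", "tools"],
        },
        {
            # Hypothetical entry, not a real catalog model.
            "id": "example/non-tee-model",
            "is_tee": False,
            "supported_parameters": ["max_tokens"],
        },
    ]
}

print(select_models(catalog))
print(select_models(catalog, required_params=("tools",)))
print(select_models(catalog, require_tee=False))
```

This only narrows the candidate list. A matching `is_tee` flag describes what the model can do, so each response still needs its receipt checked.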