Pods
designed to scale with you.
GPU
Serverless
workload. Save 25% over other Serverless cloud providers on flex workers alone.
GPU
Instant Clusters
GPU
Per second
Per hour
Reserved Clusters
GPU
1mo
3mo
6mo
12mo
12mo+
Storage
$0.10/GB/mo
Idle - $0.20/GB/mo
Under 1TB - $0.07/GB/mo
Over 1TB - $0.05/GB/mo
Under 1TB - $0.14/GB/mo
Over 1TB - $0.07/GB/mo
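
As a rough illustration of how the tiered per-GB rates above turn into a monthly bill, here is a minimal sketch. The helper name `monthly_storage_cost` is hypothetical, and the page does not state several details, so the sketch assumes them: 1 TB is treated as 1000 GB, the rate is chosen by total volume size and applied to the whole volume (not marginally), and a volume of exactly 1 TB is billed at the "Over 1TB" rate.

```python
from decimal import Decimal

# Assumption: decimal terabyte (1 TB = 1000 GB); the page does not define it.
GB_PER_TB = 1000


def monthly_storage_cost(size_gb, under_rate, over_rate, threshold_gb=GB_PER_TB):
    """Hypothetical helper: estimate a monthly storage charge in dollars.

    Assumes the per-GB rate is picked by total size and applied to the
    whole volume, and that sizes at or above the threshold use over_rate.
    """
    if size_gb < 0:
        raise ValueError("size_gb must be non-negative")
    rate = under_rate if size_gb < threshold_gb else over_rate
    return Decimal(size_gb) * rate


# Rates taken from the "Under 1TB - $0.07/GB/mo" and "Over 1TB - $0.05/GB/mo" rows.
under, over = Decimal("0.07"), Decimal("0.05")
print(monthly_storage_cost(500, under, over))   # 500 GB at $0.07/GB/mo
print(monthly_storage_cost(2000, under, over))  # 2000 GB at $0.05/GB/mo
```

If the provider bills marginally instead (the first 1 TB at the higher rate and only the remainder at the lower one), the estimate for large volumes would be higher than this sketch reports.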
Public Endpoints
$0.05 per 1000 characters
$0.00 per 1000 characters
$0.05 per 1000 characters
$0.05 per 1000 characters
$0.0270 per request
$0.0270 per request
$0.0380 per request
$0.14 per request
$0.0050 per request
$0.01 per request
$0.03 per request
$0.02 per request
$0.025 per request
$0.0050 per request
$0.00001 per 1M tokens
$10.00 per 1M tokens
$0.05 per 1000 characters
$0.05 per 1000 characters
$1.00 per 1M tokens
5s: $0.12 (480p) per request
5s: $0.30 per request
5s: $0.30 per request
$0.30 per request
$0.30 per request
1-3s: $0.21 per request
5s: $0.50 per request
$0.024 per second
$0.112 per second
5s: $0.50 per request

