We're officially SOC 2 Type II Compliant
You've unlocked a referral bonus! Sign up today and you'll get a random credit bonus between $5 and $500
You've unlocked a referral bonus!
Claim Your Bonus
Claim Bonus

Pods

Thousands of GPUs across 30+ regions. Simple pricing plans for teams of all sizes,
designed to scale with you.

Serverless

Cost effective for every inference
workload. Save 25% over other Serverless
cloud providers on flex workers alone.

GPU

Per second

Per hour

Flex
Workers that scale up during traffic spikes and return to idle after completing jobs. Cost-efficient and ideal for bursty workloads.
Active
Always-on workers that eliminate cold starts. Billed continuously but come with up to 30% discount.
8.64
/s
6.84
/s
180GB
B200
Maximum throughput for big models.
5.58
/s
4.46
/s
141GB
H200
Extreme throughput for big models.
4.18
/s
3.35
/s
80GB
H100
PRO
Extreme throughput for big models.
2.72
/s
2.17
/s
80GB
A100
High throughput GPU, yet still very cost-effective.
1.9
/s
1.33
/s
48GB
L40, L40S, 6000 Ada
PRO
Extreme inference throughput on LLMs like Llama 3 7B.
1.22
/s
0.85
/s
48GB
A6000, A40
A cost-effective option for running big models.
1.58
/s
1.11
/s
32GB
5090
PRO
Extreme throughput for small-to-medium models.
1.1
/s
0.77
/s
24GB
4090
PRO
Extreme throughput for small-to-medium models.
0.69
/s
0.48
/s
24GB
L4, A5000, 3090
Great for small-to-medium sized inference workloads.
0.58
/s
0.4
/s
16GB
A4000, A4500, RTX 4000, RTX 2000
The most cost-effective for small models.

Instant Clusters

Launch multi-GPU clusters in minutes with no commitments—scale up to 64 GPUs, attach shared storage, and pay only for what you use.

GPU

Per second

Per hour

H200 SXM
Contact sales
$
4.31
/hr
A100 SXM
Contact sales
$
1.79
/hr
H100 SXM
L40S
B200

Reserved Clusters

Dedicated GPU clusters with guaranteed availability, custom configurations, SLA-backed uptime, and discounted rates for enterprises scaling to 10,000+ GPUs.

GPU

1mo

3mo

6mo

12mo

12mo+

Storage

Flexible and persisitent storage options starting at $0.05/GB/mo with standard and high-performance tiers.
Storage Type
Container Disk

$0.10/GB/mo

Volume Disk

Idle - $0.20/GB/mo

Network Storage (High-Performance)

Under 1TB - $0.14/GB/mo

Over 1TB - $0.07/GB/mo

Public Endpoints

Instant access to pre-deployed AI models via API—no infrastructure setup required.
Model Name
Audio
Pruna / Whisper V3 Large

$0.05 per 1000 characters

resembleai / Chatterbox Turbo

$0.00 per 1000 characters.

minimax / Minimax Speech 02 HD

$0.05 per 1000 characters

minimax / Minimax Speech 02 HD

$0.05 per 1000 characters

Image
bytedance / Seedream 4.0 Edit

$0.0270 per request

bytedance / Seedream 4.0 T2I

$0.0270 per request

google / Nano Banana Edit

$0.0380 per request

google / Nano Banana Pro Edit

$0.14 per request

pruna / Pruna Image T2I

$0.0050 per request

pruna / Pruna Image Edit

$0.01 per request

alibaba / WAN 2.6 T2I

$0.03 per request

qwen / Qwen Image Edit 2511

$0.02 per request

qwen / Qwen Image Edit 2511 LoRA

$0.025 per request

Tongyi-MAI / Z Image Turbo

$0.0050 per request.

Language
deep-cogito / Deep Cogito v2 Llama 70B

$0.00001 per 1m tokens

qwen / Qwen3 32B AWQ

$10.00 per 1m tokens

minimax / Minimax Speech 02 HD

$0.05 per 1000 characters

minimax / Minimax Speech 02 HD

$0.05 per 1000 characters

ibm / IBM Granite 4.0 H Small

$1.00 per 1m tokens

Video
Bytedance / Seedance 1.0 pro

5s: $0.12(480p) per request

Alibaba / Wan 2.2 I2V 720p

5s: $0.30 per request

Alibaba / Wan 2.2 T2V 720p

5s: $0.30 per request

Alibaba / Wan 2.1 I2V 720p

$0.30 per request

Alibaba / Wan 2.1 T2V 720p

$0.30 per request

kwaivgi / Kling v2.6 Standard Motion Control

1-3s $0.21 per request

Alibaba / WAN 2.6 T2V

5s: $0.50 per request

bytedance / Seedance V1.5 Pro I2V

$0.024 per second

kwaivgi / Kling Video O1 R2V

$0.112 per second

Alibaba / Wan 2.6 I2V

5s: $0.50 per request

Storage Pricing

Flexible, cost-effective storage for every workload.

No fees for ingress/egress. Persistent and temporary storage available.
Pod Pricing

Storage Type

Running Pods

Idle Pods

Volume
$0.10/GB/mo
$0.20/GB/mo
Container Disk
$0.10/GB/mo
NA
Persistent Network Storage

Storage Type

Under 1TB

Over 1TB

Network Volume
$0.07/GB/mo
$0.05/GB/mo

Gain additional savings
with reservations.

Save more with long-term commitments. Speak with our team to reserve discounted active and flex workers.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.

You’ve unlocked a
referral bonus!

Sign up today and you’ll get a random credit bonus between $5 and $500 when you spend your first $10 on Runpod.