Rent a cloud GPU. Any model. Under 30 seconds.

On-demand GPU rental with per-second billing. 30+ GPU models across 31 global regions. No minimums, no egress fees, no idle waste.

30-second GPU deploys

Spin up any GPU instance in under 30 seconds — no provisioning queues, no sales calls, no wait.

31 global regions

Rent GPU instances in 31 regions across the US, Europe, Asia, and Australia. Deploy where your users are, not where inventory is.

Per-second GPU billing

GPU rental billed by the second. No egress fees, no minimums, no surprises. Run a job for 3 minutes — pay for 3 minutes.
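As a rough illustration of per-second billing, the cost of a run is the hourly rate multiplied by the seconds used and divided by 3,600. The sketch below assumes simple linear proration and rounds to four decimal places; the function name `run_cost`, the rounding rule, and the choice of the $3.29/hr rate (the H100 SXM price listed on this page) are illustrative assumptions, not Runpod's actual billing logic.

```python
from decimal import Decimal, ROUND_HALF_UP

SECONDS_PER_HOUR = 3600

def run_cost(hourly_rate_usd: str, seconds: int) -> Decimal:
    """Prorate an hourly rate over a number of seconds (assumed linear)."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    # Multiply before dividing to avoid intermediate rounding.
    cost = Decimal(hourly_rate_usd) * seconds / SECONDS_PER_HOUR
    # Rounding to 4 decimal places is an assumption made for display only.
    return cost.quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)

# A 3-minute (180-second) job on a $3.29/hr GPU.
print(run_cost("3.29", 180))  # 0.1645
```

Under these assumptions, a 3-minute job on a $3.29/hr GPU costs about 16 cents rather than a full hour's charge.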

Trusted by top engineers at the world's leading companies.

30+ GPU models to rent. 31 global regions.

On-demand GPU cloud pricing with no long-term commitments. Rent by the second or lock in savings with reservations.

GPU

>80 GB VRAM

H200: 141 GB VRAM, 276 GB RAM, 24 vCPUs, $4.39/hr
B200: 180 GB VRAM, 283 GB RAM, 28 vCPUs, $5.89/hr
RTX Pro 6000: 96 GB VRAM, 188 GB RAM, 16 vCPUs, $2.09/hr
H100 NVL: 94 GB VRAM, 94 GB RAM, 16 vCPUs, $3.19/hr

80 GB VRAM

H100 PCIe: 80 GB VRAM, 188 GB RAM, 16 vCPUs, $2.89/hr
H100 SXM: 80 GB VRAM, 125 GB RAM, 20 vCPUs, $3.29/hr
A100 PCIe: 80 GB VRAM, 117 GB RAM, 8 vCPUs, $1.39/hr
A100 SXM: 80 GB VRAM, 125 GB RAM, 16 vCPUs, $1.49/hr

48 GB VRAM

L40S: 48 GB VRAM, 94 GB RAM, 16 vCPUs, $0.86/hr
RTX 6000 Ada: 48 GB VRAM, 167 GB RAM, 10 vCPUs, $0.77/hr
A40: 48 GB VRAM, 50 GB RAM, 9 vCPUs, $0.44/hr
L40: 48 GB VRAM, 94 GB RAM, 8 vCPUs, $0.99/hr
RTX A6000: 48 GB VRAM, 50 GB RAM, 9 vCPUs, $0.49/hr

32 GB VRAM

RTX 5090: 32 GB VRAM, 35 GB RAM, 9 vCPUs, $0.99/hr

24 GB VRAM

L4: 24 GB VRAM, 50 GB RAM, 12 vCPUs, $0.39/hr
RTX 3090: 24 GB VRAM, 125 GB RAM, 16 vCPUs, $0.46/hr
RTX 4090: 24 GB VRAM, 41 GB RAM, 6 vCPUs, $0.69/hr
RTX A5000: 24 GB VRAM, 25 GB RAM, 9 vCPUs, $0.27/hr
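
One way to read the price list above is to pick the cheapest GPU that meets a VRAM requirement. The sketch below hard-codes a subset of the listed on-demand prices as a snapshot; the `Gpu` type, the `CATALOG` list, and the `cheapest_with_vram` helper are hypothetical names for illustration and are not part of any Runpod API.

```python
from typing import NamedTuple, Optional, Sequence

class Gpu(NamedTuple):
    name: str
    vram_gb: int
    hourly_usd: float

# Subset of the prices listed on this page; treat it as a snapshot, not live data.
CATALOG = [
    Gpu("H200", 141, 4.39),
    Gpu("H100 SXM", 80, 3.29),
    Gpu("A100 PCIe", 80, 1.39),
    Gpu("L40S", 48, 0.86),
    Gpu("A40", 48, 0.44),
    Gpu("RTX 5090", 32, 0.99),
    Gpu("RTX 4090", 24, 0.69),
    Gpu("RTX A5000", 24, 0.27),
]

def cheapest_with_vram(min_vram_gb: int,
                       catalog: Sequence[Gpu] = CATALOG) -> Optional[Gpu]:
    """Return the lowest-priced GPU with at least min_vram_gb, or None."""
    candidates = [g for g in catalog if g.vram_gb >= min_vram_gb]
    return min(candidates, key=lambda g: g.hourly_usd, default=None)

# A model that needs roughly 30 GB of VRAM.
print(cheapest_with_vram(30))
```

Note that a larger card can be the cheaper choice: in this snapshot the 48 GB A40 costs less per hour than the 32 GB RTX 5090, so filtering by VRAM alone and then sorting by price gives a different answer than picking the smallest card that fits. Price is only one factor; throughput per dollar also depends on the workload.
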
"The Runpod team has clearly prioritized the developer experience to create an elegant solution that enables individuals to rapidly develop custom AI apps or integrations while also paving the way for organizations to truly deliver on the promise of AI."

Amjad Masad

"Runpod is the only place I can deploy high-end GPU models instantly—no sales calls, no rate limits, no nonsense."

Daniel Chang

“The main value proposition for us was the flexibility Runpod offered. We were able to scale up effortlessly to meet the demand at launch.”

Josh Payne

“Runpod helped us scale the part of our platform that drives creation. That’s what fuels the rest—image generation, sharing, remixing. It starts with training.”

Matty Shimura

Built-in developer tools & integrations.

Runpod works wherever you build — in code, in your terminal, in your pipeline.

Full API access.

Automate everything with a simple, flexible API.

CLI & SDKs.

Deploy and manage directly from your terminal.

GitHub & CI/CD.

Push to main, trigger builds, and deploy in seconds.

Persistent storage. No ingress fees. No egress fees.

No fees for ingress/egress. Persistent and temporary storage available.

Pod Pricing

Storage Type     Running Pods   Idle Pods
Volume           $0.10/GB/mo    $0.20/GB/mo
Container Disk   $0.10/GB/mo    N/A

Persistent Network Storage

Storage Type     Under 1 TB     Over 1 TB
Network Volume   $0.07/GB/mo    $0.05/GB/mo
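
To make the storage rates concrete, the sketch below estimates a monthly bill from a volume size. It rests on assumptions the page does not state: that 1 TB means 1,000 GB, that the lower network-volume rate applies to the whole volume once it reaches 1 TB (rather than only to the portion above it), and that a full month is billed. The function names are hypothetical.

```python
from decimal import Decimal

TB_IN_GB = 1000  # assumption: decimal terabyte

def network_volume_monthly(size_gb: int) -> Decimal:
    """Monthly cost of a network volume, assuming a flat rate chosen by total size."""
    rate = Decimal("0.07") if size_gb < TB_IN_GB else Decimal("0.05")
    return rate * size_gb

def pod_volume_monthly(size_gb: int, running: bool) -> Decimal:
    """Monthly cost of a Pod volume: $0.10/GB running, $0.20/GB idle."""
    rate = Decimal("0.10") if running else Decimal("0.20")
    return rate * size_gb

print(network_volume_monthly(500))    # 500 GB network volume
print(pod_volume_monthly(100, False)) # 100 GB volume on an idle Pod
```

Because the idle rate for Pod volumes is double the running rate, data that needs to outlive a Pod for long periods may be cheaper on a network volume; check the current pricing page before relying on these figures.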

Gain additional savings with reservations.

Save more with long-term commitments. Speak with our team to reserve discounted active and flex workers.

Questions? Answers.

Curious about unlocking GPU power in the cloud? Get clear answers to accelerate your projects with on-demand high-performance compute.

GPU Pods are dedicated GPU instances you can spin up on Runpod. Unlike abstracted serverless GPUs, Pods give you full control over the underlying VM, drivers, and environment. You get a persistent instance (or ephemeral, if you prefer) with direct access to powerful GPUs, letting you run training, inference, or other workloads exactly how you want.

We offer 30+ GPU models, from entry-level inference cards to top-tier training accelerators, including the A100, H100, RTX 6000 Ada, and the L4/L40 series. You can pick any supported GPU when you launch a Pod, and new models are added as they become available. For the latest availability, check the dashboard or query the API.

Pricing is shown as an hourly rate but billed by the second. You pay only for the time your Pod runs: if you start a Pod and stop it one minute later, you are charged for just that minute. Storage volumes may incur small fees while attached, but compute is metered by the second.

GPU Pods support custom Docker images. You can build an image with your preferred libraries, push it to a registry (Docker Hub, ECR, etc.), and reference it when you launch the Pod. That way you control the OS, drivers, and dependencies.

Pods support any framework that runs on Linux and supports GPUs: PyTorch, TensorFlow, JAX, ONNX, CUDA toolkits, etc. Since you control the container, you can install whatever versions or additional tools you need (e.g., NCCL, Horovod). We provide base images with common ML stacks to speed up setup.

We offer spot instances where GPU capacity is available at a discount, but with the risk of eviction when demand spikes. You can use them for fault-tolerant or batch workloads. The UI/API will indicate current spot availability and pricing.
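
For a batch job to tolerate eviction, it generally needs to record its progress somewhere that outlives the instance and resume from there. The sketch below shows one minimal way to do that with a JSON checkpoint file; the function names and file layout are hypothetical, and it assumes the checkpoint path sits on storage that persists across Pod restarts.

```python
import json
import os
import tempfile

def load_checkpoint(path: str) -> int:
    """Return the index of the next item to process, or 0 on a fresh start."""
    try:
        with open(path) as f:
            return json.load(f)["next_index"]
    except FileNotFoundError:
        return 0

def save_checkpoint(path: str, next_index: int) -> None:
    """Write to a temp file, then rename, so an interrupted write cannot corrupt the checkpoint."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump({"next_index": next_index}, f)
    os.replace(tmp_path, path)  # atomic replacement on the same filesystem

def run_batch(items, process, checkpoint_path: str) -> None:
    """Process items in order, resuming after the last completed one."""
    start = load_checkpoint(checkpoint_path)
    for i in range(start, len(items)):
        process(items[i])
        save_checkpoint(checkpoint_path, i + 1)
```

If the instance is evicted mid-run, calling `run_batch` again with the same checkpoint path skips the items already completed. An item that was in progress at eviction time is retried, so `process` should be safe to run more than once for the same input.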

750,000 developers chose Runpod without a sales call.

Engineered for teams building the future.

Wix, Otovo, Scatter Lab, Abzu, Aneta, Perplexity, Replit, Civitai

Build something new today