Deploy GPU Pods in less than 3 seconds — fully configured and ready to run.
Fast, flexible, and cost-efficient GPU environments for your AI workloads.
Built for real-world AI workloads

Fast, On-Demand Compute
Launch powerful GPU resources in seconds, only when you need them, without idle capacity.

Fully Customizable Environments
Run any workload using Docker, official images, or your own containers with full control over dependencies and runtime.

Built for Long-Running Workloads
Stable, persistent compute environments built for continuous training and production inference.

Popular GPUs at Lower Prices
Looking for long-term deployments, discounted pricing, or custom GPU availability?
| GPU Model | VRAM | RAM | vCPUs | Price |
|---|---|---|---|---|
| RTX 4090 | 24 GB | 124 GB | 31 | $0.38/hr |
| RTX 5090 | 32 GB | 119 GB | 30 | $0.65/hr |
| RTX PRO 6000 | 96 GB | 182 GB | 14 | $1.56/hr |
| H100 | 80 GB | 124 GB | 14 | $1.75/hr |
| H200 | 141 GB | 245 GB | 18 | $2.10/hr |
| B200 | 180 GB | 184 GB | 30 | $4.40/hr |
| B300 | 262 GB | 275 GB | 30 | $5.43/hr |
Prices shown are example on-demand rates. Actual pricing may vary by region, availability, and deployment configuration.
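To see how on-demand billing works out in practice, here is a minimal cost-estimate sketch using the example hourly rates from the table above (illustrative rates only; actual pricing varies by region, availability, and configuration):

```python
# Example on-demand rates in USD per GPU-hour, taken from the table above.
# These are illustrative figures, not a live price list.
EXAMPLE_RATES = {
    "RTX 4090": 0.38,
    "H100": 1.75,
    "H200": 2.10,
}

def estimate_cost(gpu: str, hours: float, num_gpus: int = 1) -> float:
    """Return the estimated on-demand cost in USD for a run."""
    return EXAMPLE_RATES[gpu] * hours * num_gpus

# A 48-hour fine-tuning run on 4x H100 at the example rate:
print(f"${estimate_cost('H100', 48, 4):.2f}")  # -> $336.00
```

Because billing is per-hour with no idle capacity, short experimental runs on a cheaper card (e.g. an RTX 4090 at $0.38/hr) can cost only a few dollars.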
Flexible Compute for Every Need
Choose the compute type that fits your workflow, whether that is fast iterative development or long-term production.

Versatile Pod
Lightweight, fast-starting GPU environments ideal for development, experimentation, and scalable workloads.

Full Virtual Machine
Dedicated GPU VMs with full system control, designed for long-running, specialized, or stateful workloads.
Product Highlights

Multi-region GPU availability
Distribute workloads across regions for improved resilience, capacity planning, and geographic flexibility.
Persistent storage support
Support persistent data and state for long-running training and production inference workloads.
Secure, isolated environments
Provide workload isolation and access controls suitable for team-based and enterprise deployments.

Flexible Templates for Fast AI Deployment
Use ready-made templates or create private ones to deploy Pods and Elastic Deployments consistently and efficiently.

Pre-Configured Model Templates
Launch Pods or Elastic Deployments instantly for popular models and workflows, including ComfyUI, Wan2.1/2.2, Qwen, and FLUX.1-dev.

Custom Private Templates
Create and manage reusable templates for organization-wide or personal deployments.

Streamlined Deployment Workflow
Reduce setup time and operational overhead with templates designed for efficiency and flexibility.