Fast, flexible, and cost-efficient GPU environments for your AI workloads.

Fast, On-Demand Compute
Launch powerful GPU resources in seconds, only when you need them, without idle capacity.

Fully Customizable Environments
Run any workload using Docker, official images, or your own containers with full control over dependencies and runtime.

Built for Long-Running Workloads
Stable, persistent compute environments built for continuous training and production inference.

Looking for long-term deployments, discounted pricing, or custom GPU availability
GPU
| GPU Model | VRAM | RAM | vCPU | On-demand | Spot | |
|---|---|---|---|---|---|---|
RTX 4090 | 24 GB | 115 GB | 30 | $0.38/hr | — | |
RTX A6000 | 48 GB | 24 GB | 6 | $0.45/hr | 0.27/hr | |
RTX 5090 popular | 32 GB | 88 GB | 14 | $0.70/hr | — | |
A100 PCIe 80G | 80 GB | 75 GB | 15 | $0.92/hr | 1.1664/hr | |
RTX 6000 Ada | 48 GB | 60 GB | 10 | $0.97/hr | 0.3888/hr | |
RTX PRO 6000 | 96 GB | 119 GB | 14 | $1.35/hr | 0.7236/hr | |
A100 80G | 80 GB | 120 GB | 22 | $1.48/hr | 0.5724/hr | |
H100 | 80 GB | 119 GB | 11 | $2.56/hr | 0.95/hr | |
H200 | 141 GB | 181 GB | 22 | $2.50/hr | 1.60/hr | |
B200 | 180 GB | 184 GB | 30 | $5.37/hr | 2.20/hr | |
B300 | 262 GB | 275 GB | 30 | $7.64/hr | 3.10/hr |
Prices shown are example on-demand rates. Actual pricing may vary by region, availability, and deployment configuration.
Choose the right compute type to fit your workflow, whether it’s fast development or long-term production.

Versatile Pod
Lightweight, fast-starting GPU environments ideal for development, experimentation, and scalable workloads.

Full Virtual Machine
Dedicated GPU VMs with full system control, designed for long-running, specialized, or stateful workloads.

Multi-region GPU availability
Distribute workloads across regions for improved resilience, capacity planning, and geographic flexibility.
Persistent storage support
Support persistent data and state for long-running training and production inference workloads.
Secure, isolated environments
Provide workload isolation and access controls suitable for team-based and enterprise deployments.

Use ready-made templates or create private ones to deploy Pods and Serverless consistently and efficiently.

Pre-Configured Model Templates
Launch Pods or Serverless instantly for popular models and workflows, including ComfyUI, Wan2.1/2.2, Qwen, and FLUX-1.dev.

Custom Private Templates
Create and manage reusable templates for organization-wide or personal deployments.

Streamlined Deployment Workflow
Reduce setup time and operational overhead with templates designed for efficiency and flexibility.