Spheron GPU Catalog

GPU Rental: On-Demand NVIDIA H100, A100, B200, RTX 5090 from $0.58/hr

Rent NVIDIA GPUs by the minute with live pricing, bare-metal access, SSH root, and a dedicated IP. Deploy H100, H200, B200, B300, GH200, A100, L40S, RTX PRO 6000, RTX 5090, or RTX 4090 in under 2 minutes. No contracts, no warm-up charges, no hidden fees.

GPUs10+
Deploy in< 2 min
Starting from$0.58/hr
BillingPer-minute

GPU rental pricing

Live per-GPU hourly pricing across the full Spheron catalog. Every rate is per-minute billed with no minimum commit and no warm-up charges. Reserved multi-GPU clusters get deeper discounts; talk to sales for quotes.

RTX 4090Ada Lovelace
From$0.58/hr
VRAM24 GB
Best forDev, experimentation, small-model inference
Rent RTX 4090
RTX 5090Blackwell
From$0.68/hr
VRAM32 GB
Best forBudget inference, prototyping, single-GPU dev
Rent RTX 5090
L40SAda Lovelace
From$0.69/hr
VRAM48 GB
Best forInference serving, video/vision, rendering
Rent L40S
A100Ampere
From$0.72/hr
VRAM80 GB
Best forFine-tuning, mid-scale training, stable inference
Rent A100
RTX PRO 6000Blackwell
From$1.07/hr
VRAM96 GB
Best forProduction inference, rendering, visual workloads
Rent RTX PRO 6000
H100Hopper
From$1.33/hr
VRAM80 GB
Best forLLM training, HPC, large-scale inference
Rent H100
H200Hopper
From$1.56/hr
VRAM141 GB
Best forLong-context LLM inference, 70B+ model serving
Rent H200
GH200Grace Hopper
From$1.88/hr
VRAM96 GB
Best forCPU-GPU coherent workloads, graph AI, vector search
Rent GH200
B200Blackwell
From$2.25/hr
VRAM192 GB
Best forLarge-model training, FP4/FP8 inference
Rent B200
B300Blackwell Ultra
From$3.50/hr
VRAM288 GB
Best forFrontier training, trillion-parameter models
Rent B300

Pricing updates from live inventory. Spot availability varies by region and time of day. See each GPU page for per-minute rates, multi-GPU node pricing, and InfiniBand cluster options.

How GPU rental works on Spheron

Vetted data center capacity, exposed as VMs or bare-metal instances. You pick, you deploy, you pay for minutes used. No approval queue, no warm-up billing, no hypervisor tax.

Deployment flow< 2 min
01
Pick GPU
10 SKUs, RTX 4090 to B300
02
Pick region
North America, Europe, Asia
03
Click deploy
No queue, no approval
04
SSH in
Root access, dedicated IP

Dedicated vs spot

Both on-demand. Both bare-metal. Per-minute billed.
Dedicated99.99% SLA

Runs until you stop it. The provider cannot reclaim the node, so you pay a fixed hourly rate and keep the instance as long as you need it.

Use for
  • ·Production inference endpoints
  • ·Interactive development
  • ·Long-running training
Spot30–60% off

Runs on spare capacity at a deep discount. Interruptible when the provider reclaims that capacity. Each GPU page posts both rates so you can compare before you launch.

Use for
  • ·Checkpointable training runs
  • ·Batch inference
  • ·Hyperparameter sweeps

Per-minute billing

Billing starts when your instance reports ready and stops the moment you terminate. No minimum run time, no rounding up to the hour, no charge for boot time.

20-min bench= 20 min billed

Multi-GPU & interconnect

Single to 8x nodes with NVLink on H100, H200, B200, B300, and A100. Beyond 8 GPUs, InfiniBand clusters with RDMA and NCCL-tuned topology.

IB fabric400 Gb/s NDR

Bare-metal access

  • ·SSH root, dedicated public IP
  • ·Ubuntu 22.04 + CUDA preinstalled
  • ·Docker + NVIDIA Container Toolkit
  • ·No hypervisor overhead, no noisy neighbors
GPU Infrastructure

Ready to Deploy?

Deploy enterprise-grade GPU instances in minutes with instant provisioning and bare-metal performance. No contracts, no commitments, no hidden fees, pay only for what you use.

Deploy Time
< 2 min
Uptime SLA
99.9%
GPU Models
10+
Starting At
$0.58/hr