DigitalOcean GPU Droplet Pricing 2026: H100, H200, and Gradient Inference Cost vs Spheron

DigitalOcean GPU Droplets put H100 hardware on a familiar cloud platform at approximately $3.39/hr per GPU, with per-second billing and a 5-minute minimum. The billing caveat matters: GPU Droplets keep charging while powered off. If your team keeps a Droplet alive overnight but idle, you pay for the full 24 hours. That single detail can double or triple your effective GPU cost compared to platforms that bill only for active compute. This post covers the DO GPU Droplet lineup, Gradient serverless inference pricing, hidden costs, and a direct cost comparison against Spheron's on-demand and spot rates.

TL;DR: DigitalOcean GPU Droplets vs Spheron H100/H200 (Jul 2026)

Item	DigitalOcean GPU Droplet	Spheron On-Demand	Spheron Spot
H100 80GB 1x (DO: HGX/SXM; Spheron: PCIe)	~$3.39/hr	$2.01/hr (PCIe)	N/A
H100 SXM 8x node, per GPU	~$3.39/hr	$4.06/hr (SXM5)	$1.49/hr (SXM5)
H200 SXM5	~$3.44/hr	$3.70/hr	$1.76/hr
Billing granularity	Per second (5-min min)	Per minute	Per minute
Powered-off billing	Yes, full rate	No	No
Minimum billing	5 minutes	1 minute	1 minute

At 100% utilization, DO H100 HGX at $3.39/hr is ~69% more expensive than Spheron H100 PCIe at $2.01/hr (note: different form factors, DO's H100 is HGX/SXM and Spheron's is PCIe). At 50% utilization (idle 12 hours/day on a kept-alive Droplet), the effective DO rate is $6.78/hr vs Spheron's $2.01/hr for actual active hours.

Pricing fluctuates based on GPU availability. The prices above are as of 03 Jul 2026 and may have changed. Check current GPU pricing for live rates.

DigitalOcean GPU Droplet Lineup

DigitalOcean launched GPU Droplets in 2024 under their "AI-Native Cloud" positioning. The lineup centers on NVIDIA H100 and H200 hardware, offered in single-GPU and 8-GPU node configurations.

SKU	GPU	VRAM	vCPUs	RAM	$/hr
GBH100x1	1x H100 HGX (80GB)	80 GB	20	120 GB	~$3.39
GBH100x8	8x H100 SXM (80GB)	640 GB	160	960 GB	~$27.12
H200 SXM5	1x H200 SXM5 (141GB)	141 GB	N/A	N/A	~$3.44

Note: DigitalOcean has also announced bare metal GPU options and is investing in expanded AI infrastructure, but self-serve bare metal GPU access is limited. Verify current availability at cloud.digitalocean.com when provisioning. The Gradient product line (acquired from Paperspace) sits alongside GPU Droplets for managed notebooks and serverless inference workloads.

DigitalOcean H100 GPU Droplet Pricing Per Hour

The GBH100x1 instance (1x H100 80GB, HGX/SXM architecture) runs approximately $3.39/hr. The 8x GPU node (GBH100x8) runs approximately $27.12/hr, which works out to the same $3.39/hr per GPU: there is no per-GPU discount or premium for the larger node.

Billing is per-second, which sounds granular but is undermined by the 5-minute minimum and, more critically, by the powered-off policy: a Droplet that is powered off but not destroyed continues to bill at the full per-second rate. DigitalOcean's billing documentation explicitly states that GPU Droplets reserve capacity in the data center even when powered off, and they charge for that reservation.
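To make the effect of the billing floor concrete, here is a minimal sketch of how a short job might be billed under each scheme. The function names are illustrative, and the assumption that per-minute billing rounds up to the next whole minute is mine, not something either provider's documentation is quoted as saying here.

```python
import math


def billed_seconds_per_second(run_seconds: float, minimum_seconds: int = 300) -> int:
    """Per-second billing with a floor (5 minutes for GPU Droplets, per the text)."""
    return max(minimum_seconds, math.ceil(run_seconds))


def billed_seconds_per_minute(run_seconds: float, minimum_seconds: int = 60) -> int:
    """Per-minute billing with a 1-minute floor.

    Assumption: partial minutes round up to the next whole minute.
    """
    whole_minutes = math.ceil(run_seconds / 60)
    return max(minimum_seconds, whole_minutes * 60)


def job_cost(billed_seconds: int, hourly_rate: float) -> float:
    """Convert billed seconds into dollars at a given $/hr rate."""
    return billed_seconds / 3600 * hourly_rate


if __name__ == "__main__":
    run = 90  # a 90-second evaluation run
    do_cost = job_cost(billed_seconds_per_second(run), 3.39)
    spheron_cost = job_cost(billed_seconds_per_minute(run), 2.01)
    print(f"DO: ${do_cost:.4f}  Spheron PCIe: ${spheron_cost:.4f}")
```

For a 90-second run, the floor means the Droplet is billed for 300 seconds while the per-minute scheme bills 120 seconds under the rounding assumption above.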

For teams running GPUs for 8 hours per day and keeping the Droplet alive overnight:

DO charges: 24 hours/day at $3.39/hr = $81.36/day per GPU
Spheron charges: 8 hours/day at $2.01/hr = $16.08/day per GPU (on-demand PCIe)
Difference: DO costs ~5x more per active GPU-hour at 33% utilization
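The arithmetic above can be sketched as a small helper that converts a list rate into an effective rate per active hour. The function name and parameters are illustrative only; the rates are the ones quoted in this post.

```python
def effective_rate_per_active_hour(
    hourly_rate: float,
    active_hours_per_day: float,
    billed_hours_per_day: float,
) -> float:
    """Dollars paid per hour of actual GPU use.

    billed_hours_per_day is 24 for a kept-alive Droplet (powered off or not),
    and equals active_hours_per_day when billing stops at termination.
    """
    if active_hours_per_day <= 0:
        raise ValueError("active_hours_per_day must be positive")
    return hourly_rate * billed_hours_per_day / active_hours_per_day


if __name__ == "__main__":
    do = effective_rate_per_active_hour(3.39, active_hours_per_day=8, billed_hours_per_day=24)
    spheron = effective_rate_per_active_hour(2.01, active_hours_per_day=8, billed_hours_per_day=8)
    print(f"DO effective: ${do:.2f}/hr, Spheron effective: ${spheron:.2f}/hr, ratio {do / spheron:.2f}x")
```

Setting `active_hours_per_day=12` for the Droplet reproduces the $6.78/hr effective rate quoted for 50% utilization.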

To avoid idle charges on DO, you must destroy the Droplet after each session and re-provision before the next. That adds provisioning time, loses in-progress state, and adds friction for teams running intermittent workloads.

DigitalOcean H200 GPU Droplet Pricing

DigitalOcean H200 GPU Droplets launched in 2026 and are available self-serve at $3.44/GPU/hr. An 8x H200 SXM5 node runs approximately $27.52/hr. The same powered-off billing policy applies: destroy the Droplet to stop charges, because powering off without destroying still bills at the full rate.

Comparing DO H200 against Spheron H200 pricing:

DO H200: $3.44/hr on-demand, no spot option
Spheron H200 SXM5: $3.70/hr on-demand, $1.76/hr spot

On raw on-demand rate, DO H200 at $3.44/hr is about 7% cheaper than Spheron's $3.70/hr. That gap reverses once you factor in DO's powered-off billing for teams that don't run GPUs continuously. Spheron H200 spot at $1.76/hr cuts cost by nearly 49% versus DO's on-demand rate for workloads that tolerate preemption.

For training runs with checkpoint/resume, Spheron H200 spot at $1.76/hr is the more cost-effective path. For fully managed 24/7 inference workloads where you're already in the DO ecosystem and the Droplet never sits powered off, DO H200 at $3.44/hr is a viable alternative.
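Spot pricing only pays off if the work lost to preemption stays small. The sketch below estimates a training run's cost with a rework overhead and the overhead at which spot would stop being cheaper. The `overhead_fraction` parameter is a hypothetical input you would have to measure for your own workload; it is not a figure from either provider.

```python
def run_cost(compute_hours: float, hourly_rate: float, overhead_fraction: float = 0.0) -> float:
    """Cost of a run needing compute_hours of useful work.

    overhead_fraction is the extra time spent redoing work after preemptions
    (hypothetical; 0.15 means 15% more GPU-hours than an uninterrupted run).
    """
    if overhead_fraction < 0:
        raise ValueError("overhead_fraction must be non-negative")
    return compute_hours * (1 + overhead_fraction) * hourly_rate


def break_even_overhead(on_demand_rate: float, spot_rate: float) -> float:
    """Overhead fraction at which spot cost equals on-demand cost."""
    return on_demand_rate / spot_rate - 1


if __name__ == "__main__":
    hours = 100
    print(f"DO H200 on-demand: ${run_cost(hours, 3.44):.2f}")
    print(f"Spheron H200 spot, 15% rework: ${run_cost(hours, 1.76, 0.15):.2f}")
    print(f"Break-even rework overhead: {break_even_overhead(3.44, 1.76):.1%}")
```

At the rates quoted here, spot stays cheaper than DO on-demand unless preemptions nearly double the GPU-hours a run consumes.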

Verify current H200 Droplet availability and pricing at cloud.digitalocean.com.

DigitalOcean Gradient Serverless Inference Pricing

DigitalOcean Gradient Deployments offer serverless inference without managing GPU instances. You submit inference requests and pay per compute-second or per-token rather than per GPU-hour. Published rates for open-source models range from roughly $0.18 to $0.99 per million tokens depending on model size: smaller models like Ministral 3 14B run around $0.20/M tokens, mid-size models like Llama 3.3 70B around $0.65/M, and larger models like DeepSeek R1 Distill 70B around $0.99/M. Some larger models with separate input/output pricing reach $1.10/M input and $4.40/M output. Verify current rates at digitalocean.com/pricing before budgeting.

Gradient abstracts away GPU management: no SSH, no container orchestration, no instance lifecycle to manage. The trade-off is less control over the inference stack and per-token billing that can be expensive at scale.

Break-even math against a self-managed H100 Droplet at $3.39/hr:

Self-managed cost: $3.39/hr divided by tokens/hr processed
If vLLM on H100 processes 2M tokens/hr (realistic for a 7B model at reasonable batch size): cost = $3.39 / 2,000,000 = $0.0017 per 1,000 tokens = $1.70/million tokens
At 2M tokens/hr, the self-managed H100's effective $1.70/M is far above the ~$0.20/M of Gradient's cheapest model, so Gradient wins for low-throughput workloads until throughput is high enough to amortize the fixed $3.39/hr cost
Break-even: self-managed H100 at $3.39/hr matches Gradient's $0.20/M rate at ~17M tokens/hr sustained, and matches the $0.65/M mid-size rate at ~5M tokens/hr. Production batched inference workloads on 7B-70B models can realistically hit the 5M tokens/hr threshold; the 17M threshold is harder but achievable with continuous high-batch-size serving.
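The break-even figures above follow from two one-line formulas, sketched here so you can plug in your own throughput. Function names are illustrative; the throughput you pass in is whatever you measure on your own serving stack.

```python
def self_managed_cost_per_million(hourly_rate: float, tokens_per_hour: float) -> float:
    """Effective $/million tokens for a GPU billed at a flat hourly rate."""
    if tokens_per_hour <= 0:
        raise ValueError("tokens_per_hour must be positive")
    return hourly_rate / tokens_per_hour * 1_000_000


def break_even_tokens_per_hour(hourly_rate: float, price_per_million: float) -> float:
    """Sustained tokens/hr at which the flat hourly rate matches a per-token price."""
    if price_per_million <= 0:
        raise ValueError("price_per_million must be positive")
    return hourly_rate / price_per_million * 1_000_000


if __name__ == "__main__":
    rate = 3.39  # DO H100 Droplet, $/hr
    print(f"At 2M tokens/hr: ${self_managed_cost_per_million(rate, 2_000_000):.2f}/M")
    for price in (0.20, 0.65, 0.99):
        threshold = break_even_tokens_per_hour(rate, price) / 1_000_000
        print(f"Break-even vs ${price:.2f}/M: {threshold:.2f}M tokens/hr")
```

Note that this treats the Droplet as fully utilized; any idle billed time raises the break-even throughput proportionally.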

For exploratory use or low-traffic API endpoints, Gradient's serverless model avoids idle GPU cost. For sustained high-throughput production inference, self-managed H100 on a per-minute billing platform is the more cost-efficient path. For teams that need a middle ground, the vLLM production deployment guide covers setting up self-managed inference on bare-metal H100s with auto-scaling.

DigitalOcean GPU Hidden Costs

Beyond the listed per-hour rate, four cost categories can significantly inflate total DO GPU spend:

Powered-off billing. As covered above, GPU Droplets bill at full rate whether running or powered off. A team using GPUs 8 hours per day but keeping Droplets for scheduling convenience pays for 24 hours. At $3.39/hr per GPU, that inflates monthly cost from $814/month (8hr/day) to $2,441/month (24hr/day) per GPU. Spheron charges nothing for idle time since billing stops when you terminate the instance.

5-minute minimum billing. Per-second billing has a 5-minute floor. Short evaluation runs or single-batch inference jobs under 5 minutes bill as 5 minutes. For iterative development workflows with many short runs, this adds up.

Block storage volumes. Persistent data volumes are billed separately at DigitalOcean's standard volume rates (approximately $0.10/GB-month for standard SSD volumes). For large model weights or training datasets stored on persistent volumes, this can be substantial. A 1TB volume for checkpoint storage adds roughly $100/month on top of compute.

Egress. Each Droplet includes a free monthly outbound data transfer allowance of approximately 15TB per Droplet. Transfers above the included amount are billed at $0.01/GB. Downloading large checkpoint files to local storage or moving data between regions can exceed these limits for active training runs.
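A rough monthly estimate that folds these line items together might look like the sketch below. The default rates are the approximate figures quoted in this section, the 15TB allowance is treated as 15,000 GB (an assumption about units), and the function name is illustrative. The 5-minute floor is left out because it depends on how many short sessions you run.

```python
def monthly_do_cost(
    gpu_rate_per_hr: float = 3.39,
    billed_hours_per_day: float = 24,      # 24 if the Droplet is kept alive
    volume_gb: float = 0.0,
    egress_gb: float = 0.0,
    days: int = 30,
    volume_rate_per_gb_month: float = 0.10,  # approximate, per the text
    included_egress_gb: float = 15_000.0,    # ~15TB, assumed decimal GB
    egress_rate_per_gb: float = 0.01,
) -> dict:
    """Break a month's bill into compute, storage, and egress overage."""
    compute = gpu_rate_per_hr * billed_hours_per_day * days
    storage = volume_gb * volume_rate_per_gb_month
    egress = max(0.0, egress_gb - included_egress_gb) * egress_rate_per_gb
    return {
        "compute": compute,
        "storage": storage,
        "egress": egress,
        "total": compute + storage + egress,
    }


if __name__ == "__main__":
    bill = monthly_do_cost(volume_gb=1000, egress_gb=20_000)
    for item, dollars in bill.items():
        print(f"{item:>8}: ${dollars:,.2f}")
```

In this example compute dominates the bill, which is why the powered-off policy matters more than storage or egress for most teams.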

DigitalOcean vs Spheron: H100 and H200 Cost Per Hour

Feature	DigitalOcean H100 HGX	DigitalOcean H200 SXM5	Spheron H100 PCIe	Spheron H100 SXM5
On-demand $/hr	~$3.39	~$3.44	$2.01	$4.06
Spot/preemptible $/hr	None	None	N/A	$1.49
Billing granularity	Per second (5-min min)	Per second (5-min min)	Per minute	Per minute
Powered-off billing	Yes	Yes	No	No
Root/SSH access	Yes	Yes	Yes	Yes
Egress (included)	~15TB/month	~15TB/month	Included	Included
Minimum commitment	None	None	None	None

Spheron H100 PCIe at $2.01/hr is ~41% cheaper than DO's H100 HGX at ~$3.39/hr at 100% utilization (note: different form factors, DO's H100 is HGX/SXM and Spheron's at $2.01/hr is PCIe). For any utilization below 100% on a kept-alive Droplet, the gap widens. The powered-off billing difference is the defining cost factor for most workloads. For H200, Spheron spot at $1.76/hr beats DO's $3.44/hr on-demand by 49% for training workloads that can use checkpoint/resume.

When DO's managed simplicity justifies the premium:

Teams already deep in the DigitalOcean ecosystem (Managed Kubernetes via DOKS, Spaces object storage, Managed Databases, DO's App Platform) benefit from consolidated billing and the DO control plane. A single vendor for the entire stack can simplify ops significantly. Teams building on DOKS with GPU inference as one component of a larger DO-hosted application may find the per-GPU premium offset by workflow integration. DO's GPU Droplets also work with their Load Balancers and VPC networking, which matters for teams running inference behind a DO Load Balancer.

When Spheron wins on price:

Spheron is the better fit for teams with no DO ecosystem dependency, bursty training jobs, intermittent inference workloads, or a need for H200 spot access. Spheron's per-minute billing and lack of powered-off charges make it the clear choice for workloads where GPU utilization is below 100%. Spheron aggregates capacity from 5+ providers, giving broader GPU availability than any single-provider cloud. No quota process and provisioning in under 2 minutes make it easier to respond to urgent compute needs.

Break-Even Math: When DigitalOcean GPU Droplets Make Sense

Consider a team running one H100 GPU for ML workloads, comparing monthly cost at different utilization levels:

Daily Active Hours	Monthly DO Cost (1x H100, Droplet kept alive)	Monthly Spheron On-Demand Cost (PCIe)	Lower-Cost Option
24hr (100%)	$2,441	$1,447	Spheron wins
16hr (67%)	$2,441 (DO bills all 24hr)	$965	Spheron wins
8hr (33%)	$2,441 (DO bills all 24hr)	$482	Spheron wins at ~5x
4hr (17%)	$2,441 (DO bills all 24hr)	$241	Spheron wins at ~10x
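The table can be reproduced with a few lines, which also makes it easy to try other utilization levels. This is a sketch using the rates quoted in this post and a 30-day month; the `keep_alive` flag is an illustrative parameter for whether the Droplet is left in place between sessions.

```python
DO_H100_RATE = 3.39       # $/hr, DO H100 Droplet
SPHERON_PCIE_RATE = 2.01  # $/hr, Spheron H100 PCIe on-demand
DAYS_PER_MONTH = 30


def monthly_costs(active_hours_per_day: float, keep_alive: bool = True) -> tuple:
    """Return (DO monthly cost, Spheron monthly cost) for one GPU.

    With keep_alive=True the Droplet is billed for all 24 hours; with
    keep_alive=False it is destroyed after each session and billed only
    for active hours.
    """
    do_billed_hours = 24 if keep_alive else active_hours_per_day
    do_cost = DO_H100_RATE * do_billed_hours * DAYS_PER_MONTH
    spheron_cost = SPHERON_PCIE_RATE * active_hours_per_day * DAYS_PER_MONTH
    return do_cost, spheron_cost


if __name__ == "__main__":
    for hours in (24, 16, 8, 4):
        do_cost, spheron_cost = monthly_costs(hours)
        print(f"{hours:>2}hr/day  DO ${do_cost:,.0f}  Spheron ${spheron_cost:,.0f}  "
              f"ratio {do_cost / spheron_cost:.1f}x")
```

Passing `keep_alive=False` shows the destroy-and-reprovision case, where the gap narrows to the difference in list rates.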

Even the best case for DO, running GPUs 24/7 with no idle time, leaves Spheron's on-demand H100 PCIe at $2.01/hr about 41% below DO's $3.39/hr. On list prices, DO does not win at any utilization level; the only scenario where it can be the lower-cost option is a negotiated enterprise contract with significant volume discounts.

Engineering time is the other consideration: if your team is already on DigitalOcean and the workflow integration saves enough engineering time to offset $1.38/hr per GPU, DO becomes defensible. At 5 GPUs running 8hr/day, with Droplets destroyed after each session so that only active hours are billed, that gap comes to $1,656/month in compute. Whether consolidated billing and DO control plane integration save more than $1,656/month in team time is a judgment call.
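One way to frame that judgment call is to convert the compute gap into engineering hours. The sketch below does this under stated assumptions: the loaded engineering rate is a hypothetical input, and `do_billed_hours_per_day` is an illustrative parameter that defaults to billing only active hours.

```python
from typing import Optional


def monthly_compute_gap(
    gpus: int,
    active_hours_per_day: float,
    do_rate: float = 3.39,
    spheron_rate: float = 2.01,
    days: int = 30,
    do_billed_hours_per_day: Optional[float] = None,
) -> float:
    """Monthly DO cost minus Spheron cost for a fleet of GPUs.

    do_billed_hours_per_day defaults to active hours (Droplets destroyed
    after each session); pass 24 to model kept-alive Droplets.
    """
    if do_billed_hours_per_day is None:
        do_billed_hours_per_day = active_hours_per_day
    do_cost = do_rate * do_billed_hours_per_day * days * gpus
    spheron_cost = spheron_rate * active_hours_per_day * days * gpus
    return do_cost - spheron_cost


def engineering_hours_to_offset(gap_dollars: float, loaded_hourly_rate: float) -> float:
    """Hours of saved engineering time needed to cover the gap.

    loaded_hourly_rate is hypothetical; use your own team's figure.
    """
    return gap_dollars / loaded_hourly_rate


if __name__ == "__main__":
    gap = monthly_compute_gap(gpus=5, active_hours_per_day=8)
    print(f"Gap, active hours only: ${gap:,.0f}/month")
    kept_alive = monthly_compute_gap(gpus=5, active_hours_per_day=8, do_billed_hours_per_day=24)
    print(f"Gap, Droplets kept alive: ${kept_alive:,.0f}/month")
    print(f"Hours to offset at $100/hr (hypothetical): {engineering_hours_to_offset(gap, 100):.1f}")
```

If the Droplets are kept alive rather than destroyed, the gap to offset grows several times over, so the integration benefit has to be correspondingly larger.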

DigitalOcean vs Spheron: Deployment Comparison

Factor	DigitalOcean GPU Droplets	Spheron
Time-to-first-GPU	~2-5 minutes	Under 2 minutes
GPU access model	Cloud VMs (SSH, console)	Bare-metal from 5+ providers (SSH)
Managed Kubernetes	Yes (DOKS)	No native managed K8s
Serverless inference	Yes (Gradient)	No native serverless tier
Spot/preemptible	None for GPU Droplets	Yes (SXM5 spot at $1.49/hr H100, $1.76/hr H200)
API/SDK	DigitalOcean API/CLI	Spheron SDK, REST API
Multi-GPU interconnect	NVLink (8x SXM node)	NVLink (SXM configs)

DO's managed Kubernetes via DOKS is a genuine advantage for teams building multi-service applications. Spheron's raw per-minute bare-metal pricing and broader provider coverage are better fits for compute-heavy ML workloads.

Spheron Live H100 and H200 Rates (Jul 2026)

GPU	Model	On-Demand $/hr	Spot $/hr
H100 PCIe 80GB	H100 PCIe	$2.01	N/A
H100 SXM5 80GB	H100 SXM5	$4.06	$1.49
H200 SXM5 141GB	H200 SXM5	$3.70	$1.76

Spheron H100 instances offer bare-metal access from 5+ providers with per-minute billing. For H200, the on-demand rate is $3.70/hr and spot is $1.76/hr, available without a sales process or quota application.

DigitalOcean vs Other GPU Clouds

Provider	H100 On-Demand $/hr	Spot/Preemptible $/hr	Billing	Powered-Off Billing
DigitalOcean	~$3.39 (HGX/SXM)	None	Per second	Yes
Spheron (PCIe)	$2.01	N/A	Per minute	No
Spheron (SXM5)	$4.06	$1.49	Per minute	No
RunPod (Secure Cloud)	~$3.29	Available	Per minute	No
Lambda Labs	~$2.49-$3.44	None	Per hour	No
Nebius	$3.85	$2.15 (preemptible)	Per hour	No
OCI	$10.00	Available	Per hour	No
Vast.ai (Verified DC)	~$1.50-$2.27 (PCIe)	~$0.90-$1.60	Per hour	No

For the full provider comparison including AWS, Azure, GCP, CoreWeave, and Lambda, see the full GPU cloud pricing comparison 2026.

DigitalOcean GPU Droplets sit in the mid-range on raw per-hour price, but powered-off billing pushes their effective cost toward the top of this table for any team that keeps Droplets alive at well below 100% utilization. Vast.ai verified hosts are cheaper on paper for H100 PCIe but introduce host reliability variance and container-only access. Spheron H100 PCIe at $2.01/hr with per-minute billing and no powered-off charges is the most cost-effective managed option for intermittent or bursty GPU workloads.
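To see how the ranking shifts with utilization, the sketch below converts the on-demand rates from the table into an effective rate per active hour. Several simplifications are assumptions on my part: where the table gives a range the upper bound is used, the DO Droplet is assumed to be kept alive (so idle time is billed), and every other provider is assumed to bill only for active time. Billing granularity is ignored.

```python
# (on-demand $/hr, bills while powered off), taken from the table above.
# Ranges use the upper bound, which is an assumption of this sketch.
PROVIDERS = {
    "DigitalOcean": (3.39, True),
    "Spheron (PCIe)": (2.01, False),
    "Spheron (SXM5)": (4.06, False),
    "RunPod (Secure Cloud)": (3.29, False),
    "Lambda Labs": (3.44, False),
    "Nebius": (3.85, False),
    "OCI": (10.00, False),
    "Vast.ai (Verified DC)": (2.27, False),
}


def effective_rate(rate: float, bills_powered_off: bool, utilization: float) -> float:
    """Dollars per active GPU-hour at a given utilization (0 < utilization <= 1)."""
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    return rate / utilization if bills_powered_off else rate


def ranking(utilization: float) -> list:
    """Providers sorted from most to least expensive per active hour."""
    rows = [
        (name, effective_rate(rate, powered_off, utilization))
        for name, (rate, powered_off) in PROVIDERS.items()
    ]
    return sorted(rows, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    for utilization in (1.0, 0.5, 0.25):
        print(f"Utilization {utilization:.0%}")
        for name, rate in ranking(utilization):
            print(f"  {name:<24} ${rate:.2f}/active hr")
```

Under these assumptions the kept-alive Droplet climbs the list as utilization drops, while the other rows stay fixed at their list rates.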

DigitalOcean GPU Droplets cover managed simplicity at a premium, powered-off billing included. Spheron aggregates bare-metal H100 and H200 capacity from 5+ providers with true per-minute billing, no idle charges, and transparent on-demand and spot rates.

Quick Setup Guide

Check DigitalOcean GPU Droplet current pricing
Go to cloud.digitalocean.com and create a new Droplet, selecting the GPU Droplet category. Current H100 80GB GPU Droplet instances (HGX/SXM architecture) are listed at approximately $3.39/hr. Confirm the powered-off billing policy in the DO billing documentation before deploying, since the Droplet will charge whether running or powered off.
Calculate total DigitalOcean GPU cost including powered-off charges
Multiply your total reserved hours (not just active hours) by the per-hour rate. If your team uses GPUs 10 hours per day but keeps the Droplet for scheduling convenience, you pay for 24 hours. Monthly cost per H100 GPU: $3.39 * 24 * 30 = $2,441/month regardless of utilization. To avoid idle charges, you must destroy the Droplet and re-provision when needed, which adds setup overhead and loses any in-memory state.
Compare Spheron H100 and H200 pricing
Visit spheron.network/pricing/ and filter by H100 or H200. Spheron H100 PCIe is $2.01/hr on-demand with per-minute billing and no powered-off charges. H100 SXM5 is $4.06/hr on-demand or $1.49/hr spot. H200 SXM5 is $3.70/hr on-demand or $1.76/hr spot. Spot is suited for training runs with checkpoint/resume enabled.
Deploy on Spheron
Sign in at app.spheron.ai, select H100 or H200 from the GPU catalog, choose on-demand or spot, and deploy. SSH root access is available within minutes. Billing starts when the instance launches and stops when you terminate it. No minimum commitment and no powered-off charges.

Frequently Asked Questions

How much do DigitalOcean GPU Droplets cost per hour for H100?

DigitalOcean GPU Droplets with a single H100 80GB GPU (HGX/SXM architecture) start at approximately $3.39/hr. An 8x H100 SXM node runs approximately $27.12/hr, or $3.39/hr per GPU. Billing is per-second with a 5-minute minimum, but GPU Droplets continue to accrue charges while powered off. Spheron H100 PCIe starts at $2.01/hr on-demand with per-minute billing and no powered-off charges. Note that DO's H100 is HGX/SXM architecture while Spheron's H100 at $2.01/hr is PCIe, so these are different form factors.

What is DigitalOcean Gradient serverless inference pricing?

DigitalOcean Gradient Deployments offer serverless GPU inference billed per request or per compute-second rather than per GPU-hour. Published rates for open-source models range from roughly $0.18 to $0.99 per million tokens: smaller models like Ministral 3 14B run around $0.20/M tokens, mid-size models like Llama 3.3 70B around $0.65/M, and larger models like DeepSeek R1 Distill 70B around $0.99/M. Larger models with separate input/output pricing can reach $1.10/M input and $4.40/M output. Verify current rates at digitalocean.com/pricing before budgeting. A self-managed H100 at $3.39/hr breaks even against the cheapest Gradient model (~$0.20/M) at around 17 million tokens per hour, and against an average-priced model (~$0.65/M) at around 5 million tokens per hour.

Does DigitalOcean charge for powered-off GPU Droplets?

Yes. DigitalOcean GPU Droplets accrue per-second charges even when powered off, because the underlying GPU hardware remains reserved in the data center. To stop billing, you must destroy the Droplet. A team that powers off GPUs overnight (16 hours off per day) still pays for all 24 hours. This can inflate effective GPU cost by 2-3x compared to a platform that bills only for active compute.

What are the hidden costs on DigitalOcean GPU Droplets?

Beyond the hourly rate, watch for: powered-off billing (charged 24/7 even when idle), 5-minute minimum billing on each session, block storage volumes billed separately per GB-month, and network egress above the free ~15TB/month per Droplet allowance at $0.01/GB. For GPU Droplets on managed Kubernetes (DOKS), additional control plane and node pool fees apply.

How does DigitalOcean GPU pricing compare to Spheron?

DigitalOcean H100 HGX at ~$3.39/hr is ~69% more than Spheron H100 PCIe at $2.01/hr on-demand (note: different form factors, DO offers HGX/SXM and Spheron offers PCIe), and Spheron does not bill while powered off. Spheron H100 SXM5 runs $4.06/hr on-demand or $1.49/hr spot. DO H200 GPU Droplets are available at $3.44/hr, while Spheron H200 SXM5 is $3.70/hr on-demand or $1.76/hr spot. Spheron bills per minute, requires no destroy-to-stop-billing workflow, and aggregates capacity from 5+ providers.