NVIDIA Blackwell

NVIDIA B200 GPU Cloud

Reserve NVIDIA Blackwell B200 capacity on VESSL Cloud — 192GB HBM3e, 8 TB/s bandwidth, and FP4 acceleration for frontier-scale training and real-time inference.

Reserve capacity

NVIDIA B200

GPU memory: 192GB HBM3e
Memory bandwidth: 8 TB/s

Technical specifications

Architecture: Blackwell
GPU memory: 192GB HBM3e
Memory bandwidth: 8 TB/s
NVLink: 1.8 TB/s
FP8 (Tensor): 9 PFLOPS
FP4 (Tensor): 18 PFLOPS
Max TDP: 1,000W
GPUs per node: 8 (HGX B200)

*Peak performance with sparsity, per NVIDIA official specs. Final specs may vary by node configuration.

Pricing & availability

NVIDIA B200Available on request

Talk to sales

What's the B200 best for?

Frontier-scale pretraining

FP4 and FP8 throughput plus 1.8 TB/s NVLink make trillion-parameter model pretraining practical in a single Blackwell cluster.

High-throughput inference

192GB HBM3e and FP4 acceleration serve large models at high concurrency, with lower cost-per-token than Hopper for big deployments.

Multi-modal & generative video

Memory- and compute-heavy workloads — text-to-video, 3D generation, large multi-modal training — fit on a single HGX B200 node.

Compare NVIDIA data-center GPUs

	H100 Hopper	H200 Hopper	B200 You're viewing	B300 Blackwell
Architecture	Hopper	Hopper	Blackwell	Blackwell
GPU memory	80GB HBM3	141GB HBM3e	192GB HBM3e	up to 288GB HBM3e
Memory bandwidth	3.35 TB/s	4.8 TB/s	8 TB/s	8 TB/s
FP8 (Tensor)	3,958 TFLOPS	3,958 TFLOPS	9 PFLOPS	10 PFLOPS
Access	from $2.39/hr	Available on request	Available on request	Available on request
Best for	Cost-efficient training & inference	Long-context & large-model inference	Frontier-scale training (FP4)	Largest models & reasoning inference

Why industry-leading teams run GPUs on VESSL Cloud

No waitlists

Access capacity across clouds through one platform — skip quotas and procurement.

Scale to multi-node

Spin up a single GPU or scale to large multi-node clusters over high-speed InfiniBand — as much as you need.

Transparent pricing

Spot, on-demand, and reserved options with pay-as-you-go billing.

Enterprise-ready

SOC 2 Type II compliance, with dedicated support for production AI.

Frequently asked questions

How do I get access to NVIDIA B200 GPUs?

Blackwell capacity is allocated on request. Talk to our team and we'll secure B200 capacity matched to your timeline.

How much memory does the B200 have?

The B200 offers 192GB HBM3e per GPU at up to 8 TB/s memory bandwidth and 1.8 TB/s NVLink — roughly 2.4× the H100's memory and over 2× the bandwidth.

What's the difference between the B200 and B300?

The B300 (Blackwell Ultra) increases memory to up to 288GB HBM3e (vs 192GB on the B200) and adds roughly 1.5× FP4 compute — built for the largest models and high-concurrency reasoning.

How does the B200 compare to the H200?

The B200 is a generation ahead: Blackwell architecture, FP4 acceleration, 192GB HBM3e, and 8 TB/s bandwidth vs the Hopper H200's 141GB at 4.8 TB/s. Choose B200 for frontier-scale training and the lowest cost-per-token at scale.