GPU as a Service
Work faster and more efficiently by accessing GPU power in a way that fits your business needs.
Packages
On-Demand
On-Demand is the most flexible model for accessing H200 GPU power. Clients pay only for the compute time they use, billed by the hour, with no upfront costs or long-term contracts.
This plan is best suited for:
- New customers who want to try the platform without committing to a long-term plan.
- Businesses with unpredictable workloads that need a plan that can adapt to their demands.
- Developers and researchers who are running short-term tests and prototyping new models.
Reserved
The Reserved model is a strategic commitment. By committing to a specific amount of H200 GPU capacity for a term of 1 to 5 years, clients receive a substantial discount on the hourly rate. This also guarantees that the capacity they have paid for will always be available for their workloads.
This plan is best suited for:
- Businesses with stable production workloads that require applications to run around the clock.
- Budget-conscious organizations that want highly predictable and manageable costs.
- Companies with mission-critical applications and AI services that can’t be without compute power.
Spot
Spot instances run on our unused H200 GPU capacity, which makes them the most cost-effective way to access powerful GPU resources. Their most critical aspect is that they can be interrupted, or “pre-empted”, on very short notice: if we need that capacity to serve an On-Demand or Reserved customer, the Spot instance will be terminated.
This plan is best suited for:
- Highly cost-sensitive projects, such as start-ups or academic research, that need to process massive amounts of data on a tight budget.
- Fault-tolerant tasks that can be paused and resumed (e.g. video rendering or scientific simulations).
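To illustrate what "paused and resumed" can look like in practice, here is a minimal Python sketch of a job that saves a checkpoint after every work item, so that a replacement Spot instance can pick up where the terminated one stopped. This page does not specify how a pre-emption is signalled, so the sketch assumes the process receives SIGTERM shortly before termination. The names CheckpointedJob, request_stop and the JSON checkpoint layout are hypothetical and not part of the platform.

```python
import json
import os
import signal
import tempfile


class CheckpointedJob:
    """Processes items in order and saves progress after each one.

    Hypothetical helper for illustration; not a platform API.
    """

    def __init__(self, checkpoint_path):
        self.checkpoint_path = checkpoint_path
        self.stop_requested = False

    def request_stop(self, signum=None, frame=None):
        # Usable as a signal handler: only sets a flag, so the item
        # currently being processed can finish and be checkpointed.
        self.stop_requested = True

    def load_state(self):
        # Resume from the saved checkpoint if one exists.
        if os.path.exists(self.checkpoint_path):
            with open(self.checkpoint_path) as f:
                return json.load(f)
        return {"next_index": 0, "total": 0}

    def save_state(self, state):
        # Write to a temporary file, then rename it over the checkpoint,
        # so an interruption mid-write cannot leave a corrupt file.
        directory = os.path.dirname(os.path.abspath(self.checkpoint_path))
        fd, tmp_path = tempfile.mkstemp(dir=directory)
        with os.fdopen(fd, "w") as f:
            json.dump(state, f)
        os.replace(tmp_path, self.checkpoint_path)

    def run(self, items, work):
        """Returns True if all items were processed, False if stopped early."""
        state = self.load_state()
        for index in range(state["next_index"], len(items)):
            if self.stop_requested:
                return False
            state["total"] += work(items[index])
            state["next_index"] = index + 1
            self.save_state(state)
        return True


def install_sigterm_handler(job):
    # Assumption: the pre-emption notice arrives as SIGTERM.
    signal.signal(signal.SIGTERM, job.request_stop)
```

For the checkpoint to survive a pre-emption, the checkpoint file would need to live on storage that outlasts the instance. Saving after every item keeps lost work small; for very fast items, saving every N items may be a better trade-off.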