What Integrates with NVIDIA DGX Cloud Serverless Inference?
Find out what NVIDIA DGX Cloud Serverless Inference integrations exist in 2025. Learn what software and services currently integrate with NVIDIA DGX Cloud Serverless Inference, and sort them by reviews, cost, features, and more. Below is a list of products that NVIDIA DGX Cloud Serverless Inference currently integrates with:
1
Google Cloud Platform
Google
Free ($300 in free credits)
56,320 Ratings
Google Cloud is an online service that lets you create everything from simple websites to complex applications for businesses of any size. New customers receive $300 in credits for testing, deploying, and running workloads, and more than 25 products are available free of charge. Its core data analytics and machine learning services are secure, fully featured, and suitable for enterprises of any size. Use big data to build better products and find answers faster, and grow from prototype to production and even planet scale without worrying about reliability, capacity, or performance. Offerings range from virtual machines with proven price/performance advantages to a fully managed app development platform, alongside high-performance, scalable, resilient object storage and databases. Google's private fibre network delivers the latest software-defined networking solutions, complemented by fully managed data warehousing, data exploration, Hadoop/Spark, and messaging services.
2
Microsoft Azure
Microsoft
21 Ratings
Microsoft Azure serves as a versatile cloud computing platform that facilitates swift and secure development, testing, and management of applications. With Azure, you can innovate purposefully, transforming your concepts into actionable solutions through access to over 100 services that enable you to build, deploy, and manage applications in various environments, whether in the cloud, on-premises, or at the edge, utilizing your preferred tools and frameworks. The continuous advancements from Microsoft empower your current development needs while also aligning with your future product aspirations. Committed to open-source principles and accommodating all programming languages and frameworks, Azure allows you the freedom to build in your desired manner and deploy wherever it suits you best. Whether you're operating on-premises, in the cloud, or at the edge, Azure is ready to adapt to your current setup. Additionally, it offers services tailored for hybrid cloud environments, enabling seamless integration and management. Security is a foundational aspect, reinforced by a team of experts and proactive compliance measures that are trusted by enterprises, governments, and startups alike. Ultimately, Azure represents a reliable cloud solution, backed by impressive performance metrics that validate its trustworthiness. This platform not only meets your needs today but also equips you for the evolving challenges of tomorrow.
3
Amazon Web Services (AWS)
Amazon
If you're in need of computing power, database solutions, content distribution, or various other functionalities, AWS offers a wide array of services designed to assist you in developing advanced applications with enhanced flexibility, scalability, and reliability. Amazon Web Services (AWS) stands as the most extensive and widely utilized cloud platform globally, boasting over 175 fully functional services spread across data centers worldwide. A diverse range of customers, from rapidly expanding startups to major corporations and prominent government bodies, are leveraging AWS to reduce expenses, enhance agility, and accelerate innovation. AWS provides a larger selection of services, along with more features within those services, compared to any other cloud provider, covering everything from fundamental infrastructure technologies like computing, storage, and databases to cutting-edge innovations such as machine learning, artificial intelligence, data lakes, analytics, and the Internet of Things. This breadth of offerings facilitates a quicker, simpler, and more cost-effective transition of your current applications to the cloud, ensuring that you can stay ahead in a competitive landscape while taking advantage of the latest technological advancements.
4
Splunk Cloud Platform
Splunk
1 Rating
Transforming data into actionable insights is made simple with Splunk, which is securely and reliably managed as a scalable service. By entrusting your IT backend to our Splunk specialists, you can concentrate on leveraging your data effectively. The infrastructure, provisioned and overseen by Splunk, offers a seamless, cloud-based data analytics solution that can be operational in as little as 48 hours. Regular software upgrades guarantee that you always benefit from the newest features and enhancements. You can quickly harness the potential of your data in just a few days, with minimal prerequisites for translating data into actionable insights. Meeting FedRAMP security standards, Splunk Cloud empowers U.S. federal agencies and their partners to make confident decisions and take decisive actions at mission speeds. Enhance productivity and gain contextual insights with the mobile applications and natural language features offered by Splunk, allowing you to extend the reach of your solutions effortlessly. Whether managing infrastructure or ensuring data compliance, Splunk Cloud is designed to scale effectively, providing you with robust solutions that adapt to your needs. Ultimately, this level of agility and efficiency can significantly enhance your organization's operational capabilities.
5
Helm
Helm
Helm is compatible with GNU/Linux, macOS, and Windows operating systems. You can utilize Helm as a standalone synthesizer or as a plugin in various formats such as LV2, VST, VST3, or AU, and it is available in both 32-bit and 64-bit versions. This flexibility allows you to use Helm anywhere without concerns about digital rights management (DRM), and it empowers you to examine, modify, and share the source code, whether in its original form or altered. As a software synthesizer, Helm enables users to generate electronic music directly from their computers. Its philosophy of being "free as in freedom" means you have complete control over the software rather than being controlled by it. In financial terms, Helm operates on a "pay what you want" model, giving you the option to use it without any payment if you choose. Moreover, any sounds produced by Helm are owned by the user, granting you copyright over every sound you create. You can easily toggle various modules on or off using the small power buttons located in the top left corner of the interface. Among these modules, the SUB module serves as one of Helm's primary sound generators, managing a single oscillator that typically plays an octave lower than the note currently being struck. This intuitive design ensures that users can quickly experiment and craft their unique sounds.
6
Nebius
Nebius
$2.66/hour
A robust platform optimized for training is equipped with NVIDIA® H100 Tensor Core GPUs, offering competitive pricing and personalized support. Designed to handle extensive machine learning workloads, it allows for efficient multi-host training across thousands of H100 GPUs interconnected via the latest InfiniBand network, achieving speeds of up to 3.2 Tb/s per host. Users benefit from significant cost savings, with at least a 50% reduction in GPU compute expenses compared to leading public cloud services*, and additional savings are available through GPU reservations and bulk purchases. To facilitate a smooth transition, we promise dedicated engineering support that guarantees effective platform integration while optimizing your infrastructure and deploying Kubernetes. Our fully managed Kubernetes service streamlines the deployment, scaling, and management of machine learning frameworks, enabling multi-node GPU training with ease. Additionally, our Marketplace features a variety of machine learning libraries, applications, frameworks, and tools designed to enhance your model training experience. New users can take advantage of a complimentary one-month trial period, ensuring they can explore the platform's capabilities effortlessly. This combination of performance and support makes it an ideal choice for organizations looking to elevate their machine learning initiatives.
7
NVIDIA NIM
NVIDIA
Investigate the most recent advancements in optimized AI models, link AI agents to data using NVIDIA NeMo, and deploy solutions seamlessly with NVIDIA NIM microservices. NVIDIA NIM comprises user-friendly inference microservices that enable the implementation of foundation models across various cloud platforms or data centers, thereby maintaining data security while promoting efficient AI integration. Furthermore, NVIDIA AI offers access to the Deep Learning Institute (DLI), where individuals can receive technical training to develop valuable skills, gain practical experience, and acquire expert knowledge in AI, data science, and accelerated computing. AI models produce responses based on sophisticated algorithms and machine learning techniques; however, these outputs may sometimes be inaccurate, biased, harmful, or inappropriate. Engaging with this model comes with the understanding that you accept the associated risks of any potential harm stemming from its responses or outputs. As a precaution, refrain from uploading any sensitive information or personal data unless you have explicit permission, and be aware that your usage will be tracked for security monitoring.
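To make the integration concrete: a deployed NIM microservice serves an OpenAI-compatible HTTP API, so invoking it amounts to posting a standard chat-completion request. The sketch below only assembles such a request body; the base URL and model name are illustrative assumptions, not values from this page.

```python
import json

# Sketch only: a running NIM microservice exposes an OpenAI-compatible API.
# The base URL and model name here are illustrative assumptions.
NIM_BASE_URL = "http://localhost:8000/v1"  # assumed local NIM deployment

def build_chat_request(model: str, prompt: str, max_tokens: int = 64) -> dict:
    """Assemble the JSON body for POST {NIM_BASE_URL}/chat/completions."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

body = build_chat_request("meta/llama3-8b-instruct", "Summarize NIM in one line.")
print(json.dumps(body, indent=2))

# An actual call would look roughly like (not executed here):
#   requests.post(f"{NIM_BASE_URL}/chat/completions", json=body,
#                 headers={"Authorization": f"Bearer {api_key}"})
```

Because the API follows the OpenAI schema, existing OpenAI client libraries can usually be pointed at a NIM endpoint by overriding their base URL.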
8
Oracle Cloud Infrastructure
Oracle
Oracle Cloud Infrastructure not only accommodates traditional workloads but also provides advanced cloud development tools for modern needs. It is designed with the capability to identify and counteract contemporary threats, empowering innovation at a faster pace. By merging affordability with exceptional performance, it effectively reduces total cost of ownership. As a Generation 2 enterprise cloud, Oracle Cloud boasts impressive compute and networking capabilities while offering an extensive range of infrastructure and platform cloud services. Specifically engineered to fulfill the requirements of mission-critical applications, Oracle Cloud seamlessly supports all legacy workloads, allowing businesses to transition from their past while crafting their future. Notably, our Generation 2 Cloud is uniquely equipped to operate Oracle Autonomous Database, recognized as the industry's first and only self-driving database. Furthermore, Oracle Cloud encompasses a wide-ranging portfolio of cloud computing solutions, spanning application development, business analytics, data management, integration, security, artificial intelligence, and blockchain technology, ensuring that businesses have all the tools they need to thrive in a digital landscape. This comprehensive approach positions Oracle Cloud as a leader in the evolving cloud marketplace.
9
Yotta
Yotta
Alloy propels asset-rich environments into the future by providing innovative connected asset management solutions. It stands out as the sole platform capable of effortlessly linking individuals, systems, and assets together. The solution is designed to be scalable across various asset types while integrating seamlessly with all existing systems through robust APIs. This encompasses every physical and organizational structure necessary for your enterprise's operations. As the volume of data continues to expand, Alloy aids in the effective management of this data, empowering you to make more informed and timely decisions. We ensure that your team has access to crucial information that enables them to make the right choices when it matters most. Additionally, Alloy offers a versatile and mobile asset management platform that delivers valuable operational insights related to your grounds management strategy, ensuring your enterprise remains efficient and responsive in an ever-evolving environment. With Alloy, you can confidently navigate the complexities of asset management and drive your organization forward.
10
CoreWeave
CoreWeave
CoreWeave stands out as a cloud infrastructure service that focuses on GPU-centric computing solutions specifically designed for artificial intelligence applications. Their platform delivers scalable, high-performance GPU clusters that enhance both training and inference processes for AI models, catering to sectors such as machine learning, visual effects, and high-performance computing. In addition to robust GPU capabilities, CoreWeave offers adaptable storage, networking, and managed services that empower AI-focused enterprises, emphasizing reliability, cost-effectiveness, and top-tier security measures. This versatile platform is widely adopted by AI research facilities, labs, and commercial entities aiming to expedite their advancements in artificial intelligence technology. By providing an infrastructure that meets the specific demands of AI workloads, CoreWeave plays a crucial role in driving innovation across various industries.
11
NVIDIA DGX Cloud
NVIDIA
The NVIDIA DGX Cloud provides an AI infrastructure as a service that simplifies the deployment of large-scale AI models and accelerates innovation. By offering a comprehensive suite of tools for machine learning, deep learning, and HPC, this platform enables organizations to run their AI workloads efficiently on the cloud. With seamless integration into major cloud services, it offers the scalability, performance, and flexibility necessary for tackling complex AI challenges, all while eliminating the need for managing on-premise hardware.
12
NVIDIA AI Foundations
NVIDIA
Generative AI is transforming nearly every sector by opening up vast new avenues for knowledge and creative professionals to tackle some of the most pressing issues of our time. NVIDIA is at the forefront of this transformation, providing a robust array of cloud services, pre-trained foundation models, and leading-edge frameworks, along with optimized inference engines and APIs, to integrate intelligence into enterprise applications seamlessly. The NVIDIA AI Foundations suite offers cloud services that enhance generative AI capabilities at the enterprise level, allowing for tailored solutions in diverse fields such as text processing (NVIDIA NeMo™), visual content creation (NVIDIA Picasso), and biological research (NVIDIA BioNeMo™). By leveraging the power of NeMo, Picasso, and BioNeMo through NVIDIA DGX™ Cloud, organizations can fully realize the potential of generative AI. This technology is not just limited to creative endeavors; it also finds applications in generating marketing content, crafting narratives, translating languages globally, and synthesizing information from various sources, such as news articles and meeting notes. By harnessing these advanced tools, businesses can foster innovation and stay ahead in an ever-evolving digital landscape.
13
NVIDIA Cloud Functions
NVIDIA
NVIDIA Cloud Functions (NVCF) is a serverless API tailored for deploying and managing AI tasks on GPUs, ensuring security, scalability, and dependable performance. It accommodates various access methods, including HTTP polling, HTTP streaming, and gRPC protocols, for interacting with workloads. Primarily, Cloud Functions is optimized for brief, preemptable tasks such as inferencing and model fine-tuning. Users can choose between two types of functions, "Container" and "Helm Chart", enabling them to customize functions according to their specific needs. Since workloads are transient and preemptable, it is crucial for users to save their progress diligently. Additionally, models, containers, Helm charts, and other essential resources are stored in and retrieved from the NGC Private Registry. To begin utilizing NVCF, users can refer to the quickstart guide for functions, which outlines a comprehensive workflow for establishing and launching a container-based function utilizing the fastapi_echo_sample container. This resource not only highlights the ease of setup but also encourages users to explore the full potential of NVIDIA's serverless infrastructure.
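The HTTP-polling access method mentioned above can be sketched as a simple URL-and-headers builder. The endpoint paths and the `NVCF-REQID` header name below reflect NVCF's public invocation API as best understood here; treat them as assumptions to verify against the current NVCF documentation, and `function_id`/`api_key` are placeholders.

```python
# Minimal sketch of NVCF's HTTP-polling invocation flow. Endpoint paths and
# header names are assumptions to verify against the official NVCF docs.
NVCF_BASE = "https://api.nvcf.nvidia.com/v2/nvcf"

def invoke_url(function_id: str) -> str:
    """URL for invoking (or enqueueing) a deployed function."""
    return f"{NVCF_BASE}/pexec/functions/{function_id}"

def status_url(request_id: str) -> str:
    """URL polled while a long-running invocation is still pending."""
    return f"{NVCF_BASE}/pexec/status/{request_id}"

def auth_headers(api_key: str) -> dict:
    """Bearer-token headers expected by the NVCF API."""
    return {"Authorization": f"Bearer {api_key}", "Accept": "application/json"}

# Typical flow (not executed here):
#   resp = requests.post(invoke_url(fid), json=payload, headers=auth_headers(key))
#   if resp.status_code == 202:                  # still running: poll for result
#       req_id = resp.headers["NVCF-REQID"]      # assumed response header name
#       resp = requests.get(status_url(req_id), headers=auth_headers(key))
```

The 202-then-poll pattern matters because NVCF workloads are preemptable and may outlive a single synchronous HTTP request.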
14
Llama
Meta
Llama (Large Language Model Meta AI) stands as a cutting-edge foundational large language model aimed at helping researchers push the boundaries of their work within this area of artificial intelligence. By providing smaller yet highly effective models like Llama, the research community can benefit even if they lack extensive infrastructure, thus promoting greater accessibility in this dynamic and rapidly evolving domain. Creating smaller foundational models such as Llama is advantageous in the landscape of large language models, as it demands significantly reduced computational power and resources, facilitating the testing of innovative methods, confirming existing research, and investigating new applications. These foundational models leverage extensive unlabeled datasets, making them exceptionally suitable for fine-tuning across a range of tasks. We are offering Llama in multiple sizes (7B, 13B, 33B, and 65B parameters), accompanied by a detailed Llama model card that outlines our development process while adhering to our commitment to Responsible AI principles. By making these resources available, we aim to empower a broader segment of the research community to engage with and contribute to advancements in AI.