Microsoft and NVIDIA will jointly deliver AI supercomputers

Organizations are increasingly adopting artificial intelligence (AI) because it helps them gain insights that accelerate innovation and improve business results. An AI-first infrastructure is critical to developing and deploying AI applications.

Microsoft Azure will partner with NVIDIA to deliver purpose-built AI supercomputers in the cloud to handle the most demanding real-world workloads while meeting price/performance and time-to-solution requirements. They come with advanced machine learning tools that will help you integrate AI into your own work, making your simulations smarter and your decisions more intelligent.

These supercomputers are powered by Microsoft Azure’s advanced supercomputing infrastructure and NVIDIA GPUs. They will help enterprises train, deploy, and scale AI, including large, sophisticated models. Azure’s cloud-based AI supercomputers consist of powerful, scalable ND- and NC-series virtual machines optimized for distributed AI training and inference. NVIDIA, in turn, will use Azure’s scalable virtual machine instances to research and accelerate advances in generative AI.

Microsoft and NVIDIA are also teaming up to improve Microsoft’s DeepSpeed deep learning optimization software. NVIDIA will make its full stack of AI workflows and software development kits available to Azure enterprise customers.

Scalable peak performance for AI training with NVIDIA Compute and Quantum-2 InfiniBand on Azure

Microsoft Azure virtual machine instances optimized for AI use the most advanced data center GPUs from NVIDIA. They are the first public cloud instances with NVIDIA Quantum-2 400Gb/s InfiniBand networking. This networking lets customers deploy thousands of GPUs in a single cluster to train large language models, build complex recommendation systems, and enable generative AI.

Current Azure instances have NVIDIA Quantum 200Gb/s InfiniBand networking and NVIDIA A100 GPUs. Future instances will have NVIDIA Quantum-2 400Gb/s InfiniBand networking and NVIDIA H100 GPUs. These new instances will integrate with Azure’s advanced cloud infrastructure, networking, and storage to provide scalable peak performance for AI training and deep learning inference workloads of all sizes.
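As a rough illustration of what the move from 200Gb/s to 400Gb/s means, the Python sketch below estimates the ideal time to send one copy of a model's weights over a single link. The 1-billion-parameter model, the 2 bytes per parameter, and the helper name transfer_seconds are assumptions made for this example, not figures from Microsoft or NVIDIA. Real clusters also add latency, protocol overhead, and collective-communication patterns that this calculation ignores.

```python
def transfer_seconds(num_params: int, bytes_per_param: int, link_gbps: float) -> float:
    """Ideal seconds to send every parameter once over one link (no overhead)."""
    total_bits = num_params * bytes_per_param * 8
    return total_bits / (link_gbps * 1e9)


# Hypothetical workload: 1 billion parameters stored in 16-bit precision.
PARAMS = 1_000_000_000
BYTES_PER_PARAM = 2

for label, gbps in [("Quantum, 200Gb/s", 200.0), ("Quantum-2, 400Gb/s", 400.0)]:
    ms = transfer_seconds(PARAMS, BYTES_PER_PARAM, gbps) * 1000
    print(f"{label}: {ms:.0f} ms")
```

Under these assumptions the ideal transfer takes about 80 ms at 200Gb/s and about 40 ms at 400Gb/s: doubling the link speed halves the time spent moving the same data.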

Microsoft Azure’s AI-first cloud infrastructure and its toolchain with NVIDIA are having a big impact in retail. With a GPU-accelerated computing platform, customers can quickly work through candidate models to identify the best performer.

Accelerated development and deployment of AI

The cloud-based AI supercomputer will support a wide variety of AI applications and services, including Microsoft DeepSpeed and the NVIDIA AI Enterprise software suite.

Microsoft DeepSpeed will use the NVIDIA H100 Transformer Engine to accelerate the transformer-based models used for large language models, generative AI, and code generation, among other applications. The engine brings 8-bit floating-point (FP8) precision to DeepSpeed, which dramatically speeds up AI calculations for transformers.
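To give a feel for what 8-bit floating-point precision trades away, the Python sketch below rounds ordinary numbers to the nearest value representable in one common FP8 layout. It assumes the E4M3 format (1 sign bit, 4 exponent bits, 3 mantissa bits, largest finite value 448) and clamps out-of-range inputs to that maximum. The function name round_to_e4m3 is hypothetical, and this is a pure-Python illustration of the number format only, not the Transformer Engine or DeepSpeed API.

```python
import math


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest value in an assumed FP8 E4M3 format."""
    if x == 0.0 or math.isnan(x):
        return x
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), 448.0)      # assumption: saturate at the largest finite value
    _, e = math.frexp(mag)        # mag = m * 2**e with 0.5 <= m < 1
    exponent = max(e - 1, -6)     # -6 is the smallest normal exponent (bias 7)
    step = 2.0 ** (exponent - 3)  # 3 mantissa bits give 8 steps per power of two
    return sign * round(mag / step) * step


for value in [0.3, 3.14159, 100.0, 1000.0]:
    print(value, "->", round_to_e4m3(value))
```

With only three mantissa bits, each power-of-two interval holds just eight representable values, so 0.3 becomes 0.3125 and 3.14159 becomes 3.25. Hardware can process such compact values faster than 16-bit ones, at the cost of this coarser rounding.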

NVIDIA AI Enterprise, the globally adopted software suite of the NVIDIA AI platform, is certified and supported on Microsoft Azure instances with NVIDIA A100 GPUs. Support for Azure instances with NVIDIA H100 GPUs will be added in the future.

NVIDIA AI Enterprise includes the NVIDIA Riva framework for voice AI and the NVIDIA Morpheus framework for cybersecurity. The suite helps streamline every step of the AI workflow, from data processing and AI model training to simulation and large-scale deployment.
