White Paper No. EB1_V2
AI Reference Designs to Enable Adoption: A Collaboration Between Schneider Electric and NVIDIA
Schneider Electric and NVIDIA have developed reference designs to help data centers support high-density AI clusters, addressing power and cooling challenges while enhancing energy efficiency, reliability, and scalability, enabling seamless AI deployment in both retrofitted and new facilities.
The rise of artificial intelligence (AI) is reshaping industries with applications across healthcare, manufacturing, education, entertainment, and beyond. The adoption of AI demands highly specialized hardware, specifically accelerators like NVIDIA’s GPU-based compute solutions, to handle complex tasks in AI model training. NVIDIA has evolved from a GPU supplier to an infrastructure provider, offering comprehensive solutions such as server boards, SuperPODs (AI clusters), and software stacks to accelerate AI deployment. However, integrating these high-density AI clusters into traditional data centers poses significant challenges due to the limitations of conventional power, cooling, and physical infrastructure.
Schneider Electric and NVIDIA have joined forces to address these infrastructure challenges. They have developed reference designs for both retrofitting existing data centers and building new ones specifically designed to handle the unique demands of AI. These designs focus on creating reliable, high-density setups capable of managing AI workloads while remaining energy efficient.
For existing data centers, three retrofit designs offer options depending on the cooling infrastructure and density requirements:
For new builds, Schneider Electric and NVIDIA offer a scalable data center design optimized for AI, featuring purpose-built heat rejection systems that can handle AI clusters of up to 1.8 MW at a density of 73 kW per rack.
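To put the new-build figures in perspective, the short sketch below works out how many racks a 1.8 MW cluster could hold if every rack drew the full 73 kW. This is a rough upper bound only: it assumes the entire cluster power budget goes to racks at the stated density, which the summary does not confirm, and real layouts typically mix in lower-density networking and storage racks. The function and constant names are illustrative and are not taken from the reference designs.

```python
import math

# Figures quoted in the text for the new-build design.
CLUSTER_POWER_KW = 1800.0  # 1.8 MW cluster ceiling
RACK_DENSITY_KW = 73.0     # per-rack density


def max_full_density_racks(cluster_kw: float, rack_kw: float) -> int:
    """Whole racks that fit in the power budget at full density.

    Assumption: all cluster power is consumed by identical racks.
    """
    if cluster_kw < 0 or rack_kw <= 0:
        raise ValueError("cluster_kw must be >= 0 and rack_kw must be > 0")
    return math.floor(cluster_kw / rack_kw)


def leftover_power_kw(cluster_kw: float, rack_kw: float) -> float:
    """Power left over after filling the budget with full-density racks."""
    return cluster_kw - max_full_density_racks(cluster_kw, rack_kw) * rack_kw


if __name__ == "__main__":
    racks = max_full_density_racks(CLUSTER_POWER_KW, RACK_DENSITY_KW)
    spare = leftover_power_kw(CLUSTER_POWER_KW, RACK_DENSITY_KW)
    print(f"Full-density racks: {racks}, unallocated power: {spare:.0f} kW")
```

Under these assumptions the budget covers 24 full-density racks, with about 48 kW left over for lower-density equipment.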
The collaboration between NVIDIA and Schneider Electric also provides valuable insights into the challenges posed by high-density AI clusters. These challenges include increased power requirements, sophisticated cooling needs (often requiring liquid cooling), and the need for fortified racks and power delivery systems. To help data center operators, the reference designs include equipment lists, physical layouts, and best practices for efficient and reliable deployment.
These reference designs offer several benefits:
The rapid growth of AI is increasing demand for power-intensive models, with AI-related power consumption predicted to rise by 316% by 2028. NVIDIA’s advancements, including its latest DGX models and the upcoming Blackwell platform, aim to improve energy efficiency and reduce costs, making high-performance AI more accessible.
In addition to serving as a blueprint for NVIDIA’s DGX SuperPODs, the Schneider Electric and NVIDIA designs can be adapted for other high-density AI clusters, provided the specific infrastructure requirements of those clusters are taken into account. Organizations interested in these designs are encouraged to request the detailed engineering documentation and to implement best practices tailored to their needs.
Looking forward, the partnership between Schneider Electric and NVIDIA offers a robust foundation for companies adopting AI. This collaboration ensures that data centers will be able to support future AI advancements with reliable, sustainable infrastructure, empowering businesses to leverage AI’s potential with greater ease and efficiency.
Telephone: 01943 831990
Email: info@advancedpower.co.uk