Dell Brings Turnkey GPUaaS to VMware Using Bitfusion

Dell EMC is bringing a new GPU-as-a-Service or GPUaaS offering to the market. Underpinning the GPU hardware, Dell EMC is leveraging software from VMware as well as the Bitfusion acquisition to help drive adoption and utilization of accelerated computing. With the solution, instead of targeting those leading companies that have already deployed an AI or HPC solution, Dell EMC is hoping to capture the next wave of adoption by making the task easier.

Dell is using this graphic to frame the conversation. If we think of 14.6% of the market using AI today, they are the early adopters who are leaders in the field. Still, that leaves 85.4% of the market that are not leaders and that Dell hopes to service with their solutions.

Dell Brings Turnkey GPUaaS for AI and HPC
As part of the Dell Technologies strategy, it is leaning on its VMware integration to bring GPU accelerated AI and HPC to vSphere. One will notice that while the company is saying it is for HPC, this is not for high-performance supercomputer HPC. Instead, it can be used for “some HPC workloads.”

Most companies that have embarked on this journey have already solved this problem. Indeed, using Kubernetes has made solving the basic problem Dell points out trivial, but if you want to run VMware Tanzu and have a typical VMware environment, you likely have silos of GPUs that are available. These GPUs reside in different silos. A great example is GPUs that are used for VDI during the day but sit idle in the evening. Likewise, different groups may have small GPU servers. Dell EMC knows that this is an underutilization of expensive hardware, and is looking to make it more efficient.

With the Bitfusion acquisition, VMware vSphere admins can manage pools of GPUs. They can allocate GPUs and GPU memory to different user groups from one pool.

The overall Dell EMC PowerEdge servers, PowerSwitch networking, and Isilon storage is designed as a Ready Solution that can be quickly deployed according to a pre-defined formula.

The formula for the GPUaaS Ready Solution for AI uses fairly standard Intel and NVIDIA components:

As a quick aside here, Dell EMC’s offering is what one would expect from a larger corporate offering. Still, if you look at what leading AI companies such as Tesla and even smaller but well funded autonomous driving companies like Zoox use, they are not building out massive arrays of NVIDIA V100 / T4’s as their GPUs of choice. Those companies use different GPUs in systems more similar to the Dell EMC DSS 8440 for their scale-out GPU clusters. Higher-end scale-up NVIDIA-based AI work will happen on NVSwitch-based solutions such as the Inspur NF5488M5 we reviewed, the new DGX A100 (when it is available), new HGX-2 based 16x GPU solutions and the like.

Dell’s offering is not necessarily focused on those who see AI infrastructure as a critical competency, but rather for IT departments that want to provide solutions based on Dell EMC and VMware. We reviewed the Dell EMC PowerEdge R740xd and know that Dell has many great corporate HPC/ AI customers for the Dell EMC PowerEdge C4140, but it is a different solution for a different type of customer.

The HPC solution is interesting because it builds upon the AI solution, with additional hardware options such as using Mellanox (now NVIDIA) Infiniband as well as AMD EPYC based nodes. This is a good indication to see that Dell EMC is seeing interest in AMD EPYC Rome and future Milan CPUs in the HPC space.

Dell also has scale-out storage solutions available. One will quickly notice that the HPC design has a lot more options on its validated hardware.

Final Words
On a pre-briefing, Dell said that not only will the offering be part of one of its Ready Solutions but the company will go one step further and configure the rack at the factory using this solution so that it is a turnkey experience.

Overall, this makes a lot of sense. Dell has an enormous market within its customer base of Dell-VMware shops that can utilize this type of solution. Although the leading-edge deployments are always interesting, one can say that the sweet spot is making it available to more buyers. Perhaps the underlying message here is not necessarily that Dell EMC has a solution aimed at the leading edge ~15%. Instead, it is that Dell EMC has a solution that is going after the next 70% of customers. As Dell and VMware make it easier to deploy AI infrastructure, there is a natural separation that will occur where those who do not adopt, and are in that last 15% (or so) will not be able to compete. In a sense, by democratizing the technology with VMware and Bitfusion, Dell can help make structural winners and losers driven by the CIO’s staff at large corporations. Perhaps heading into the next economic cycle it is those who take the opportunity to integrate now that make it out.

server-GPU

搜索此博客

Dell Brings Turnkey GPUaaS to VMware Using Bitfusion

标签

评论

发表评论

此博客中的热门博文

Nvidia GTX 1660S (Super)