This was brought to my attention some time ago. Here is a technical overview of what Amazon is doing there. I was looking up some notes on Gaudi generators. Does anyone know of someone doing this now?
Amazon Releases DL1 Instances Powered by Gaudi Accelerators
Amazon recently announced the general availability of the EC2 DL1 instances powered by Gaudi accelerators from Habana Labs. The new instances promise better price performance when training deep learning models for use cases such as computer vision, natural language processing, autonomous vehicle perception, and recommendation engines.
The DL1 instances are available in a single size, dl1.24xlarge, which has 8 Gaudi accelerators with 32 GB of high bandwidth memory (HBM) per accelerator, Intel Xeon Scalable processors, 768 GB of memory, 400 Gbps of networking throughput, and 4 TB of local storage.
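To put those numbers in perspective, here is a minimal Python sketch that records the published specs and works out the aggregate HBM across the eight accelerators. The `InstanceSpec` class and the `fits_in_hbm` helper are hypothetical names made up for illustration, not part of any AWS or Habana API, and the check is deliberately crude: it assumes a model's memory footprint can be split evenly across accelerators with no overhead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InstanceSpec:
    """Hypothetical container for the published instance figures."""
    name: str
    accelerators: int
    hbm_per_accelerator_gb: int
    host_memory_gb: int
    network_gbps: int
    local_storage_tb: int

    @property
    def total_hbm_gb(self) -> int:
        # Aggregate accelerator memory: count times per-device HBM.
        return self.accelerators * self.hbm_per_accelerator_gb


# Figures taken from the announcement quoted above.
DL1 = InstanceSpec(
    name="dl1.24xlarge",
    accelerators=8,
    hbm_per_accelerator_gb=32,
    host_memory_gb=768,
    network_gbps=400,
    local_storage_tb=4,
)


def fits_in_hbm(spec: InstanceSpec, footprint_gb: float) -> bool:
    """Rough check (assumption: perfect sharding, zero overhead)."""
    return footprint_gb <= spec.total_hbm_gb


if __name__ == "__main__":
    print(f"{DL1.name}: {DL1.total_hbm_gb} GB of HBM in total")
```

So the eight accelerators together give 8 x 32 = 256 GB of HBM, one third of the 768 GB of host memory. A real training job would fit less than the rough check suggests, since activations, optimizer state and communication buffers all take their share.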
Jeff Barr, vice president and chief evangelist at AWS, explains the benefits of the new instances:
"There are more applications today for deep learning than ever before. Natural language processing, recommendation systems, image recognition, video recognition, and more can all benefit from high-quality, well-trained models. (...) The training process is math and processor intensive, and places demands on just about every part of the systems used for training including the GPU or other training accelerator, the network, and local or network storage."
The new instances include the Habana SynapseAI SDK, which is integrated with the TensorFlow and PyTorch machine learning frameworks. They were originally announced by Andy Jassy, then CEO at AWS, at re:Invent 2020. Dylan Martin, senior associate editor at CRN, comments:
"This is clearly much later than Intel was originally planning. Last December, Intel said these instances would be available in the first half of 2021..."
See: https://habana.ai/