blog_posts: 09047751e9
This data as json
id | createdDate | title | link | postExcerpt | featuredImageUrl | hash | contributors | modifiedDate | displayDate |
---|---|---|---|---|---|---|---|---|---|
blog-posts#34-56876 | 2023-06-07 18:42:18 | Accelerate PyTorch with DeepSpeed to train large language models with Intel Habana Gaudi-based DL1 EC2 instances | https://aws.amazon.com/blogs/machine-learning/accelerate-pytorch-with-deepspeed-to-train-large-language-models-with-intel-habana-gaudi-based-dl1-ec2-instances/ | Training large language models (LLMs) with billions of parameters can be challenging. In addition to designing the model architecture, researchers need to set up state-of-the-art training techniques for distributed training like mixed precision support, gradient accumulation, and checkpointing. With large models, the training setup is even more challenging because the available memory in a single [...] | https://d2908q01vomqb2.cloudfront.net/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59/2023/06/07/accelerate-pytorch-deepspeed-300x150.jpg | 09047751e9 | Mahadevan Balasubramaniam, Abhinandan Patni, Pierre-Yves Aquilanti, Sundar Ranganathan, RJ | 2023-06-07 18:42:18 | 07 Jun 2023 |
Links from other tables
- 8 rows from blog_post_hash in blog_post_tags