AWS And Facebook Launch PyTorch Tools
Written by Alex Denham   
Friday, 05 June 2020

Two new tools have been released for PyTorch, the open source library for deep learning. Both are collaborations between Amazon AWS and Facebook. TorchServe is a PyTorch model serving library, while the TorchElastic Controller for Kubernetes adds Kubernetes support to TorchElastic, a library for fault-tolerant and elastic training in PyTorch.

PyTorch is an optimized tensor library for deep learning using GPUs and CPUs. It aims to offer a replacement for NumPy that make suse of the power of GPUs, while providing a deep learning research platform that provides maximum flexibility and speed.

pytorch

TorchServe aims to provide a clean, well supported, and industrial-grade path to deploying PyTorch models for inference at scale without having to write custom code. TorchServe provides a low latency prediction API, and also embeds default handlers for the most common applications such as object detection and text classification. It also includes multi-model serving, model versioning for A/B testing, monitoring metrics, and RESTful endpoints for application integration.

The Kubernetes Controller with TorchElastic integration gives PyTorch developers a way to train machine learning models on a cluster of compute nodes that can dynamically change without disrupting the model training process. If a node goes down, TorchElastic can pause node level training and resume once the node is healthy again. By using the Kubernetes controller with TorchElastic, distributed training jobs can be run on clusters with nodes that get replaced, either due to hardware issues or node reclamation. This means developers can create training systems that can work on large distributed Kubernetes clusters that include cheaper spot instances. Such instances can vary significantly depending on how many unused EC2 instances are available, and are liable to interruption, which would cause problems with traditional machine learning training frameworks.

pytorch

More Information

TorchServe

TorchElastic Controller For Kubernetes

Related Articles

PyTorch Adds TorchScript API

PyTorch Scholarship Challenge

Microsoft Cognitive Toolkit Version 2.0

Microsoft Open Sources Natural Language Processing Tool

Microsoft Open Sources AI Debugging Tool

More AI Tools From Microsoft

 

To be informed about new articles on I Programmer, sign up for our weekly newsletter, subscribe to the RSS feed and follow us on Twitter, Facebook or Linkedin.

Banner


Windows 11 Overtakes Windows 10 - But Not In Europe
08/07/2025

With the end of support of Windows 10 just three months away, Windows 11 has finally edged ahead of Windows 10 in terms of  Desktop Windows Version Market Share on a Worldwide Basis. In Europe, h [ ... ]



Alan Turing's Papers Raise A Fortune
23/06/2025

Because so much of his work was top secret, Alan Turing was very much an unsung hero during his lifetime. Recognition of his many achievements dawned gradually and now his reputation is worldwide [ ... ]


More News

pico book

 

Comments




or email your comment to: comments@i-programmer.info