Skip to main content

Overview

Supervised training of large networks requires large labeled datasets, which in turn demand high computational costs. While active practitioners in deep learning primarily develop and train their networks on local computing devices, with the increase of networks complexity, there is an urgent need to create, train, and test models on clusters.

In this workshop, we overview the basics of Docker and Singularity. (Working knowledge of Singularity as given in the Uppmax workshop on Singularity is desirable.) Distributed training using TensorFlow and Horovod frameworks on a supercomputer will be covered. Moreover, it will be shown how to use Singularity containers in conjunction with TensorFlow and Horovod to upscale an AI app.

The workshop will be entirely online using zoom.

Outcomes

  • Create, deploy, and update containers locally on a supercomputer
  • Upscale the transfer learning of an NLP model in TensorFlow
  • Upscale the transfer learning of an NLP model using Horovod
  • Upscale the transfer learning of a containerized NLP model

Prerequisites

Basic knowledge of UNIX OS and familiarity with NNs are required.

Agenda and Registration

For updated agenda and registration please visit https://enccs.se/events/2022-04-upscaling-ai-with-containers/