Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revision | |||
training:ds:2023:distributed_deep_learning_on_ksl_platforms [2023/05/04 11:46] – removed - external edit (Unknown date) 127.0.0.1 | training:ds:2023:distributed_deep_learning_on_ksl_platforms [2023/05/04 11:46] (current) – ↷ Page moved from training:ds:distributed_deep_learning_on_ksl_platforms to training:ds:2023:distributed_deep_learning_on_ksl_platforms James Kress | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ====== Distributed Deep Learning on KSL Platforms ====== | ||
+ | |||
+ | <WRAP group> | ||
+ | |||
+ | <WRAP twothirds column> | ||
+ | |||
+ | ===== Overview ===== | ||
+ | |||
+ | With the increasing complexity and size of both Deep Learning (DL) models and datasets, the computational cost of training these model can be non-trivial, | ||
+ | |||
+ | ===== Learning Outcomes ===== | ||
+ | |||
+ | After attending the training, you will be able to: | ||
+ | |||
+ | * Understand the considerations when refactoring training scripts to scale from 1 to N GPUs | ||
+ | * Understand the data management related to distributed training jobs | ||
+ | * Familiarize how to launch distributed training jobs on Ibex resources | ||
+ | * Understanding the scaling characteristics of your distributed training workload | ||
+ | |||
+ | A Quiz will be conducted after the training, which is mandatory to submit to ensure the continued use of KSL resources. | ||
+ | |||
+ | </ | ||
+ | |||
+ | <WRAP quarter column>< | ||
+ | |||
+ | * February 12th, 2023 | ||
+ | * 9:00 am - 12:00 pm | ||
+ | |||
+ | <WRAP center round box 100%> {{: | ||
+ | |||
+ | * Room 5220, Level 5, Building 3 | ||
+ | |||
+ | </ | ||
+ | |||
+ | </ | ||
+ | |||
+ | <WRAP column> <WRAP center round box download 100%> {{: | ||
+ | |||
+ | {{: | ||
+ | |||
+ | {{: | ||
+ | |||
+ | </ | ||
+ | |||
+ | <wrap indent></ | ||
+ | |||
+ | </ | ||
+ | |||
+ | <WRAP quarter column>< | ||
+ | |||
+ | * Slides: [[https:// | ||
+ | * GitHub: [[https:// | ||
+ | * Recording: [[https:// | ||
+ | * Documentation: | ||
+ | |||
+ | </ | ||
+ | |||
+ | <WRAP center round box todo 100%> **Pre-requisites****? | ||
+ | |||
+ | * Have KAUST IT credentials (i.e. the ones you use to access your KAUST email) | ||
+ | * Bring your laptop and have your terminal ready | ||
+ | * Essential knowledge of Linux shell is necessary. | ||
+ | * Have some experience working with Conda package manager. | ||
+ | * Basic training “Data Science on-boarding on KSL platforms” or possess equivalent knowledge | ||
+ | |||
+ | </ | ||
+ | |||
+ | </ | ||
+ | |||
+ | </ | ||
+ | |||
+ | {{tag> | ||
+ | |||