Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revision | |||
| training:ds:2023:distributed_deep_learning_on_ksl_platforms [2023/05/04 11:46] – removed - external edit (Unknown date) 127.0.0.1 | training:ds:2023:distributed_deep_learning_on_ksl_platforms [2023/05/04 11:46] (current) – ↷ Page moved from training:ds:distributed_deep_learning_on_ksl_platforms to training:ds:2023:distributed_deep_learning_on_ksl_platforms James Kress | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| + | ====== Distributed Deep Learning on KSL Platforms ====== | ||
| + | |||
| + | <WRAP group> | ||
| + | |||
| + | <WRAP twothirds column> | ||
| + | |||
| + | ===== Overview ===== | ||
| + | |||
| + | With the increasing complexity and size of both Deep Learning (DL) models and datasets, the computational cost of training these model can be non-trivial, | ||
| + | |||
| + | ===== Learning Outcomes ===== | ||
| + | |||
| + | After attending the training, you will be able to: | ||
| + | |||
| + | * Understand the considerations when refactoring training scripts to scale from 1 to N GPUs | ||
| + | * Understand the data management related to distributed training jobs | ||
| + | * Familiarize how to launch distributed training jobs on Ibex resources | ||
| + | * Understanding the scaling characteristics of your distributed training workload | ||
| + | |||
| + | A Quiz will be conducted after the training, which is mandatory to submit to ensure the continued use of KSL resources. | ||
| + | |||
| + | </ | ||
| + | |||
| + | <WRAP quarter column>< | ||
| + | |||
| + | * February 12th, 2023 | ||
| + | * 9:00 am - 12:00 pm | ||
| + | |||
| + | <WRAP center round box 100%> {{: | ||
| + | |||
| + | * Room 5220, Level 5, Building 3 | ||
| + | |||
| + | </ | ||
| + | |||
| + | </ | ||
| + | |||
| + | <WRAP column> <WRAP center round box download 100%> {{: | ||
| + | |||
| + | {{: | ||
| + | |||
| + | {{: | ||
| + | |||
| + | </ | ||
| + | |||
| + | <wrap indent></ | ||
| + | |||
| + | </ | ||
| + | |||
| + | <WRAP quarter column>< | ||
| + | |||
| + | * Slides: [[https:// | ||
| + | * GitHub: [[https:// | ||
| + | * Recording: [[https:// | ||
| + | * Documentation: | ||
| + | |||
| + | </ | ||
| + | |||
| + | <WRAP center round box todo 100%> **Pre-requisites****? | ||
| + | |||
| + | * Have KAUST IT credentials (i.e. the ones you use to access your KAUST email) | ||
| + | * Bring your laptop and have your terminal ready | ||
| + | * Essential knowledge of Linux shell is necessary. | ||
| + | * Have some experience working with Conda package manager. | ||
| + | * Basic training “Data Science on-boarding on KSL platforms” or possess equivalent knowledge | ||
| + | |||
| + | </ | ||
| + | |||
| + | </ | ||
| + | |||
| + | </ | ||
| + | |||
| + | {{tag> | ||
| + | |||