Subscribe
Sign in
Home
Notes
Archive
About
The five things you can shard
A working person's map of distributed training. There are only five axes; everything else is a name.
May 25
•
Saksham Consul
3
Training is not (just) a compute problem
Why your GPUs spend most of their time waiting, and what that tells you about the field.
May 22
•
Saksham Consul
1
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts