Commit d0c11e27 authored by Mika Senghaas

Add figure to README

parent a7ac94c5
@@ -2,6 +2,8 @@
This repository contains the full experiment code and results for validating DiLoCo-SWARM, a distributed training method that combines the pipeline and data parallelism of [SWARM](https://arxiv.org/abs/2301.11913) with [DiLoCo](https://arxiv.org/abs/2311.08105)'s reduced-frequency gradient synchronization.
![DiLoCo-SWARM](/presentation/figures/experiment1-1.png)
## 🔗 Shortcuts
- 📄 Read the [report](report.pdf)
......