Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
D
D-Cliques
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
SaCS
Distributed Machine Learning
D-Cliques
Commits
342217c8
Commit
342217c8
authored
3 years ago
by
aurelien.bellet
Browse files
Options
Downloads
Patches
Plain Diff
finished proofreading main text
parent
7dc1ba40
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
main.tex
+11
-11
11 additions, 11 deletions
main.tex
with
11 additions
and
11 deletions
main.tex
+
11
−
11
View file @
342217c8
...
...
@@ -970,21 +970,22 @@ non-IID data.
\label
{
section:conclusion
}
We proposed D-Cliques, a sparse topology that recovers the convergence
speed of a fully-connected topology in the presence of local class bias.
D-Cliques is based on assembling cliques of nodes such that their joint local
distribution is representative of the global distribution so as to locally
recover IIDness. Cliques are joined in a sparse inter-clique topology so that
speed of a fully-connected network in the presence of local class bias.
D-Cliques is based on assembling subsets of nodes into cliques such
that the clique-level class distribution is representative of the global
distribution, thereby locally recovering IIDness. Cliques are joined in a
sparse inter-clique topology so that
they quickly converge to the same model. We proposed Clique
Averaging to remove the non-IID bias in gradient computation by
averaging gradients only with other nodes within the clique. Clique Averaging
can in turn be used to implement unbiased momentum to recover the convergence
speed usually only possible with IID mini-batches. Through our experiments, we
showed that the clique structure of D-Cliques is critical in obtaining these
results and that a small-world inter-clique topology with
$
O
(
n
+
log
(
n
))
$
edges
seems to
achieve the best compromise between
results and that a small-world inter-clique topology with
only
$
O
(
n
+
log
(
n
))
$
edges achieve
s
the best compromise between
convergence speed and scalability with the number of nodes.
D-Cliques thus appears to be promising to reduce bandwidth
D-Cliques thus appears to be
very
promising to reduce bandwidth
usage on FL servers and to implement fully decentralized alternatives in a
wider range of applications where global coordination is impossible or costly.
For instance, the presence and relative frequency of classes in each node
...
...
@@ -994,10 +995,9 @@ PeerSampling~\cite{jelasity2007gossip}. This will be investigated in future work
We also believe that our ideas can be useful to deal
with more general types of data non-IIDness beyond the important case of
local class bias that we studied in this paper. An important example is
covariate shift or feature distribution skew
\cite
{
kairouz2019advances
}
, where
local density estimates could be used as basis to construct cliques that
approximately
recover the global distribution.
covariate shift or feature distribution skew
\cite
{
kairouz2019advances
}
, for
which local density estimates could be used as basis to construct cliques that
approximately recover the global distribution.
%\section{Future Work}
%\begin{itemize}
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment