Skip to content
Snippets Groups Projects
Commit 342217c8 authored by aurelien.bellet's avatar aurelien.bellet
Browse files

finished proofreading main text

parent 7dc1ba40
No related branches found
No related tags found
No related merge requests found
...@@ -970,21 +970,22 @@ non-IID data. ...@@ -970,21 +970,22 @@ non-IID data.
\label{section:conclusion} \label{section:conclusion}
We proposed D-Cliques, a sparse topology that recovers the convergence We proposed D-Cliques, a sparse topology that recovers the convergence
speed of a fully-connected topology in the presence of local class bias. speed of a fully-connected network in the presence of local class bias.
D-Cliques is based on assembling cliques of nodes such that their joint local D-Cliques is based on assembling subsets of nodes into cliques such
distribution is representative of the global distribution so as to locally that the clique-level class distribution is representative of the global
recover IIDness. Cliques are joined in a sparse inter-clique topology so that distribution, thereby locally recovering IIDness. Cliques are joined in a
sparse inter-clique topology so that
they quickly converge to the same model. We proposed Clique they quickly converge to the same model. We proposed Clique
Averaging to remove the non-IID bias in gradient computation by Averaging to remove the non-IID bias in gradient computation by
averaging gradients only with other nodes within the clique. Clique Averaging averaging gradients only with other nodes within the clique. Clique Averaging
can in turn be used to implement unbiased momentum to recover the convergence can in turn be used to implement unbiased momentum to recover the convergence
speed usually only possible with IID mini-batches. Through our experiments, we speed usually only possible with IID mini-batches. Through our experiments, we
showed that the clique structure of D-Cliques is critical in obtaining these showed that the clique structure of D-Cliques is critical in obtaining these
results and that a small-world inter-clique topology with $O(n results and that a small-world inter-clique topology with only $O(n
+ log (n))$ edges seems to achieve the best compromise between + log (n))$ edges achieves the best compromise between
convergence speed and scalability with the number of nodes. convergence speed and scalability with the number of nodes.
D-Cliques thus appears to be promising to reduce bandwidth D-Cliques thus appears to be very promising to reduce bandwidth
usage on FL servers and to implement fully decentralized alternatives in a usage on FL servers and to implement fully decentralized alternatives in a
wider range of applications where global coordination is impossible or costly. wider range of applications where global coordination is impossible or costly.
For instance, the presence and relative frequency of classes in each node For instance, the presence and relative frequency of classes in each node
...@@ -994,10 +995,9 @@ PeerSampling~\cite{jelasity2007gossip}. This will be investigated in future work ...@@ -994,10 +995,9 @@ PeerSampling~\cite{jelasity2007gossip}. This will be investigated in future work
We also believe that our ideas can be useful to deal We also believe that our ideas can be useful to deal
with more general types of data non-IIDness beyond the important case of with more general types of data non-IIDness beyond the important case of
local class bias that we studied in this paper. An important example is local class bias that we studied in this paper. An important example is
covariate shift or feature distribution skew \cite{kairouz2019advances}, where covariate shift or feature distribution skew \cite{kairouz2019advances}, for
local density estimates could be used as basis to construct cliques that which local density estimates could be used as basis to construct cliques that
approximately approximately recover the global distribution.
recover the global distribution.
%\section{Future Work} %\section{Future Work}
%\begin{itemize} %\begin{itemize}
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment