Skip to content

Dirichlet

The Dirichlet-based distribution uses a Dirichlet distribution to determine the share of each label for every client. The distribution parameter α is customizable by the user. Following related literature, a default value of α = 0.5 is set.

Parameters

Key Description Example Value
data_label_distribution_parameter Minimum number of samples to assign to each client 1

Based on:

Li, Qinbin, Yiqun Diao, Quan Chen, and Bingsheng He. 2022. 
“Federated Learning on Non-IID Data Silos: An Experimental Study.” 
In 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE. 
https://doi.org/10.1109/icde53745.2022.00077.
Hsu, Tzu-Ming Harry, Hang Qi, and Matthew Brown. 2019. 
“Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification.” 
arXiv [cs.LG]. arXiv. http://arxiv.org/abs/1909.06335.