Recently, I found a new data partition strategy called the Extended Dirichlet strategy ~~~ ours :), which could be added to this repo.
It combines the two common partition strategies (i.e., quantity-based class imbalance and distribution-based class imbalance in Li et al. (2022)) to generate arbitrarily heterogeneous data. The difference is that it adds a class (label) allocation step, which determines the number of classes per client (denoted by $C$), before allocating samples via the Dirichlet distribution (with concentration parameter $\alpha$).
The implementation is available in convergence. You can find more details in *Convergence Analysis of Sequential Federated Learning on Heterogeneous Data*.
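To make the two-step idea concrete, here is a minimal sketch of how such a partition could be implemented. This is a hypothetical helper (the function name, signature, and round-robin class allocation are my assumptions, not the repo's actual API): step 1 assigns $C$ classes to each client; step 2 splits each class's samples among the clients holding it, with proportions drawn from $\mathrm{Dir}(\alpha)$.

```python
import numpy as np

def extended_dirichlet_partition(labels, num_clients, classes_per_client, alpha, seed=0):
    """Sketch of an Extended Dirichlet partition (hypothetical helper).

    Assumes integer labels 0..K-1 and classes_per_client <= number of classes.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    num_classes = int(labels.max()) + 1

    # Step 1: allocate C classes to each client (round-robin here, so every
    # class is held by roughly the same number of clients).
    class_clients = [[] for _ in range(num_classes)]  # class -> owning clients
    c = 0
    for k in range(num_clients):
        for _ in range(classes_per_client):
            class_clients[c].append(k)
            c = (c + 1) % num_classes

    # Step 2: for each class, split its (shuffled) sample indices among the
    # owning clients with proportions drawn from Dirichlet(alpha).
    client_indices = [[] for _ in range(num_clients)]
    for cls in range(num_classes):
        owners = class_clients[cls]
        if not owners:
            continue
        idx = rng.permutation(np.where(labels == cls)[0])
        props = rng.dirichlet(alpha * np.ones(len(owners)))
        cuts = (np.cumsum(props)[:-1] * len(idx)).astype(int)
        for owner, part in zip(owners, np.split(idx, cuts)):
            client_indices[owner].extend(part.tolist())
    return [np.array(ix) for ix in client_indices]
```

Smaller $\alpha$ makes the per-class Dirichlet proportions more skewed, while smaller $C$ restricts each client to fewer classes, so the two knobs control heterogeneity independently.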
[Figure:
Row 1: $C=2$ with $\alpha=0.1$, $\alpha=1.0$, $\alpha=10.0$;
Row 2: $C=5$ with $\alpha=0.1$, $\alpha=1.0$, $\alpha=10.0$;
Row 3: $C=10$ with $\alpha=0.1$, $\alpha=1.0$, $\alpha=10.0$; ]
Li, Q., Diao, Y., Chen, Q., & He, B. (2022, May). Federated learning on non-iid data silos: An experimental study. In 2022 IEEE 38th International Conference on Data Engineering (ICDE) (pp. 965-978). IEEE.