The demo is set up with a simple CIFAR10 model. It can be executed with
python3 pytorch/simulation.py
...
# privacy guarantees for (epsilon, delta)-DP
epsilon = 0.8 # privacy budget; lower means stronger privacy
delta = 1 / 2e5 # probability of exceeding the privacy budget
l2_norm_clip = 1.5 # max euclidean norm of gradients
# client variables
epochs = 1 # how many epochs to go through
batch_size = 256 # batch size for training
learning_rate = 0.001 # how quickly the model learns
min_dataset_size = 1e5 # minimum training set size
# server variables
num_rounds = 3 # number of train/val rounds to go through
min_available_clients = 3 # minimum number of clients to train/val - `N`
clients_per_round = 3 # number of clients to be selected for each round - `K`
...
PrivateServer is pretty simple: it subclasses fl.server.strategy.FedAvg and modifies the constructor and the aggregate_fit function. The following arguments have been added to the constructor:
def __init__(
    self,
    epsilon: float,
    delta: float = 1 / 2e5,
    l2_norm_clip: float = 1.5,
    num_rounds: int = None,
    min_dataset_size: int = 1e5,
    *args,
    **kwargs
) -> None:
    super().__init__(*args, **kwargs)
    self.sigma_d = privacy.calculate_sigma_d(
        epsilon=epsilon,
        delta=delta,
        l2_norm_clip=l2_norm_clip,
        num_exposures=num_rounds,
        num_rounds=num_rounds,
        num_clients=self.min_fit_clients,
        min_dataset_size=min_dataset_size,
    )
These extra arguments (epsilon, delta, l2_norm_clip, num_rounds, min_dataset_size) are used to compute self.sigma_d, the scale of the noise added server-side during aggregation.
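For orientation, here is a minimal sketch of how the demo's values might be passed to this constructor. The server import path and the mapping of clients_per_round onto FedAvg's min_fit_clients are assumptions, not taken from the repo:

import server  # assumed: the module defining PrivateServer (pytorch/server.py)

strategy = server.PrivateServer(
    epsilon=0.8,
    delta=1 / 2e5,
    l2_norm_clip=1.5,
    num_rounds=3,
    min_dataset_size=1e5,
    # standard FedAvg keyword arguments, forwarded via **kwargs
    min_fit_clients=3,        # clients selected for each round (`K`)
    min_available_clients=3,  # clients required before a round starts (`N`)
)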
The aggregate_fit function is expanded to include noising. Its signature is the same as its superclass's.
def aggregate_fit(
    self,
    rnd: int,
    results: List[Tuple[fl.server.client_proxy.ClientProxy, fl.common.FitRes]],
    failures: List[BaseException],
) -> Tuple[Optional[Parameters], Dict[str, Scalar]]:
    # call the superclass method
    response = super().aggregate_fit(rnd, results, failures)
    aggregated_params = response[0]
    # add noise
    noised_parameters = privacy.server_side_noise(aggregated_params, self.sigma_d)
    return noised_parameters, response[1]
Naturally, this can be extended to any other strategy; just make sure to apply the noise as the last step of aggregate_fit. A sketch of the same pattern applied to another built-in strategy follows below.
(from pytorch/server.py)
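As an illustration, here is a hedged sketch of the noising pattern applied to another built-in strategy (assuming your Flower version ships fl.server.strategy.FedAvgM); this is not code from the repo:

import flwr as fl
import privacy  # the repo's privacy helpers, assumed importable as in server.py

class PrivateFedAvgM(fl.server.strategy.FedAvgM):
    def __init__(self, sigma_d: float, *args, **kwargs) -> None:
        super().__init__(*args, **kwargs)
        self.sigma_d = sigma_d  # computed via privacy.calculate_sigma_d, as above

    def aggregate_fit(self, rnd, results, failures):
        # let the base strategy aggregate first
        aggregated_params, metrics = super().aggregate_fit(rnd, results, failures)
        # noising is the last step
        noised_params = privacy.server_side_noise(aggregated_params, self.sigma_d)
        return noised_params, metrics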
This client is loosely based on the PyTorch Quickstart from Flower's docs.
The constructor is self-explanatory: it assigns variables, then computes sigma_u, the scale of the noise added during training.
def __init__(
    self,
    trainloader: DataLoader,
    testloader: DataLoader,
    model: nn.Module,
    loss_function: nn.Module,
    epsilon: float = 10,
    delta: float = 1 / 2e5,
    l2_norm_clip: float = 1.5,
    num_rounds: int = 3,
    min_dataset_size: int = 1e5,
    epochs: int = 1,
    device: str = torch.device("cuda:0" if torch.cuda.is_available() else "cpu"),
    *args,
    **kwargs,
) -> None:
    ...  # variable assignment
    self.sigma_u = privacy.calculate_sigma_u(
        epsilon=epsilon,
        delta=delta,
        l2_norm_clip=l2_norm_clip,
        num_exposures=num_rounds,
        min_dataset_size=min_dataset_size,
    )
The training step is standard PyTorch, with one important difference: the call to privacy.noise_and_clip_gradients. This function is defined in pytorch/privacy.py and handles clipping and noising the model's gradients (a sketch of what such a helper can look like follows the code below).
def train_step(self, x, y) -> None:
    # send to device and compute loss
    images, labels = x.to(self.device), y.to(self.device)
    loss = self.loss_function(self.net(images), labels)
    # compute grads
    self.optimizer.zero_grad()
    loss.backward()
    # apply noise
    privacy.noise_and_clip_gradients(
        self.net.parameters(),
        l2_norm_clip=self.l2_norm_clip,
        sigma=self.sigma_u,
    )
    # apply gradients
    self.optimizer.step()
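For intuition, a clip-then-noise helper along these lines could look like the sketch below; it is an illustrative version, not the implementation in pytorch/privacy.py:

from typing import Iterable
import torch

def noise_and_clip_gradients(parameters: Iterable[torch.nn.Parameter],
                             l2_norm_clip: float, sigma: float) -> None:
    """Clip the global gradient norm, then add Gaussian noise (sketch only)."""
    params = [p for p in parameters if p.grad is not None]
    # clip the joint l2 norm of all gradients to l2_norm_clip
    torch.nn.utils.clip_grad_norm_(params, max_norm=l2_norm_clip)
    # perturb each gradient with zero-mean Gaussian noise of scale sigma
    for p in params:
        p.grad += torch.randn_like(p.grad) * sigma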
The test function is straightforward and standard for a PyTorch evaluation loop.
def test(self) -> Tuple[float, float]:
    """Validate the network on the entire test set."""
    self.net.eval()  # put in test mode
    criterion = self.loss_function
    correct, total, loss = 0, 0, 0.0
    with torch.no_grad():
        for data in self.testloader:
            images, labels = data[0].to(self.device), data[1].to(self.device)
            outputs = self.net(images)
            loss += criterion(outputs, labels).item()
            _, predicted = torch.max(outputs.data, 1)
            total += labels.size(0)
            correct += (predicted == labels).sum().item()
    accuracy = correct / total
    return loss, accuracy
Lastly, the get_parameters, set_parameters, fit, and evaluate functions tie it all together. You can read more about them in Flower's docs.
def get_parameters(self):
    return [val.cpu().numpy() for _, val in self.net.state_dict().items()]

def set_parameters(self, parameters):
    params_dict = zip(self.net.state_dict().keys(), parameters)
    state_dict = OrderedDict({k: torch.tensor(v) for k, v in params_dict})
    self.net.load_state_dict(state_dict, strict=True)

def fit(self, parameters, config):
    self.set_parameters(parameters)
    self.train()
    return self.get_parameters(), self.num_examples["trainset"], {}

def evaluate(self, parameters, config):
    self.set_parameters(parameters)
    loss, accuracy = self.test()
    return float(loss), self.num_examples["testset"], {"accuracy": float(accuracy)}
(from pytorch/client.py)
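Outside of the simulation script, a client like this would typically be connected to a running server with Flower's NumPy client entry point (the exact call depends on your Flower version; the address and the client variable name are placeholders):

import flwr as fl

# my_private_client is an instance of client.PrivateClient (hypothetical name)
fl.client.start_numpy_client(
    server_address="127.0.0.1:8080",
    client=my_private_client,
)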
Using a custom model and loss function with flower-dp is simple. When instantiating client.PrivateClient, pass the custom model as the model parameter and the loss function you'd like to use as the loss_function parameter. For example:
import client
import torch

class Net(torch.nn.Module):
    ...
    # custom model definition here

trainloader = ...  # dataloader with training set
testloader = ...  # dataloader with testing set

net = Net()
loss = torch.nn.CrossEntropyLoss()
client = client.PrivateClient(
    trainloader=trainloader,
    testloader=testloader,
    model=net,
    loss_function=loss,
)
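The Net placeholder above can be any torch.nn.Module. For instance, a minimal CIFAR10-style CNN (illustrative only, not the repo's model) could be:

import torch
import torch.nn as nn
import torch.nn.functional as F

class Net(nn.Module):
    """A small CNN for 32x32 RGB CIFAR10 images (illustrative only)."""

    def __init__(self) -> None:
        super().__init__()
        self.conv1 = nn.Conv2d(3, 6, 5)
        self.pool = nn.MaxPool2d(2, 2)
        self.conv2 = nn.Conv2d(6, 16, 5)
        self.fc1 = nn.Linear(16 * 5 * 5, 120)
        self.fc2 = nn.Linear(120, 84)
        self.fc3 = nn.Linear(84, 10)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.pool(F.relu(self.conv1(x)))
        x = self.pool(F.relu(self.conv2(x)))
        x = torch.flatten(x, 1)
        x = F.relu(self.fc1(x))
        x = F.relu(self.fc2(x))
        return self.fc3(x)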
Note that the Adam optimizer is the default in the client.PrivateClient class.
The privacy module contains the most important privacy calculations. The equations used are taken directly from Federated Learning with Differential Privacy: Algorithms and Performance Analysis. It exposes the following functions:
- `calculate_c(delta: float) -> float`
- `calculate_sigma_d(epsilon: float, delta: float, l2_norm_clip: float, num_exposures: int, num_rounds: int, num_clients: int, min_dataset_size: int) -> float`
- `calculate_sigma_u(epsilon: float, delta: float, l2_norm_clip: float, num_exposures: int, min_dataset_size: int) -> float`
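For reference, the constant c in this kind of analysis typically follows the standard Gaussian-mechanism bound, c = sqrt(2 ln(1.25 / delta)). A hedged sketch of calculate_c under that assumption (the repo's implementation may differ):

import math

def calculate_c(delta: float) -> float:
    # standard Gaussian-mechanism constant: c = sqrt(2 * ln(1.25 / delta))
    return math.sqrt(2 * math.log(1.25 / delta))

# e.g. for the demo's delta = 1 / 2e5, c is roughly 5.0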