Federated Learning for Privacy-Sensitive Industries
Data privacy regulations constrain how organizations in healthcare, finance, and telecommunications can use machine learning. Federated learning trains models on decentralized data without moving it to a central location.
What is Federated Learning?
Federated learning is a machine learning approach that trains algorithms across multiple decentralized devices or servers holding local data samples, without exchanging them. Instead of sending raw data to a central server for training, the model is sent to where the data resides. After local training, only model updates are communicated back to the central server, where they are aggregated to improve the global model.
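This round structure can be sketched in plain Python on a toy linear-regression task. Everything here is illustrative (no real federated-learning API is used): the server broadcasts weights, each client runs a few steps of local gradient descent, and the server averages the returned weights.

```python
import numpy as np

def local_train(weights, X, y, lr=0.1, epochs=5):
    """One client's local training: plain linear regression via gradient descent."""
    w = weights.copy()
    for _ in range(epochs):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w

def federated_round(global_weights, client_data):
    """Broadcast the model, collect locally trained weights, average them."""
    client_weights = [local_train(global_weights, X, y) for X, y in client_data]
    return np.mean(client_weights, axis=0)  # unweighted averaging for simplicity

# Three clients, each holding its own private data drawn from the same task
rng = np.random.default_rng(0)
true_w = np.array([2.0, -1.0])
clients = []
for _ in range(3):
    X = rng.normal(size=(50, 2))
    clients.append((X, X @ true_w))  # raw (X, y) never leaves the client

w = np.zeros(2)
for _ in range(20):
    w = federated_round(w, clients)  # only weights cross the network
```

Note that only the weight vectors travel between server and clients; each `(X, y)` pair stays where it was generated.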
This approach offers several key advantages:
- Enhanced Privacy: Sensitive data never leaves the local environment
- Regulatory Compliance: Easier adherence to regulations like GDPR, HIPAA, or CCPA
- Reduced Data Transfer: Minimizes bandwidth usage and associated costs
- Leveraging Distributed Data: Utilizes data across organizational boundaries
Implementing Federated Learning Systems
Core Components
A typical federated learning system consists of:
- Central Server: Coordinates the training process and aggregates model updates
- Client Devices/Servers: Where local data resides and local training occurs
- Aggregation Algorithm: Combines updates from all clients (e.g., FedAvg, FedProx)
- Secure Communication Layer: Ensures safe transmission of model updates
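The aggregation component can be illustrated with FedAvg's core rule: the server averages client weights, weighting each client by its local sample count. `fedavg_aggregate` is an illustrative name for this sketch, not a library function.

```python
import numpy as np

def fedavg_aggregate(client_weights, client_sample_counts):
    """Weighted average of client model weights (one np.ndarray per client),
    with weights proportional to each client's number of training samples."""
    total = sum(client_sample_counts)
    agg = np.zeros_like(client_weights[0])
    for w, n in zip(client_weights, client_sample_counts):
        agg += (n / total) * w
    return agg

# Example: two clients, one with four times as much data
w_a = np.array([1.0, 1.0])
w_b = np.array([3.0, 3.0])
print(fedavg_aggregate([w_a, w_b], [400, 100]))  # → [1.4 1.4]
```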
Implementation Example with TensorFlow Federated
TensorFlow Federated (TFF) is an open-source framework for machine learning and other computations on decentralized data. Here’s a simplified example of implementing a federated learning system:
```python
import numpy as np
import tensorflow as tf
import tensorflow_federated as tff

# Define a simple model for demonstration
def create_keras_model():
    return tf.keras.models.Sequential([
        tf.keras.layers.InputLayer(input_shape=(784,)),
        tf.keras.layers.Dense(10, kernel_initializer='zeros'),
        tf.keras.layers.Softmax(),
    ])

# Convert to TFF format. `preprocessed_sample_dataset` is a client dataset
# you have already batched and preprocessed; it supplies the input spec.
def model_fn():
    keras_model = create_keras_model()
    return tff.learning.from_keras_model(
        keras_model,
        input_spec=preprocessed_sample_dataset.element_spec,
        loss=tf.keras.losses.SparseCategoricalCrossentropy(),
        metrics=[tf.keras.metrics.SparseCategoricalAccuracy()]
    )

# Create a federated averaging algorithm
fed_avg = tff.learning.build_federated_averaging_process(
    model_fn,
    client_optimizer_fn=lambda: tf.keras.optimizers.SGD(0.1),
    server_optimizer_fn=lambda: tf.keras.optimizers.SGD(1.0)
)

# Initialize the server state
state = fed_avg.initialize()

# Run federated training for multiple rounds.
# NUM_ROUNDS, NUM_CLIENTS_PER_ROUND, client_ids, and client_train_data are
# configuration and data you define for your deployment.
for round_num in range(NUM_ROUNDS):
    # Sample a subset of clients for this round
    sampled_clients = np.random.choice(
        client_ids, size=NUM_CLIENTS_PER_ROUND, replace=False)

    # Get the datasets for the selected clients
    sampled_train_data = [client_train_data[client] for client in sampled_clients]

    # Perform one round of federated averaging
    state, metrics = fed_avg.next(state, sampled_train_data)
    print(f'Round {round_num}: {metrics}')
```

Note that recent TFF releases replace `tff.learning.build_federated_averaging_process` with `tff.learning.algorithms.build_weighted_fed_avg`; check the API reference for your installed version.
Enterprise Implementation with PySyft
For enterprise environments, PySyft offers additional privacy features:
```python
import torch
import syft as sy

# Initialize the PySyft hook (PySyft 0.2.x API; later versions replaced
# TorchHook/VirtualWorker with a different architecture)
hook = sy.TorchHook(torch)

# Create virtual workers (in production, these would be real remote workers)
hospital_a = sy.VirtualWorker(hook, id="hospital_a")
hospital_b = sy.VirtualWorker(hook, id="hospital_b")
insurance = sy.VirtualWorker(hook, id="insurance")
central_server = sy.VirtualWorker(hook, id="central_server")

# Define model
class MedicalDiagnosisModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = torch.nn.Linear(100, 50)
        self.fc2 = torch.nn.Linear(50, 10)

    def forward(self, x):
        x = torch.nn.functional.relu(self.fc1(x))
        return self.fc2(x)

model = MedicalDiagnosisModel()

# Send copies of the model to the workers for local training
model_hospital_a = model.copy().send(hospital_a)
model_hospital_b = model.copy().send(hospital_b)

# After local training (simplified here)
# ...

# Get the models back and average their parameters
model_updates = [model_hospital_a.get(), model_hospital_b.get()]

# Apply the averaged parameter deltas to the global model.
# Module.parameters() returns a generator, so materialize it as a list
# before indexing into it.
with torch.no_grad():
    for param_idx, param in enumerate(model.parameters()):
        update = torch.zeros_like(param)
        for client_model in model_updates:
            client_param = list(client_model.parameters())[param_idx]
            update += client_param - param
        update /= len(model_updates)
        param.add_(update)

# Send the updated model back to clients
# ...
```
Privacy-Enhancing Techniques in Federated Learning
While federated learning inherently improves privacy, additional techniques can further strengthen protection:
Differential Privacy
Adding noise to model updates to prevent extraction of individual data points:
```python
import tensorflow_privacy

# Build a fresh differentially private optimizer per model. Constructing the
# optimizer inside model_fn matters: TFF calls model_fn multiple times, and
# optimizer state must not be shared across those calls.
def make_dp_optimizer():
    return tensorflow_privacy.DPKerasSGDOptimizer(
        l2_norm_clip=1.0,        # clip each per-example gradient to this L2 norm
        noise_multiplier=0.5,    # noise std relative to the clip norm
        num_microbatches=1,
        learning_rate=0.1
    )

# Use this optimizer in your federated learning process.
# (`tff.learning.from_compiled_keras_model` existed only in early TFF
# releases; current releases pass the loss and metrics directly to
# `tff.learning.from_keras_model` instead.)
def model_fn():
    keras_model = create_keras_model()
    keras_model.compile(
        optimizer=make_dp_optimizer(),
        loss=tf.keras.losses.SparseCategoricalCrossentropy(),
        metrics=[tf.keras.metrics.SparseCategoricalAccuracy()]
    )
    return tff.learning.from_compiled_keras_model(
        keras_model,
        sample_batch  # an example batch that defines the input types
    )
```
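Independent of any particular framework, the clip-then-noise mechanism at the heart of differentially private training can be sketched in plain NumPy. Function and parameter names here are illustrative, not a library API:

```python
import numpy as np

def clip_update(update, l2_norm_clip):
    """Scale the update down so its L2 norm is at most l2_norm_clip."""
    norm = np.linalg.norm(update)
    return update * min(1.0, l2_norm_clip / norm) if norm > 0 else update

def dp_aggregate(updates, l2_norm_clip=1.0, noise_multiplier=0.5, rng=None):
    """Average clipped client updates, then add Gaussian noise calibrated
    to the clip norm, bounding any single client's influence."""
    rng = rng or np.random.default_rng()
    clipped = [clip_update(u, l2_norm_clip) for u in updates]
    mean = np.mean(clipped, axis=0)
    noise = rng.normal(0.0, noise_multiplier * l2_norm_clip / len(updates),
                       size=mean.shape)
    return mean + noise
```

The clip bounds each client's sensitivity; the noise scale then determines the privacy guarantee, which a production system would track with a privacy accountant.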
Secure Aggregation
Cryptographic techniques to aggregate model updates without revealing individual contributions:
```python
# Pseudocode for a secure aggregation protocol. For the ciphertext sum below
# to be meaningful, encrypt() must be additively homomorphic; and in a real
# deployment the decryption key is held by the clients (or secret-shared
# among them) rather than the server, so no individual update is exposed.
def secure_aggregation(model_updates, encryption_key):
    # Each client encrypts their model update
    encrypted_updates = []
    for update in model_updates:
        encrypted_updates.append(encrypt(update, encryption_key))

    # Server sums the encrypted updates without seeing any single one
    aggregated_encrypted_update = sum(encrypted_updates)

    # Only the aggregate is decrypted
    aggregated_update = decrypt(aggregated_encrypted_update, encryption_key)
    return aggregated_update
```
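A widely used alternative to a single-key scheme is pairwise masking, as in Bonawitz et al.'s secure aggregation protocol: each pair of clients agrees on a random mask, one adds it and the other subtracts it, so individual uploads look random while the masks cancel exactly in the server's sum. A toy NumPy sketch, omitting the key agreement and dropout handling a real protocol needs:

```python
import numpy as np

def mask_updates(updates, seed=0):
    """Add cancelling pairwise masks to client updates. In a real protocol each
    mask would be derived from a key exchanged between the two clients."""
    rng = np.random.default_rng(seed)
    n = len(updates)
    masked = [u.astype(float).copy() for u in updates]
    for i in range(n):
        for j in range(i + 1, n):
            mask = rng.normal(size=updates[0].shape)
            masked[i] += mask   # client i adds the pairwise mask
            masked[j] -= mask   # client j subtracts the same mask
    return masked

updates = [np.array([1.0, 2.0]), np.array([3.0, 4.0]), np.array([5.0, 6.0])]
masked = mask_updates(updates)

# The server sums the masked uploads; every mask appears once with each sign,
# so they cancel and the true sum of updates is recovered.
total = np.sum(masked, axis=0)
```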
Homomorphic Encryption
Enabling computations on encrypted data without decryption:
```python
import tenseal as ts

# Create a TenSEAL context for the CKKS scheme
ctx = ts.context(
    ts.SCHEME_TYPE.CKKS,
    poly_modulus_degree=8192,
    coeff_mod_bit_sizes=[60, 40, 40, 60]
)
ctx.global_scale = 2**40

# Each client encrypts their model update
def encrypt_model_update(update, ctx):
    flattened = update.flatten().tolist()
    return ts.ckks_vector(ctx, flattened)

# Server adds the encrypted updates without decrypting them
def aggregate_encrypted_updates(encrypted_updates):
    result = encrypted_updates[0]
    for update in encrypted_updates[1:]:
        result = result + update
    return result

# Each client would then decrypt the aggregated result
# ...
```
Industry Applications
Healthcare
In healthcare, federated learning allows hospitals and research institutions to collaborate on developing advanced diagnostic models without sharing sensitive patient data:
```python
# Example: Multi-hospital pneumonia detection system
import torch
import torchvision

class PneumoniaDetectionModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # ImageNet-pretrained backbone (the `pretrained=True` flag is
        # deprecated in current torchvision in favor of `weights`)
        self.features = torchvision.models.resnet18(weights="IMAGENET1K_V1")
        self.features.fc = torch.nn.Linear(512, 2)  # binary classification head

    def forward(self, x):
        return self.features(x)

# Each hospital trains locally on its own X-ray images;
# the central server aggregates model updates;
# no patient data is ever exchanged.
```
Finance
Financial institutions can develop fraud detection systems while keeping transaction data private:
```python
# Example: Federated fraud detection across banks
class TransactionFraudModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        self.dense1 = tf.keras.layers.Dense(64, activation='relu')
        self.dense2 = tf.keras.layers.Dense(32, activation='relu')
        self.dense3 = tf.keras.layers.Dense(1, activation='sigmoid')

    def call(self, inputs):
        x = self.dense1(inputs)
        x = self.dense2(x)
        return self.dense3(x)

# Each bank trains on local transaction data;
# model updates are aggregated securely;
# transaction details remain within each bank's system.
```
Telecommunications
Telecom providers can improve network optimization and user experience models without exposing user behavior data:
```python
# Example: Network traffic optimization
class NetworkOptimizationModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # batch_first=True so inputs are (batch, seq_len, features),
        # matching the indexing in forward() below
        self.lstm = torch.nn.LSTM(input_size=24, hidden_size=100,
                                  num_layers=2, batch_first=True)
        self.fc = torch.nn.Linear(100, 24)

    def forward(self, x):
        lstm_out, _ = self.lstm(x)
        # Predict from the last time step's hidden state
        return self.fc(lstm_out[:, -1, :])

# Each provider trains on local network traffic patterns;
# the resulting model predicts optimal resource allocation;
# user behavior details remain private.
```
Implementation Challenges and Solutions
Communication Efficiency
Federated learning can be bandwidth-intensive. Solutions include:
- Model Compression:
```python
# Quantize model updates to 8 bits before transmission.
# tf.quantization.quantize returns (quantized_values, output_min, output_max);
# the min/max must travel with the values so the server can dequantize.
def compress_update(update):
    return tf.quantization.quantize(
        update, min_range=-1.0, max_range=1.0, T=tf.quint8)

def decompress_update(compressed_update):
    quantized, out_min, out_max = compressed_update
    return tf.quantization.dequantize(quantized, out_min, out_max)
```
- Sparse Updates:
```python
# Send only the largest-magnitude gradients
def sparsify_update(update, sparsity=0.1):
    # Keep only the top fraction (e.g. 10%) of gradients by magnitude
    flat = tf.reshape(update, [-1])
    k = int(tf.size(flat).numpy() * sparsity)
    _, indices = tf.math.top_k(tf.abs(flat), k=k)
    sparse_update = tf.sparse.SparseTensor(
        indices=tf.cast(tf.expand_dims(indices, 1), tf.int64),
        values=tf.gather(flat, indices),
        dense_shape=[tf.size(flat, out_type=tf.int64)])
    # top_k returns indices ordered by magnitude; SparseTensor ops expect
    # row-major order, so reorder before use
    return tf.sparse.reorder(sparse_update)
```
System Heterogeneity
Clients with varying computational capabilities require adaptive approaches:
```python
# Adapt local computation based on client capabilities
def get_client_computation_plan(client_capabilities):
    if client_capabilities == "high":
        return {"local_epochs": 5, "batch_size": 64, "learning_rate": 0.01}
    elif client_capabilities == "medium":
        return {"local_epochs": 3, "batch_size": 32, "learning_rate": 0.01}
    else:  # low
        return {"local_epochs": 1, "batch_size": 16, "learning_rate": 0.005}
```
Model Poisoning Attacks
Detecting and mitigating malicious updates:
```python
import numpy as np

# Simple outlier detection over update norms
def detect_poisoned_updates(updates, threshold=2.0):
    # Compute the L2 norm of every client update
    update_norms = [tf.norm(update).numpy() for update in updates]
    mean_norm = np.mean(update_norms)
    std_norm = np.std(update_norms)
    if std_norm == 0:  # all updates identical; nothing to flag
        return []

    # Flag updates whose norm deviates significantly from the mean
    suspicious_updates = []
    for i, norm in enumerate(update_norms):
        z_score = (norm - mean_norm) / std_norm
        if abs(z_score) > threshold:
            suspicious_updates.append(i)
    return suspicious_updates
```
Decision Rules
Use this checklist for federated learning decisions:
- If data cannot leave its location due to regulations, federated learning may apply
- If model convergence time is acceptable, federated learning works; real-time needs may not fit
- If clients have heterogeneous hardware, plan for adaptive computation budgets
- If adversarial updates are a concern, add outlier detection for model aggregation
- If privacy guarantees need mathematical proof, add differential privacy
Federated learning adds communication overhead and implementation complexity. Only use it when data cannot be centralized.
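As a rough illustration, the checklist can be encoded as a small helper. The function and flag names are invented for this sketch:

```python
def federated_learning_plan(data_must_stay_local, latency_tolerant,
                            heterogeneous_clients, adversaries_possible,
                            need_formal_privacy):
    """Turn the decision checklist into a list of recommendations."""
    if not data_must_stay_local:
        return ["centralize the data; skip federated learning"]
    if not latency_tolerant:
        return ["federated convergence is slow; reconsider for real-time needs"]
    plan = ["use federated learning"]
    if heterogeneous_clients:
        plan.append("add adaptive per-client computation budgets")
    if adversaries_possible:
        plan.append("add outlier detection before aggregation")
    if need_formal_privacy:
        plan.append("add differential privacy")
    return plan
```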