Effortless Federated Learning on Mobile with NVIDIA FLARE and Meta ExecuTorch

NVIDIA and the PyTorch team at Meta announced a groundbreaking collaboration that brings federated learning (FL) capabilities to mobile devices through the integration of NVIDIA FLARE and ExecuTorch. 

NVIDIA FLARE is a domain-agnostic, open-source, extensible SDK that enables researchers and data scientists to adapt existing machine learning or deep learning workflows to a federated paradigm. It also enables platform developers to build a secure, privacy-preserving offering for a distributed multi-party collaboration. 

ExecuTorch is an end-to-end solution for enabling on-device inference and training capabilities across mobile and edge devices. It is part of the PyTorch Edge ecosystem and enables efficient deployment of various PyTorch models to edge devices. 

By integrating the two, we offer a solution for you to leverage the power of FL on mobile devices while preserving user privacy and data security. To enable cross-device FL, there are two critical components:

  • An FL environment capable of orchestrating the learning flow across a participant pool of millions of devices.
  • An effective on-device training workflow, preferably one that is easy to migrate to from your existing development environment.

With the collaboration between NVIDIA FLARE and ExecuTorch, you can now define your model architecture and training parameters in familiar PyTorch code, as sketched after the following list, and migrate it to the cross-device FL paradigm:

  • NVIDIA FLARE handles the cross-device FL process with new modules designed for edge applications, both for the federated workflow and for on-device development.
  • ExecuTorch enables the edge-side training and easy migration from existing PyTorch solutions.
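As a minimal illustration, the device-side learning problem is defined in plain PyTorch. The model and hyperparameters below are placeholders, not part of the NVIDIA FLARE or ExecuTorch APIs:

import torch
import torch.nn as nn

# Placeholder model: a small CNN suitable for on-device training.
class TinyMobileCNN(nn.Module):
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.classifier = nn.Linear(32, num_classes)

    def forward(self, x):
        x = self.features(x).flatten(1)
        return self.classifier(x)

# Placeholder training parameters that the federated pipeline would distribute.
train_config = {"learning_rate": 1e-3, "local_epochs": 1, "batch_size": 32}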

To support large-scale deployments, NVIDIA FLARE implements a hierarchical FL architecture, enabling the efficient management of a large number of edge devices. This solution ensures reliable and scalable model training across distributed mobile devices while keeping all data local.

NVIDIA FLARE and ExecuTorch are democratizing edge AI training on mobile devices by making it more accessible and efficient while preserving privacy in decentralized AI:

  • Effortless development: Abstracted device complexity, handling hardware, OS, ML frameworks, and programming languages for seamless FL on mobile.
  • Streamlined prototyping: A Device Simulator to simulate a large number of devices.
  • Industrial-ready federated deployment: A cross-device FL system supporting a large number of devices.

With this framework, data scientists can focus solely on defining the model architecture and training parameters in PyTorch, and on designing the federated pipeline.

Collaborative model learning on distributed edge devices

Many AI models and applications rely on everyday data generated at edge devices in people's daily lives: for example, predictive text and speech recognition, smart home automation, autonomous driving, traffic prediction, and fraud detection for financial services. With millions of AI-enabled devices in use, a major proportion of the data used by AI will be generated at the edge.

Unlike specialized enterprise datasets, which are typically curated, structured, labeled under controlled conditions, and stored in centralized silos, everyday data at the edge is highly dynamic, influenced by individual user behaviors, environmental factors, and specific conditions across heterogeneous devices.

AI model training usually prefers stable data access, fast communication channels, abundant computation resources, and controlled data diversity. For everyday data with such a dynamic and distributed nature, the simplest solution is to collect data from the diverse sources into a central location for training.

However, edge-based applications are usually restricted by strong privacy constraints and practical limitations, making centralized training infeasible. 

In this case, FL across distributed edge devices becomes a potential solution. Nevertheless, it is still challenging to collaboratively train robust AI models based on data collected at edge devices, due to intermittent device connectivity, limited communication bandwidth, variable device capabilities, and highly heterogeneous data distributions. 

To properly address these challenges and enable effective AI model learning in a distributed fashion, a well-designed FL framework with on-device functionalities can play an important role in effectively leveraging all data from a diverse set of devices.

Such an FL framework must have the capability to handle the following:

  • The large number of participating devices, as the candidate pool is often at a scale of millions.
  • Different operating systems and hardware environments, leading to high device-programming complexity.
  • Challenges from limited computation capacity and communication bandwidth.
  • Challenges caused by connectivity, such as devices joining and leaving at any time.

Therefore, the framework must properly orchestrate the model update and aggregation aspects of a federated pipeline and address these challenges.

Challenge: Large numbers of devices
Solution: NVIDIA FLARE uses a hierarchical communication and aggregation mechanism in which a logical tree structure achieves exponential efficiency in handling thousands of concurrent connections.

Challenge: Different operating systems and hardware environments
Solution: Meta ExecuTorch achieves streamlined cross-platform deployment with the NVIDIA FLARE edge modules.

Challenge: Limited computation capacity and communication bandwidth
Solution: NVIDIA FLARE enables you to use efficient models, compress updates, run longer local training, and either optimize client selection or use hierarchical FL to reduce compute, communication, and bandwidth costs.

Challenge: Connectivity
Solution: NVIDIA FLARE provides robust solutions and the flexibility to account for different scenarios and use cases.

Table 1. Framework orchestration solutions

Hierarchical FL for cross-device applications

In a single-layer FL pipeline, a global server directly communicates with clients and uses simple logic to collect and aggregate model updates. However, for cross-device use cases, where you potentially have tens of thousands of concurrent devices from a pool of millions of participants, this basic single-layer communication structure does not scale.

Figure 1. Hierarchical FL for cross-device application

NVIDIA FLARE implements a hierarchical FL system with a tree-structured architecture (Figure 1):

  • Server: Provides functionality of overall orchestration, global evaluation, job and task preparation, and so on. 
  • Aggregators: Perform task routing from server to leaf nodes and result aggregation from the leaf nodes back to the server.
  • Leaf nodes: The only nodes that interact directly with the edge devices. Devices connect to leaf nodes through web gateways, which are deployed as multiple instances for web infrastructure scalability.

By adding multiple layers of aggregation, we keep the workload on each node of the system affordable and robust. With such a robust hierarchy, advanced FL algorithms can be easily implemented and scaled. This tree hierarchy is logical; the physical connections and communication routes between nodes can be further optimized according to network conditions.

By using web gateways over HTTPS, we ensure that each device is properly connected with encryption guarantees and that its data is transmitted safely to the appropriate FL clients for processing. The web gateway also handles device registration, session management, and load balancing, optimizing connectivity for large-scale deployments. This approach offers a scalable solution for supporting potentially millions of edge devices or mobile phones in an FL scenario.
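To make the layer-by-layer aggregation concrete, here is a minimal, framework-agnostic sketch of the weighted averaging an aggregator node might perform. It illustrates the idea only and does not use the actual NVIDIA FLARE aggregator classes:

def aggregate(updates):
    """Weighted-average a list of (state_dict, num_samples) pairs.

    Each aggregator in the tree applies the same rule to the results
    received from its children, so the partial averages compose layer
    by layer up to the server.
    """
    total = sum(n for _, n in updates)
    avg = {}
    for key in updates[0][0]:
        avg[key] = sum(sd[key] * (n / total) for sd, n in updates)
    return avg, total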

A diagram shows the NVIDIA FLARE Controller on the server sending a task to a leaf node and EdgeTaskExecutor. Other steps include local model updates through the web gateways, hierarchical information flow and aggregation, and on-device training.
Figure 2. Cross-device federated workflow 

Figure 2 shows how NVIDIA FLARE and ExecuTorch work together step-by-step to enable cross-device FL:

  1. The FLARE Controller on the server first prepares a task containing the global model, and dispatches it through the hierarchical system until it reaches the leaf node.
  2. The FLARE EdgeTaskExecutor on the leaf node processes the task, exports the global model to the ExecuTorch format (if the Controller has not already done so), and sends the task to individual devices through web gateways.
  3. On the device, the Device module of NVIDIA FLARE bridges the FL system and on-device training pipelines, where ExecuTorch loads the received global model, performs on-device training with local data, and sends the new model to the Device module. 
  4. The Device module then prepares a message containing the local model updates (the difference between the global model and the new model, potentially reinforced with privacy-protection mechanisms, as sketched after this list) and sends the updates back to the FLARE EdgeTaskExecutor.
  5. The FLARE EdgeTaskExecutor then processes the received updates and sends them through the hierarchical system, where the updates from different sources are aggregated layer by layer.
  6. Upon receiving the top-level aggregated model updates, the FLARE Controller evaluates and prepares the new global model for the next task assignment. 
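Step 4 computes the local update as the per-parameter difference between the newly trained weights and the received global weights. A minimal sketch of that computation follows; the function name is illustrative, and the actual Device module implements this natively on the device:

def compute_update(global_weights, local_weights):
    """Per-parameter difference sent back as the local model update.

    Privacy-protection mechanisms (for example, clipping or added noise)
    could be applied to the result before it leaves the device.
    """
    return {k: local_weights[k] - global_weights[k] for k in global_weights}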

Effortless cross-device federated pipeline development

With the support of NVIDIA FLARE and ExecuTorch, we provide a streamlined cross-device FL pipeline that enables an efficient general workflow with customizable components, so you can focus on innovation without worrying about the underlying infrastructure.

On the device-side, the Device module establishes a core training workflow (for example, Objective-C on iOS), which serves as the bridge between FL system and on-device training, providing you with configurable parameters and privacy settings. It handles the heterogeneous hardware, OS, ML frameworks, and programming languages with abstracted device complexity. 

Using ExecuTorch, you focus on developing novel models and training recipes, which can then be easily migrated to a device platform.
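As a rough illustration of that migration path, the standard ExecuTorch export flow lowers a PyTorch module into a .pte program that the on-device runtime can load. This sketch uses a placeholder model and shows inference-style export; the training-oriented export used for FL involves additional steps:

import torch
from torch.export import export
from executorch.exir import to_edge

# Placeholder model; in practice this is the PyTorch model you developed.
model = torch.nn.Sequential(torch.nn.Linear(4, 2)).eval()
example_inputs = (torch.randn(1, 4),)

# Capture the graph and lower it to an ExecuTorch program.
et_program = to_edge(export(model, example_inputs)).to_executorch()

# Serialize to a .pte file for the on-device ExecuTorch runtime.
with open("model.pte", "wb") as f:
    f.write(et_program.buffer)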

On the federated pipeline side, you define customized FL schemes, for example, synchronous or asynchronous, with NVIDIA FLARE's Controller API. Customizable aggregators provide versatile options that best fit the needs of specific learning goals. A collection of commonly used controllers and aggregators from NVIDIA FLARE can be applied as defaults.

Everything else, from global model distribution and local task execution to collecting model updates for producing new global models, is taken care of automatically, liberating you from complicated implementation details.

In addition, we provide DeviceSimulators to perform local FL end-to-end simulations of mobile applications. This provides a flexible mechanism for prototyping, covering model training based on user-chosen training recipes. 

For example, you can experiment with a PyTorch-based trainer and compare it with an ExecuTorch-based version to make sure that the to-be-deployed ExecuTorch solution aligns with the PyTorch counterpart. You can also experiment with different aggregators to test their performance under realistic settings.  

In other words, you can prototype your cross-device FL pipelines using Python and perform end-to-end local simulations for the entire FL study, the same as you would for other model learning processes using NVIDIA FLARE. 
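As a rough sketch of what such a local simulation exercises (the function below is hypothetical and is not the NVIDIA FLARE DeviceSimulator API), one simulated round runs the chosen trainer on each simulated device, applies the aggregator under test, and produces the next global model:

def simulate_round(global_weights, device_datasets, local_trainer, aggregate):
    """Hypothetical single-round simulation loop for prototyping.

    local_trainer is the user-chosen recipe (PyTorch- or ExecuTorch-based)
    and returns updated weights plus the number of local samples;
    aggregate is the aggregator under test, such as plain weighted averaging.
    """
    updates = []
    for data in device_datasets:
        new_weights, num_samples = local_trainer(global_weights, data)
        delta = {k: new_weights[k] - global_weights[k] for k in global_weights}
        updates.append((delta, num_samples))
    avg_delta, _ = aggregate(updates)
    return {k: global_weights[k] + avg_delta[k] for k in global_weights}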

Next, you can migrate and deploy the prototype to a real cross-device FL system with ease. There's no need to switch to mobile platform development in other languages (such as Objective-C, Swift, Java, or Kotlin) or to worry about network connectivity and routing. NVIDIA FLARE and ExecuTorch together handle all device-specific training requirements and federated orchestration.

Running NVFlare Mobile example

To get you started, we’ve provided examples to show this simulation process. In the Running NVFlare Mobile Example, we currently cover the following applications under two training schemes: 

  • hello_mobile: A simple job that uses a list of numbers to test the edge functions, including hierarchical aggregation and task handling. It demonstrates the federated process.
  • xor_mobile and xor_mobile_pt: Train a super-light neural network for a simple XOR logic task with fixed data for all participants, using both ExecuTorch and regular PyTorch trainers (see the sketch after this list). This demonstrates a pipeline with neural network training.
  • cifar10_mobile and cifar10_mobile_pt: Train a convolutional neural network for a standard CIFAR-10 classification task. Both ExecuTorch and PyTorch schemes are provided. In this case, each participant trains on different subsets of CIFAR-10, simulating a more realistic cross-device collaboration.    
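As a rough idea of what such a super-light XOR network looks like in plain PyTorch (this is an illustrative sketch, not the exact model or trainer used in the example):

import torch
import torch.nn as nn

# Fixed XOR data shared by all participants in this illustrative setup.
xor_inputs = torch.tensor([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
xor_labels = torch.tensor([0, 1, 1, 0])

# A super-light network: two linear layers with a small hidden width.
model = nn.Sequential(nn.Linear(2, 8), nn.ReLU(), nn.Linear(8, 2))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

for _ in range(500):
    optimizer.zero_grad()
    loss = loss_fn(model(xor_inputs), xor_labels)
    loss.backward()
    optimizer.step()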

For real-world deployment, the process includes three parts: 

  • Setting up NVIDIA FLARE
  • Enabling training on edge devices through ExecuTorch
  • Performing FL sessions

The study starts with setting up the NVIDIA FLARE system, including provisioning the system and setting up the web gateways, which can be as simple as two commands:

python nvflare/edge/tree_prov.py -r /tmp -p edge_example -d 1 -w 2
python nvflare/edge/web/routing_proxy.py 5000 /tmp/edge_example/lcp_map.json

For more information, see the Running NVFlare Mobile Example.

After setting up the environment, start each node with the script in its startup kit.
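For example, assuming the default startup-kit layout generated by the provisioning step (the exact path and node names depend on your provisioning output), starting a node could look like the following:

cd /tmp/edge_example/<node_name>/startup
./start.sh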

On the edge device side, we've developed a demo iOS app to showcase the capability. In this demo app, you configure the proxy server's IP address and port number, and then initiate the training process (Figure 3).

A mobile screenshot shows the interface with configurable proxy server information, local training options, and the Start Training button.
Figure 3. Demo app

The app then connects to the FL system through the proxy server (Figure 1) and waits for the task assignment.

Training only begins after a job has been submitted to the NVFlare system. The Controller then assigns the specific task to the edge device through the mechanism shown in Figure 2.

To kick-start an FL session, the admin verifies the global configurations (for example, model format and weights) and submits the job to start NVIDIA FLARE training as usual in the NVFlare admin console:

submit_job cifar10_mobile_et

The rest of the federated orchestration is automatically taken care of by NVIDIA FLARE, and the local on-device training is handled by ExecuTorch.

Summary

The NVIDIA FLARE and Meta PyTorch team collaboration represents a significant advance in mobile FL, offering a holistic approach for developing cross-device FL solutions. This pipeline helps handle the complexities of mobile deployment and federated training behind the scenes. 

Through the NVIDIA FLARE hierarchical architecture, the platform can efficiently manage millions of edge devices, making large-scale mobile FL practical and accessible. Data scientists can focus on their core expertise, defining models and training parameters in Python, while NVIDIA FLARE and ExecuTorch handle the intricate details of mobile deployment, federated orchestration, and training scalability. This innovative approach maintains data privacy and security while enabling efficient model training across distributed devices.

The collaboration between NVIDIA and the PyTorch team at Meta marks a significant step forward in democratizing FL for mobile applications, opening new possibilities for privacy-preserving and decentralized AI development at the edge. The two teams are continuing their collaboration to enhance this solution. 

With the goal of democratizing on-device training across various devices, we started with iOS, and will follow with Android and other edge devices.

For more information, see the following resources:

To connect with the NVIDIA FLARE team, contact federatedlearning@nvidia.com.

Acknowledgement

Thanks to the PyTorch team at Meta for their help with this post.

