The HeteroSwitch research, detailed in “HeteroSwitch: Characterizing and Taming System-Induced Data Heterogeneity in Federated Learning,” explores the challenges posed by data heterogeneity in Federated Learning (FL) systems. This research has been published on arXiv and presented at the 7th Annual Conference on Machine Learning and Systems (MLSys 2024), providing key insights into managing data inconsistencies across diverse systems.
Dataset Description
It includes a wide array of data collected from multiple types of devices, focusing on the effects of system variability on data uniformity.
Data Collection Methodology
To assess the influence of system-induced heterogeneity on FL, the dataset was generated using nine different smartphone models: Galaxy S22, Galaxy S9, Galaxy S6, VELVET, G7, G4, Pixel 5, Pixel 2, and Nexus 5X. Each smartphone was mounted on a tripod and used to capture images displayed on a monitor within a controlled darkroom setting.