The OARF Benchmark Suite: Characterization and Implications

The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems - 2020

Research Area: Machine Learning

Abstract:

This paper presents and characterizes the Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning have focused mainly on synthetic datasets and used a limited number of applications. OARF mimics more realistic application scenarios, using publicly available datasets as different data silos across image, text, and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution, and learning task complexity. We have developed reference implementations and evaluated important aspects of federated learning, including model accuracy, communication cost, throughput, and convergence time. These extensive evaluations point to future research opportunities for federated learning systems. They also yield some interesting findings, for example that federated learning can effectively increase end-to-end throughput.

Keywords:

Author(s) Name: Sixu Hu, Yuan Li, Xu Liu, Qinbin Li, Zhaomin Wu, Bingsheng He

Journal name: Computer Science

Conference name:

Publisher name: arXiv:2006.07856

DOI: 10.1145/3510540

Volume Information:

Paper Link: https://arxiv.org/abs/2006.07856
