BenchPilot

BenchPilot: Repeatable & Reproducible Benchmarking for Edge Micro-DCs

BenchPilot is a modular and highly customizable benchmarking framework for edge micro-DCs. It provides a high-level declarative model for describing experiment testbeds and scenarios, and it automates the benchmarking process on Streaming Distributed Processing Engines (SDPEs). This automation lets users focus on performance analysis instead of the complex and time-consuming setup. BenchPilot instantiates the underlying cluster, performs repeatable experimentation, and provides a unified monitoring stack in heterogeneous micro-DCs.

BenchPilot Bootstrapping

Before the experimentation phase starts, the cluster must be bootstrapped. The BenchPilot user only needs to execute BenchPilot's parameterizable installation script on every cluster node. The script installs all necessary software dependencies across the micro-DC and downloads the required workload Docker images.

This step is required only when BenchPilot is first installed or after a hardware update, e.g., when a new device is introduced.

Experiment Setup

A typical workflow starts with the user submitting a YAML file that lists their chosen experiments and the parameters of each. The BenchPilot model is composed of Experiments, in which Workloads are described. Each workload can have the following:

name, selected from the supported workload list,
record name, which the user can later use to retrieve the workload's monitored metrics,
number of repetitions,
duration,
specific workload parameters,
cluster configurations, including the manager node's IP, the list of cluster nodes, etc.,
engine configurations, in the case of streaming distributed workloads.
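
As a rough illustration of the fields listed above, the following Python sketch builds one workload description as a plain dictionary and reports which expected fields are missing. The key names (`record_name`, `repetitions`, `cluster`, `manager`, `nodes`), the sample values, and the `missing_fields` helper are assumptions made for this example; the actual YAML schema is defined by BenchPilot and may differ.

```python
# Hypothetical field names; the real BenchPilot YAML schema may differ.
REQUIRED_FIELDS = ("name", "record_name", "repetitions", "duration", "cluster")


def missing_fields(workload: dict) -> list:
    """Return the required fields that are absent from a workload description."""
    return [field for field in REQUIRED_FIELDS if field not in workload]


workload = {
    "name": "marketing-campaign",      # taken from the supported workload list
    "record_name": "storm-baseline",   # label used later to look up metrics
    "repetitions": 3,
    "duration": "10m",                 # assumed format, for illustration only
    "parameters": {"campaigns": 1000, "tuples_per_second": 10000},
    "cluster": {"manager": "10.0.0.1", "nodes": ["10.0.0.2", "10.0.0.3"]},
    "engine": {"name": "storm", "parameters": {"partitions": 5, "ackers": 2}},
}

print(missing_fields(workload))
```

Checking a description like this before deployment is one way to surface mistakes early, in the same spirit as the validation step described under Deployment.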

Deployment

When the description is ready, the user deploys the application using the BenchPilotSDK through a Jupyter notebook. If the description passes validation, the Parser passes the preferences to the BenchPilot Deployment Template Generator, which transforms them into docker-compose templates. Finally, the Deployment Coordinator deploys each experiment to the underlying orchestrator and closely monitors its performance through the monitoring stack. At the beginning and end of each experiment, the Coordinator records the starting and ending timestamps so that the user can retrieve the monitored information later.
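
The timestamp bookkeeping described above can be pictured with a small Python sketch. It is not the Coordinator's code: the `records` dictionary, the `timed_run` context manager, and the record name are hypothetical, and the `time.sleep` call only stands in for deploying and running a workload.

```python
import time
from contextlib import contextmanager

# Hypothetical in-memory store: record name -> list of (start, end) pairs.
records = {}


@contextmanager
def timed_run(record_name, clock=time.time):
    """Record the start and end timestamps of one experiment repetition."""
    start = clock()
    try:
        yield
    finally:
        # The end timestamp is stored even if the run raises an exception.
        records.setdefault(record_name, []).append((start, clock()))


with timed_run("storm-baseline"):
    time.sleep(0.01)  # stand-in for deploying and running the workload

print(records["storm-baseline"])
```

Keeping one `(start, end)` pair per repetition is what makes it possible to cut the matching slice out of the monitoring data afterwards.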

Monitoring

To extract infrastructure utilization metrics, including CPU, memory, and network utilization, BenchPilot offers a monitoring stack that is transparent to the application under test. To achieve this, BenchPilot instantiates a containerized monitoring agent on every node during the bootstrapping stage. The agent inspects system information (e.g., performance pseudofiles and cgroup files) and extracts the required metrics in a non-intrusive way. The agent starts several probes, one for each sub-component (e.g., cgroup probe, OS probe, etc.), and exposes an API through which a centralized monitoring server periodically retrieves the data and stores it in the monitoring storage. The monitoring agent also offers probes for external resources. For the implementation, we have selected Netdata, a widely used monitoring tool, as the agent and Prometheus, a popular open-source monitoring server, as the server. InfluxDB is used as the monitoring storage backend.
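
To make the idea of reading performance pseudofiles concrete, the sketch below derives CPU utilization from two snapshots of the aggregate `cpu` line of `/proc/stat` on Linux. It is a simplified illustration of the technique, not Netdata's implementation; the function names and the sample lines are made up for this example.

```python
def cpu_times(stat_line: str):
    """Split the aggregate 'cpu' line of /proc/stat into (idle, total) ticks."""
    fields = [int(value) for value in stat_line.split()[1:]]
    # Field order: user nice system idle iowait irq softirq steal guest guest_nice.
    # Guest time is already counted in user/nice, so only the first eight are summed.
    idle = fields[3] + fields[4]  # idle + iowait
    return idle, sum(fields[:8])


def cpu_utilization(before: str, after: str) -> float:
    """Percentage of non-idle CPU time between two /proc/stat snapshots."""
    idle_before, total_before = cpu_times(before)
    idle_after, total_after = cpu_times(after)
    total_delta = total_after - total_before
    if total_delta == 0:
        return 0.0
    idle_delta = idle_after - idle_before
    return 100.0 * (total_delta - idle_delta) / total_delta


# Two made-up snapshots; on a real node they would be read from /proc/stat.
before = "cpu  100 0 50 800 50 0 0 0 0 0"
after = "cpu  160 0 70 900 70 0 0 0 0 0"
print(cpu_utilization(before, after))
```

Because the counters are cumulative, utilization is always computed from the difference between two readings, which is why a probe samples periodically rather than once.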

Post-Experiment Analysis

To create an end-to-end interactive analytic tool for benchmarking, BenchPilot utilizes the Jupyter Notebook stack. Specifically, after the experimentation process is over, the user can request the monitored metrics of each execution from the monitoring storage based on the provided experiments' starting/ending timestamps. Users can apply high-level analytic models to the retrieved metrics of each experiment and have a clear overview of their deployments.
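
The retrieval step amounts to selecting the samples that fall between an experiment's starting and ending timestamps. The Python sketch below shows that selection on an in-memory list; the sample data and the `samples_in_window` helper are hypothetical, and a real notebook would query the monitoring storage instead.

```python
from statistics import mean


def samples_in_window(samples, start, end):
    """Keep the values whose timestamp lies within [start, end]."""
    return [value for timestamp, value in samples if start <= timestamp <= end]


# Made-up (timestamp, cpu_percent) samples around one experiment run.
samples = [(95, 10.0), (100, 40.0), (130, 60.0), (160, 50.0), (170, 5.0)]

window = samples_in_window(samples, start=100, end=160)
print(window, mean(window))
```

Samples outside the window belong to idle time or to other runs, so excluding them keeps per-experiment statistics comparable across repetitions.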

Workload List

For now, BenchPilot supports only the following containerized workloads:

Name	Description	Specific Configuration Parameters
marketing-campaign	A streaming distributed workload that features an application as a data processing pipeline with multiple and diverse steps that emulate insight extraction from marketing campaigns. The workload utilizes technologies such as Kafka and Redis.	campaigns: the number of campaigns (default: 1000); tuples_per_second: the number of emitted tuples per second (default: 10000); kafka_event_count: the number of events generated and published on Kafka (default: 1000000); maximize_data: automatically scales up the data that critically affect the workload's performance, given in the format x10, x100, etc.
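
The defaults and the `maximize_data` format from the table can be illustrated with a short Python sketch that merges user-supplied values with the defaults and parses the multiplier. The `resolve_parameters` and `parse_multiplier` helpers and the `maximize_factor` key are hypothetical, and the sketch deliberately does not guess which data the factor is applied to, since the table does not say.

```python
import re

# Defaults as listed in the workload table.
DEFAULTS = {
    "campaigns": 1000,
    "tuples_per_second": 10000,
    "kafka_event_count": 1000000,
}


def parse_multiplier(value: str) -> int:
    """Turn a value such as 'x10' or 'x100' into the integer factor."""
    match = re.fullmatch(r"x(\d+)", value.strip())
    if match is None:
        raise ValueError(f"expected a value such as 'x10', got {value!r}")
    return int(match.group(1))


def resolve_parameters(user_params: dict) -> dict:
    """Merge user-supplied parameters with the defaults (illustrative only)."""
    unknown = set(user_params) - set(DEFAULTS) - {"maximize_data"}
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    overrides = {k: v for k, v in user_params.items() if k != "maximize_data"}
    resolved = {**DEFAULTS, **overrides}
    if "maximize_data" in user_params:
        # Hypothetical key holding the parsed factor; how it is applied is not shown.
        resolved["maximize_factor"] = parse_multiplier(user_params["maximize_data"])
    return resolved


print(resolve_parameters({"campaigns": 500, "maximize_data": "x10"}))
```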

It's important to note that BenchPilot can be easily extended to add new workloads.

Engine Parameters

In the case of streaming distributed workloads, the user needs to define specific engine parameters along with their experiment declaration. The structure should follow the example below:

   engine:
      name: "storm"
      parameters:
         partitions: 5
         ackers: 2
         executors_per_node: [ 4, 4, 4, 4, 16 ]

For each Streaming Distributed Processing Engine, the following attributes can be specified:

Engine	Parameters
Storm	partitions, ackers, executors_per_node
Flink	partitions, buffer_timeout, checkpoint_interval
Spark	partitions, batchtime, executor_cores, executor_memory
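
The per-engine attributes above lend themselves to a simple check before deployment. The Python sketch below flags parameters that are not listed for the chosen engine. The `unsupported_parameters` helper is hypothetical, and the lowercase names `flink` and `spark` are an assumption by analogy with `"storm"` in the example.

```python
# Attribute names per engine, copied from the table above.
ENGINE_PARAMETERS = {
    "storm": {"partitions", "ackers", "executors_per_node"},
    "flink": {"partitions", "buffer_timeout", "checkpoint_interval"},
    "spark": {"partitions", "batchtime", "executor_cores", "executor_memory"},
}


def unsupported_parameters(engine: dict) -> list:
    """Return the parameter names that are not listed for the given engine."""
    name = engine["name"].lower()
    if name not in ENGINE_PARAMETERS:
        raise ValueError(f"unknown engine: {engine['name']!r}")
    given = set(engine.get("parameters", {}))
    return sorted(given - ENGINE_PARAMETERS[name])


storm = {
    "name": "storm",
    "parameters": {
        "partitions": 5,
        "ackers": 2,
        "executors_per_node": [4, 4, 4, 4, 16],
    },
}

print(unsupported_parameters(storm))
print(unsupported_parameters({"name": "flink", "parameters": {"ackers": 2}}))
```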

Resources

The Team

The creators of BenchPilot are members of the Laboratory for Internet Computing (LInC), University of Cyprus. You can find more information about our research activity on our publications page and in our ongoing projects.

Acknowledgements

This work is partially supported by the EU Commission through the RAINBOW 871403 (ICT-15-2019-2020) project, by the Cyprus Research and Innovation Foundation through the COMPLEMENTARY/0916/0916/0171 project, and by RAIS (Real-time analytics for the Internet of Sports), a Marie Skłodowska-Curie Innovative Training Network (ITN), under grant agreement No 813162.

License

The framework is open-sourced under the Apache 2.0 License. The codebase is maintained by the authors for academic research and is therefore provided "as is".
