
πŸ“Š Adds automatic benchmarking #241

Merged
paulcscharf merged 6 commits into master from benchmarks on Oct 20, 2021

Conversation

tomrijnbeek
Member

✨ What's this?

This PR adds automatic benchmarking to our PRs, and demonstrates it by adding some simple benchmarks of our Linq extensions.

πŸ”— Relationships

Requirement to complete #183

πŸ” Why do we want this?

This library is specifically aiming to provide high performance code to use in game development. Benchmarks help us quantify this claim, and can catch unexpected performance regressions.

πŸ— How is it done?

The benchmarks themselves are run by BenchmarkDotNet, which appears to be the most widely used benchmarking library for .NET.
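
For reference, a minimal BenchmarkDotNet benchmark class looks roughly like the sketch below (illustrative names only, not the exact benchmarks added in this PR):

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;

// Illustrative only: a small BenchmarkDotNet class comparing a Linq call
// against a hand-written loop over the same data.
public class LinqExtensionBenchmarks
{
    private List<int> numbers = new List<int>();

    [GlobalSetup]
    public void Setup()
    {
        var random = new Random(1337);
        numbers = Enumerable.Range(0, 10_000).Select(_ => random.Next()).ToList();
    }

    [Benchmark(Baseline = true)]
    public int SumWithLoop()
    {
        var sum = 0;
        foreach (var n in numbers) sum += n;
        return sum;
    }

    [Benchmark]
    public int SumWithLinq() => numbers.Sum();
}

public static class Program
{
    // BenchmarkDotNet discovers the [Benchmark] methods, runs them with
    // warm-up iterations, and reports statistics per method.
    public static void Main() => BenchmarkRunner.Run<LinqExtensionBenchmarks>();
}
```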

The GitHub workflow is built on this action. It is currently a fork of a different, now inactive action. Not a great state to be in, but this appears to be the only way to actually get performance comparisons between a baseline and the PR version.

The workflow is executed on each PR, as we do with tests. The latest results for the master branch are stored using the cache action (you can see that the final step copies our results back into the cache location, but only when we are on the master branch). If performance regresses significantly, a warning comment is added to the PR.
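
To make the moving parts concrete, here is a rough sketch of this kind of workflow. The comparison action, project path, and cache layout below are placeholders, not the exact contents of this PR; only actions/checkout, actions/setup-dotnet, actions/cache, and the dotnet CLI are real, existing tools:

```yaml
name: Benchmark

on:
  pull_request:
  push:
    branches: [master]

jobs:
  benchmark:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2
      - uses: actions/setup-dotnet@v1
        with:
          dotnet-version: 5.0.x
      # Restore the latest master results so the PR run has a baseline to compare against.
      - uses: actions/cache@v2
        with:
          path: ./benchmark-cache
          key: benchmark-baseline-${{ github.sha }}
          restore-keys: benchmark-baseline-
      - name: Run benchmarks
        run: dotnet run -c Release --project Benchmarks   # placeholder project path
      # Placeholder for the (forked) comparison action: it compares the fresh results
      # against the cached baseline and comments on the PR on a significant regression.
      - name: Compare against master baseline
        uses: example-org/benchmark-compare@v1            # placeholder name
      # Copy the new results into the cache location, but only on master.
      - name: Update baseline
        if: github.ref == 'refs/heads/master'
        run: cp -r BenchmarkDotNet.Artifacts/results ./benchmark-cache/
```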

πŸ’₯ Breaking changes

N/A

πŸ”¬ Why not another way?

There are a few other possibilities:

  • Don't use the action that parses the benchmark, and use the built-in BenchmarkDotNet Markdown exporter instead. The Markdown can then be uploaded as an artefact of the action, and can be made visible that way. It would require manual inspection whenever we change performance-critical code, so making it more proactive sounded like a good deal. However, if things turn out not to work that well, I think this is a great alternative choice (see the sketch after this list).
  • Only run benchmarks on the master branch. This would basically only keep track of ongoing performance, and export the results to our documentation. This didn't seem particularly useful.
  • The GitHub workflow isn't particularly tamperproof. It is fairly straightforward to remove the "if" from the cache persisting action, and taint the cache with wrong results. Changing this would require us to set up a separate GitHub workflow that always gets read from the master branch, so that local PR changes do not change it. This complication was considered not worth it at this time. I believe it will take us some time to find the right way to integrate benchmarks into our workflows, so let's start simple and iterate as needed.
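
As a rough illustration of the first alternative (assuming BenchmarkDotNet's standard configuration API; not code from this PR), the Markdown export could be wired up like this:

```csharp
using BenchmarkDotNet.Configs;
using BenchmarkDotNet.Exporters;
using BenchmarkDotNet.Running;

public static class Program
{
    public static void Main()
    {
        // Export results as GitHub-flavoured Markdown; the workflow would then
        // upload the generated file as an artefact for manual inspection.
        var config = ManualConfig.Create(DefaultConfig.Instance)
            .AddExporter(MarkdownExporter.GitHub);

        // Reuses the illustrative LinqExtensionBenchmarks class sketched earlier.
        BenchmarkRunner.Run<LinqExtensionBenchmarks>(config);
    }
}
```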

πŸ¦‹ Side effects

N/A

πŸ’‘ Review hints

None.

[SuppressMessage("ReSharper", "ClassCanBeSealed.Global")]
public sealed class Extensions
{
    private const int seed = 1337;
Member

This could potentially lead to inconsistent results if the implementation of Random changes. Did you try just not using a seed to average over a variety of different seeds? (Assuming seeds of Random objects created quickly after each other are random enough - otherwise we could consider generating them with the static random class...)

Member Author

I believe using a fixed seed is desirable. You are right that this could lead to inconsistent results if the implementation of Random changes. However, the implementation of Random can only change because we update a dependency or the runtime. In that case, the results being inconsistent in itself becomes an important signal. If we see a significant performance reduction because we update our .NET version, then that is worth finding.

By using a variety of different seeds, we surely average things out correctly, but all of a sudden we have to start tuning our benchmarks to make them (1) as fast as possible while at the same time making them (2) run often enough to balance out the statistical noise from the randomness.

A fixed seed is used across the board in the examples in the BenchmarkDotNet documentation. I believe my argument above justifies following that example.
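
For concreteness, the difference between the two approaches would look roughly like this in a setup method (an illustrative sketch, not the code in this PR):

```csharp
using System;
using System.Linq;
using BenchmarkDotNet.Attributes;

public class SeedingSketch
{
    private int[] data = Array.Empty<int>();

    [GlobalSetup]
    public void Setup()
    {
        // Fixed seed: every run benchmarks identical data, so a shift in the
        // results points at a runtime/library change rather than at the input.
        var random = new Random(1337);

        // Unseeded alternative: new Random() picks a different seed each run,
        // averaging over inputs but adding run-to-run variance that the
        // benchmarks would need enough iterations to smooth out.
        // var random = new Random();

        data = Enumerable.Range(0, 10_000).Select(_ => random.Next()).ToArray();
    }

    [Benchmark]
    public int Sum() => data.Sum();
}
```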

Member

The library examples themselves using a static seed surprises me, but perhaps there is indeed a reason... but given that it reruns the test many times, it would still make more sense, at least in principle, to use random seeds to find out NOW if the performance is inconsistent (doesn't the library give us a variance measurement?), instead of waiting for a runtime update to surprise us. :D

Member

@paulcscharf left a comment

I'm happy to take this for now, because it's a good thing, and we can look further into the random stuff another time.

@paulcscharf merged commit 012e785 into master Oct 20, 2021
@paulcscharf deleted the benchmarks branch October 20, 2021 20:24