AuditBoard Case Study

Flaky Tests

May 14, 2024

AuditBoard’s mission is to cut down administrative tasks for your audit, risk, sustainability, and compliance teams by automating enterprise compliance risk management.

Pain Points

When developers pushed code, builds consistently failed because of flaky tests, often in unrelated areas of the codebase, so every code change meant hours spent troubleshooting tests. Workarounds such as rerunning and retrying builds to triangulate the flaky tests compounded the problem over time. The situation became untenable as the team grew and led to a substantial decrease in the engineering team’s velocity.

Solution

The team turned to BuildPulse, confident that it would not only track flaky tests and their impact automatically, but also provide workflows to quarantine those tests automatically, reducing the developer and CI time spent on flakiness.

Key Points

AuditBoard’s engineering team faced a substantial decrease in velocity because code merges were held up by flaky tests.
With BuildPulse, the quarantine process is blazingly fast, flaky tests are trackable, and their numbers are decreasing. AuditBoard now gets cleaner results every night without additional noise.
By expanding BuildPulse to other parts of the software stack, AuditBoard not only manages flaky tests, but also addresses broken and failing tests effectively.
BuildPulse helps increase engineering velocity, streamlines communication between team members, and automates test processes.

AuditBoard reduced the developer and CI time spent on flaky tests by integrating with BuildPulse.

Flaky Tests: Build vs. Buy

At first, the internal solution consisted of rerunning each test 10 times; code could be merged only if the tests passed. From there, failing tests were tracked manually across various test frameworks, and issues were created to assign ownership. Custom scripts were then written to skip the failing tests that had been captured. Because flaky tests fail intermittently, identifying and cataloging them in a central location through manual triage became unmanageable as the number of tests and developers grew.
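The rerun gate described above can be sketched roughly as follows. This is a minimal illustration, not AuditBoard's actual script: the function names are hypothetical, and it assumes "passed" means every one of the 10 runs succeeded.

```python
from typing import Callable


def passes_rerun_gate(test: Callable[[], None], runs: int = 10) -> bool:
    """Return True only if `test` succeeds on every one of `runs` attempts.

    Assumption: a single failing attempt blocks the merge.
    """
    for _ in range(runs):
        try:
            test()
        except Exception:
            # Any assertion failure or error on any attempt fails the gate.
            return False
    return True


def make_intermittent_test(fail_on_call: int) -> Callable[[], None]:
    """Build a hypothetical test that fails only on its Nth invocation."""
    calls = {"n": 0}

    def test() -> None:
        calls["n"] += 1
        assert calls["n"] != fail_on_call, "intermittent failure"

    return test


def stable_test() -> None:
    """A hypothetical test that always passes."""
    assert 1 + 1 == 2


merge_allowed = passes_rerun_gate(stable_test)
merge_blocked = not passes_rerun_gate(make_intermittent_test(fail_on_call=3))
```

A gate like this multiplies CI time by the number of reruns and still says nothing about which tests are flaky or how often, which is the tracking gap the rest of this section describes.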

Without impact metrics, it was difficult to determine the level of internal resources and workflow changes required to resolve the problem.

All of this adds up in development time and in support costs for incidents in production. By using BuildPulse, AuditBoard reduced these costs and recovered the developer hours previously spent on triage and rerunning builds.

The key success criteria were:

Automatically identify and centrally catalog flaky tests
Metrics on flakiness and time consumed for prioritization
Test quarantining to mitigate impact until tests are fixed
Speed of implementation
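To make the first two criteria concrete, here is a small sketch of how flakiness could be identified and ranked from raw CI results. It is an illustration under stated assumptions, not BuildPulse's detection method: the `RunRecord` shape is hypothetical, and a test is treated as flaky when it both passed and failed on the same commit.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class RunRecord:
    """One execution of one test (hypothetical record shape)."""
    test_name: str
    commit_sha: str
    passed: bool
    duration_s: float


def flakiness_report(runs):
    """Rank flaky tests by the seconds spent in their failed runs.

    Assumed definition: a test is flaky on a commit if it has both a
    passing and a failing run on that same commit.
    """
    outcomes = defaultdict(set)        # (test, commit) -> set of outcomes seen
    failed_time = defaultdict(float)   # (test, commit) -> seconds in failed runs
    for run in runs:
        key = (run.test_name, run.commit_sha)
        outcomes[key].add(run.passed)
        if not run.passed:
            failed_time[key] += run.duration_s

    report = defaultdict(lambda: {"flaky_commits": 0, "wasted_seconds": 0.0})
    for (test, commit), seen in outcomes.items():
        if seen == {True, False}:
            report[test]["flaky_commits"] += 1
            report[test]["wasted_seconds"] += failed_time[(test, commit)]

    # Most expensive flaky tests first, to guide prioritization.
    return sorted(report.items(),
                  key=lambda item: item[1]["wasted_seconds"],
                  reverse=True)


sample_runs = [
    RunRecord("test_login", "abc", False, 12.0),
    RunRecord("test_login", "abc", True, 11.0),
    RunRecord("test_export", "abc", False, 30.0),   # fails every time: broken, not flaky
    RunRecord("test_export", "abc", False, 31.0),
    RunRecord("test_search", "def", False, 5.0),
    RunRecord("test_search", "def", True, 4.0),
]
ranked = flakiness_report(sample_runs)
```

In this sample, `test_export` is excluded because it fails consistently, which separates broken tests from flaky ones under the assumed definition.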

“You need a tool to track flakiness, or you will spend a lot of time doing analytics and quarantining tests”

Dickson Wu, Director of Quality Engineering

Implementation

BuildPulse was up and running with test quarantining within a handful of days, and the first set of flaky tests was captured immediately. The process consists of:

Monitor and send results to BuildPulse
Implement logic in the testing framework to skip quarantined tests
Trial the workflow during releases, with manual Jira ticket creation and manual quarantining
Turn on automated Jira ticket creation and test quarantining, and start using BuildPulse at 100% capacity
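The skip logic in the second step could look something like the sketch below, shown here with Python's standard `unittest` module. It is only an illustration: the hard-coded `QUARANTINED` set, the decorator name, and the test names are hypothetical, and a real integration would load the quarantine list from the tracking service rather than from source code.

```python
import unittest

# Hypothetical quarantine list. In a real setup this would be fetched
# from the flaky-test tracking service before the test run starts.
QUARANTINED = {"CheckoutTests.test_payment_timeout"}


def skip_if_quarantined(test_method):
    """Skip the decorated test when its qualified name is quarantined."""
    test_id = test_method.__qualname__
    decorator = unittest.skipIf(test_id in QUARANTINED,
                                f"quarantined: {test_id}")
    return decorator(test_method)


class CheckoutTests(unittest.TestCase):
    @skip_if_quarantined
    def test_payment_timeout(self):
        # Stands in for a flaky test; it never runs while quarantined.
        self.fail("intermittent failure")

    @skip_if_quarantined
    def test_cart_total(self):
        self.assertEqual(2 + 3, 5)


def run_suite() -> unittest.TestResult:
    """Run the example suite and return the collected result."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(CheckoutTests)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

Reporting quarantined tests as skipped, rather than deleting them, keeps them visible in test output until they are fixed.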

The following highlights AuditBoard’s new flakiness workflow:

Tests are run on PR builds, nightly builds, as well as before releases.
BuildPulse skips quarantined tests, runs the remaining tests, and automatically detects new flaky tests.
Manual quarantining is also done for broken tests pre-release.
For each quarantined test, BuildPulse automatically creates and assigns a Jira ticket.
Each ticket triggers an investigation, which results in either opening a bug or fixing the test if it is broken.
After the fix is merged and the Jira ticket moves to ‘done’, BuildPulse automatically removes the test from quarantine, and the test is re-enabled on the next build.
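The release rule in the last step can be expressed as a small function. This is a hedged sketch of the rule as described above, not BuildPulse's implementation; the `QuarantineEntry` fields and the ticket keys are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class QuarantineEntry:
    """A quarantined test and the status of its tracking ticket (hypothetical)."""
    test_name: str
    ticket_key: str
    ticket_status: str


def partition_quarantine(entries: List[QuarantineEntry]) -> Tuple[List[str], List[str]]:
    """Split entries into (still quarantined, released for the next build).

    Rule from the workflow above: a test leaves quarantine once its
    ticket reaches 'done'. Status matching is case-insensitive here.
    """
    still_quarantined: List[str] = []
    released: List[str] = []
    for entry in entries:
        if entry.ticket_status.strip().lower() == "done":
            released.append(entry.test_name)
        else:
            still_quarantined.append(entry.test_name)
    return still_quarantined, released


entries = [
    QuarantineEntry("test_payment_timeout", "QA-101", "In Progress"),
    QuarantineEntry("test_report_export", "QA-102", "Done"),
]
skip_next_build, re_enabled = partition_quarantine(entries)
```

Tying release to ticket status means a test returns to the suite without anyone editing a skip list by hand.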

Outcome

BuildPulse has helped AuditBoard minimize the time spent collecting and triaging flaky tests, streamline communication between team members, and automate the testing processes around resolution. Beyond flakiness, BuildPulse has also helped address broken and failing tests effectively.

FAQ

Does BuildPulse replace my current CI system?

No.

We use GitHub Actions / CircleCI / Semaphore CI self-hosted functionality to run your builds on our infrastructure.

Other than faster builds, there are no changes to tooling or your developer workflows. You can continue using your CI system as-is.

How is BuildPulse faster than GitHub Actions hosted runners?

We use GitHub’s self-hosted functionality to run your builds on our infrastructure, which uses latest-generation CPUs with high single-core performance that are further optimized for CI-type workloads. We’ve also tuned our VMs and block storage devices, increasing baseline performance while cutting costs in half.

We also provide a toolkit to further speed up your pipelines, which includes ultra-fast remote Docker builders, Docker layer caching, dependency caching, and more. With all of these improvements, we’ve seen build times improve by 2x or more.

Can I use BuildPulse with other CI providers than GitHub Actions?

Yes! BuildPulse Runners run jobs for CircleCI and Semaphore CI, with GitLab support coming soon.

We aim to support all popular CI systems. If you're using one that's not listed, please contact support@buildpulse.io!

Is there a free trial available?

Yes, you can book a meeting here!

How do you secure my builds?

BuildPulse runs each job in a network- and compute-isolated environment, using ephemeral VMs that leave behind a clean state after every run.

Do you support Mac and Windows runners?

This is on our roadmap! Email us at hello@buildpulse.io, or book a demo here!

Is BuildPulse SOC 2 compliant?

Yes, BuildPulse is SOC 2 Type 2 compliant.

How are BuildPulse Runners priced?

BuildPulse Runners are billed per second, at a rate that depends on the runner type used. See our pricing page for more details.

How long does implementation/integration with BuildPulse take?

The minimum implementation involves 2 steps: signing up for BuildPulse, and changing 1 line in your GitHub Actions YAML file.

If you're using Semaphore CI or CircleCI, it's a 4-line change. See our Quickstart guide for more details.

Ready for Takeoff?

Book a Demo