Features

Keep Your Feature Flags Synchronized

Sebastian Arrubia on June 16, 2017

#> split-sync

Split lets companies fulfill the promise of continuous delivery: with Split’s safe launch and experimentation, teams can take any feature into production with fine-grain control.

In order to precisely release features (and gather data on their engagement), Split’s SDKs utilize different streams of data. And many of our customers run Split with SDKs installed in multiple languages and throughout their application stack.

By default, Split’s SDKs keep segment and split data synchronized as your users navigate across large distributed systems, treatments and conditions. Some languages, however, do not have a native capability to keep a shared local cache of this data to properly serve treatments.

Today, we’re giving these languages the power to do so by introducing a new feature: the Split Synchronizer. Let’s take a look at how it works, and the performance boost it can give your app in serving treatments with Split.

How the Split Synchronizer Works

The Split Synchronizer coordinates the sending and receiving of data to a remote data store that all of your processes can share when updating definitions and rules for the evaluation of treatments. Its an easy-to-install job that sits on your servers and is designed to run entirely in the background so that it has zero impact on the performance of your application servers,. Split supports Redis as a remote data store out-of-the box, so it’s no surprise the Split Synchronizer utilizes Redis as the cache for your SDKs when evaluating treatments. It also posts impression data and metrics generated by the SDKs back to Split’s servers, for exposure in the web console or sending to the data integration of your choice.

The Split Synchronizer works with most of the languages Split supports:

Those not currently listed already have the ability to keep a local data cache for Split’s SDKs, but if you’re already a Split customer and curious about using the Synchronizer with a language not listed above, drop us a note.

Benefits

Why take the Synchronizer approach? Whether you’re working with one Split SDK or many, your team will see the benefits of:

  • Data Consistency: Split-synchronizer and associated SDKs implement a producer-consumer pattern, using a Redis cache as a remote data store to guarantee data consistency. This allows you to take ownership of the data store, giving your team the power to improve the performance when feature flag definitions are requested from the SDK.
  • Security: Since the Redis instance is located on either your local network or behind a virtual private network (VPN), any data contained within your features flags will remain in your app and secure.
  • Fault Tolerance: If for some reason a connection issue is experienced on your network, the last consistent state of your feature flags will be stored in the Redis cache, and the Split Synchronizer will synchronize it once that the connection issue is restored.
  • Error Handling: In addition to logging errors in the file system or even in the stdout, Split Synchronizer has the ability to send errors to a customized Slack channel or Slack user.

Service Architecture

This service performs four different actions:

  • Fetch Splits: retrieve the split definitions.
  • Fetch Segments: retrieve your segments lists and memberships to targeting groups.
  • Post Impressions: send the impressions generated by the SDK to Split servers.
  • Post Metrics: send different metrics (like latencies) from the SDK to Split servers.

Performance Improvements and Benchmark

The Split Synchronizer is written in Go and introduces notable performance improvements over previous language-specific synchronizers bundled with our SDKs. Now, tracked data (Split’s impressions log) will be posted at Split servers in a near real-time window.

The following numbers show an SDK’s impression-handling capability over time, after installing the Split Synchronizer:

~3M /minute ~180M /hour ~4.32B /day

These numbers belong to the test scenario below:

Spliti-sync instance: m4.large (2 CPU - 8 GB Memory)
Redis instance: cache.m3.large (2 vCPU - 6 GB Memory)
Number of features flags: 200
Impressions to send per post: 500k
Impressions generated per feature flag: 100k
Insertion rate in Redis: 1M each ~5s
Duration of the benchmark: 48 hours


Analyzing Performance Improvements

Let’s look at performance by analyzing some infrastructure numbers for the duration of this benchmark .

The rate of successfully posted impressions remained stable throughout the duration of the test:

Memory consumption remained stable during the benchmark:

The CPU remained constant and within boundaries (~25% to 55%):

Throughput remained constant at ~ 20MB/s:

We're Hiring!

Join the Team