Snowflake and Statsig for experimentation

Thu Jul 27 2023

Ryan Musser

Lead Solutions Engineer, Statsig

We’ve collaborated with dozens of the world’s most successful digital experimentation programs in search of the keys to success.

In this search, we’ve observed some recurring themes: trusted and complete data, accessible data, and automated analysis.

Using Statsig and Snowflake, you can repeat these patterns and increase the efficiency and effectiveness of your digital experimentation program without over-consuming data science and engineering resources.

The points of friction

  • Incomplete and untrustworthy data. Successful experimentation programs reduce their exposure to brittle architecture and data quality risk.

  • Inaccessible data. Do practitioners have the data where they need it? Do they understand what the data represents and where it came from?

  • Manual analysis. Analyzing each experiment by hand increases workloads and delays experimentation.

While experimentation programs may have varying business goals, we find that a few common patterns ensure trusted and complete data, accessible data, and automated analysis.

Foundational success patterns

360-degree user views

Building a 360-degree view of your users involves consolidating and cleaning your data, enriching it with additional information, analyzing and segmenting it, and using the insights you gain to further invest in customer experience.

This data typically includes data from your application, model outputs, CRM systems, marketing automation tools, social media platforms, and other sources. Accurate and thorough data collection is essential for ensuring the validity of your analysis.

Data accessibility

By ensuring that data is well-documented and easy to access, organizations can help to promote the valid use of data and support the development of trusted outputs.

Data documentation might include metadata (such as the date the data was collected, the source of the data, and the methods used to collect it), as well as clear and detailed explanations of the data's structure and content. Data accessibility is built using appropriate data collection methods and tools, and properly recording and storing this data.

Automated analysis

Automated analysis is an important tool for organizations that need to analyze large and complex datasets quickly and accurately.

Using automated analysis, organizations can gain insights faster from their data and make more informed decisions. Analysis includes a wide range of tasks, such as data cleaning and preprocessing, statistical analysis, building consumable readouts, and data visualizations.

How to get started

Consolidate user data in Snowflake

We have observed that Snowflake Data Cloud serves as an effective engine to power a 360-degree customer view and mobilize this data because of Snowflake’s mature integration ecosystem, secure data accessibility, near-unlimited scale, concurrency, and performance.

You can have Statsig pull raw data from Snowflake, or, using Statsig Warehouse Native, you can calculate intermediate results fully within your warehouse and export only a small, aggregated, and anonymized dataset that's displayed in your Statsig console:

connecting statsig to snowflake

You can pull events, pre-computed metrics or exposure data from Snowflake into Statsig.
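To make the "aggregate in the warehouse, export only a summary" idea concrete, here is a minimal Python sketch. It is not Statsig's implementation: the in-memory lists stand in for warehouse tables, and the names `exposures`, `events`, and `aggregate_by_group` are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical rows standing in for warehouse tables of exposures and events.
exposures = [
    {"user_id": "u1", "group": "control"},
    {"user_id": "u2", "group": "test"},
    {"user_id": "u3", "group": "test"},
    {"user_id": "u4", "group": "control"},
]
events = [
    {"user_id": "u1", "event": "purchase", "value": 10.0},
    {"user_id": "u2", "event": "purchase", "value": 12.0},
    {"user_id": "u2", "event": "purchase", "value": 8.0},
    {"user_id": "u3", "event": "page_view", "value": 0.0},
]


def aggregate_by_group(exposures, events, event_name):
    """Join events to exposures and return per-group aggregates only."""
    group_of = {row["user_id"]: row["group"] for row in exposures}
    summary = defaultdict(lambda: {"users": 0, "event_count": 0, "value_sum": 0.0})
    # Count exposed users per group.
    for group in group_of.values():
        summary[group]["users"] += 1
    # Attribute matching events to the group of the user who produced them.
    for row in events:
        group = group_of.get(row["user_id"])
        if group is None or row["event"] != event_name:
            continue  # skip unexposed users and other event types
        summary[group]["event_count"] += 1
        summary[group]["value_sum"] += row["value"]
    return dict(summary)


summary = aggregate_by_group(exposures, events, "purchase")
print(summary)
```

The returned summary contains no user identifiers, only per-group counts and sums, which is the property that makes the exported dataset small and anonymized.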

Use Statsig (a Snowflake-connected app) to create, launch, and measure experiments

Advanced user targeting

Using Statsig’s Feature Gates, you can select the criteria for identifying your target users in several ways:

  • You can target users based on common application properties such as the operating system that the application is running on as shown below:

targeting users based on common application properties

  • You can target users based on key value attributes; for example, you can select the user's Email attribute and the Contains any of operator, and enter the email domain of your company to target only internal employees as shown below:

targeting users based on key value attributes

  • You can target users in a defined segment.

  • You can also target users who are eligible for another feature gate; this ensures that this feature gate is activated only for users who pass (or fail) the selected target gate check.
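The four targeting styles above can be sketched as small rule functions. This is an illustrative model, not the Statsig SDK or its evaluation order: the `User` class, the rule helpers, and the simplification that a gate passes when any one rule matches are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class User:
    user_id: str
    email: str = ""
    os: str = ""
    segments: set = field(default_factory=set)


def os_is(*names):
    """Rule: the user's operating system is one of the given names."""
    allowed = {n.lower() for n in names}
    return lambda user, gates: user.os.lower() in allowed


def email_contains_any(*fragments):
    """Rule: the user's email contains any of the given fragments."""
    return lambda user, gates: any(f.lower() in user.email.lower() for f in fragments)


def in_segment(name):
    """Rule: the user belongs to a named segment."""
    return lambda user, gates: name in user.segments


def passes_gate(gate_name, expected=True):
    """Rule: the user passes (or fails) another gate. Cycles are not handled."""
    return lambda user, gates: check_gate(gates, gate_name, user) == expected


def check_gate(gates, gate_name, user):
    """Simplifying assumption: a gate passes when any of its rules matches."""
    return any(rule(user, gates) for rule in gates.get(gate_name, []))


# Hypothetical gate definitions.
gates = {
    "internal_only": [email_contains_any("@example.com")],
    "new_checkout": [passes_gate("internal_only"), in_segment("beta_testers")],
    "ios_banner": [os_is("iOS")],
}

employee = User("u1", email="ana@example.com", os="Android")
outsider = User("u2", email="bo@other.org", os="iOS")

print(check_gate(gates, "new_checkout", employee))
print(check_gate(gates, "new_checkout", outsider))
```

Here `new_checkout` depends on `internal_only`, so the employee passes through the gate dependency while the outsider, who is in no segment and fails the target gate, does not.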

Scheduled Rollouts

Scheduled Rollouts add a time-based scheduling layer to Feature Gates, enabling you to pre-set any rollout schedule you want, which will execute automatically. This is particularly useful if, for example, you have a feature launch happening in another timezone (and don’t want to stay up all night!) or you have a standard, company-wide ramp-up schedule you follow with every feature release.
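As a rough illustration of how a time-based ramp can work, the sketch below pairs a schedule of percentages with stable user bucketing. The dates, percentages, salt, and function names are hypothetical, and the hashing scheme is a common general technique rather than a description of Statsig's internals.

```python
import hashlib
from datetime import datetime, timezone

# Hypothetical ramp: (start time in UTC, percentage of users who pass).
SCHEDULE = [
    (datetime(2023, 8, 1, 9, 0, tzinfo=timezone.utc), 5),
    (datetime(2023, 8, 2, 9, 0, tzinfo=timezone.utc), 25),
    (datetime(2023, 8, 4, 9, 0, tzinfo=timezone.utc), 100),
]


def rollout_percentage(now, schedule=SCHEDULE):
    """Return the percentage of the latest step that has already started."""
    percentage = 0
    for start, pct in sorted(schedule):
        if now >= start:
            percentage = pct
    return percentage


def user_bucket(user_id, salt="new_feature"):
    """Map a user to a stable bucket in [0, 100) with a salted hash."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100


def is_enabled(user_id, now):
    """A user sees the feature once the ramp percentage exceeds their bucket."""
    return user_bucket(user_id) < rollout_percentage(now)


now = datetime(2023, 8, 3, 12, 0, tzinfo=timezone.utc)
print(rollout_percentage(now), is_enabled("user-42", now))
```

Because each user's bucket never changes, anyone enabled at an early step stays enabled as the percentage grows, so the ramp only ever adds users.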

adding a time-based scheduling layer to feature gates

Statsig will then measure metric deltas for all users that are seeing the new feature, and allow you to decide if it’s worth shipping the new feature based on this data, or kill the feature if it is degrading guardrail metrics.
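A metric delta and a ship-or-kill decision can be sketched with a simple difference in means and a normal-approximation confidence interval. This is a simplified stand-in, not Statsig's statistics engine: the sample values are made up, `metric_delta` and `decide` are hypothetical names, and with samples this small a t-based interval would be more appropriate.

```python
import math
from statistics import mean, variance


def metric_delta(control, test, z=1.96):
    """Difference in means with an approximate 95% confidence interval."""
    delta = mean(test) - mean(control)
    std_err = math.sqrt(variance(test) / len(test) + variance(control) / len(control))
    return delta, (delta - z * std_err, delta + z * std_err)


def decide(interval, higher_is_better=True):
    """Ship on a clear improvement, kill on a clear regression."""
    low, high = interval
    if low > 0:
        return "ship" if higher_is_better else "kill"
    if high < 0:
        return "kill" if higher_is_better else "ship"
    return "inconclusive"


# Made-up per-user metric values for users without and with the feature.
control = [10, 12, 11, 9, 10, 11]
test = [14, 15, 13, 16, 14, 15]

delta, interval = metric_delta(control, test)
print(delta, interval, decide(interval))
```

For a guardrail metric where lower is better, such as latency, passing `higher_is_better=False` flips the decision so that a clear increase reads as a reason to kill the feature.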

a statsig pulse report showing the impact of features

Experiments

Experiments let you run and analyze A/B tests with ease, and offer additional features to accelerate and scale your experimentation:

  • Create multiple variants as part of the same experiment (A/B/n tests)

  • Define layers to exclude the users participating in one experiment from other related experiments

  • Select the key metrics that you want to prioritize for your hypothesis
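The first two items, multiple variants and layers, can be sketched together. In this illustrative model a layer splits users into disjoint bucket ranges so that each user lands in at most one experiment, and a second hash picks the variant. The layer definition, experiment names, and `assign` function are hypothetical and do not reflect Statsig's actual allocation logic.

```python
import hashlib


def bucket(key, salt, buckets=1000):
    """Map a key to a stable bucket in [0, buckets) with a salted hash."""
    digest = hashlib.sha256(f"{salt}:{key}".encode()).hexdigest()
    return int(digest, 16) % buckets


# Hypothetical layer: each experiment owns a disjoint bucket range.
LAYER = {
    "name": "checkout_layer",
    "experiments": {
        "button_color": {"range": (0, 500), "variants": ["control", "red", "green"]},
        "free_shipping": {"range": (500, 1000), "variants": ["control", "test"]},
    },
}


def assign(user_id, layer=LAYER):
    """Return (experiment, variant) for a user; at most one experiment per layer."""
    slot = bucket(user_id, layer["name"])
    for name, exp in layer["experiments"].items():
        low, high = exp["range"]
        if low <= slot < high:
            variants = exp["variants"]
            # Salt with the experiment name so variant choice is independent
            # of the layer slot.
            return name, variants[bucket(user_id, name, len(variants))]
    return None, None


print(assign("user-42"))
```

Because the ranges do not overlap, a user in `button_color` can never also be in `free_shipping`, which keeps the two experiments from contaminating each other's results.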

viewing metrics in statsig's metric lifts dashboard

Metrics catalog

Statsig provides data observability and accessibility by combining data from any of your existing data sources to give you a complete view of your product metrics, as well as the impact new features and experiments have on these metrics. This includes data from Snowflake, data collectors (like Segment, Rudderstack, etc.), and data logged using Statsig’s SDKs.

Statsig automatically creates metrics from the raw events that you log from your application as well as raw and transformed events you send to Statsig via a data collector.

statsig's automatically-generated metrics based on raw events

Statsig can also reuse your existing precomputed metrics by natively integrating and pulling data from Snowflake.

Then, Statsig builds a Metrics Catalog that allows you to search and tag your metrics, as well as create custom metrics. Tags enable you to organize your metrics and create collections of metrics that are associated in some way.

For example, you could tag a set of metrics focused on a product area, business function, business objective, and so on. You can also create a loose collection of guardrail metrics that teams check in every experiment to ensure they are causing no unexpected effects in other parts of the business.

Once you create a tagged collection of metrics, you can easily pull up this set of metrics when viewing your experiment results and zoom into the context that you want to focus on.
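Conceptually, a tagged collection is a lookup from tag to metric names that can then filter an experiment's results. The sketch below models that idea with plain dictionaries; the catalog contents, metric names, result values, and helper functions are all hypothetical and are not part of any Statsig API.

```python
# Hypothetical catalog: metric name -> set of tags.
CATALOG = {
    "checkout_conversion": {"product", "checkout"},
    "page_load_time_ms": {"guardrail", "performance"},
    "crash_rate": {"guardrail"},
    "revenue_per_user": {"business", "checkout"},
}


def metrics_with_tag(catalog, tag):
    """Return the names of all metrics carrying the given tag, sorted."""
    return sorted(name for name, tags in catalog.items() if tag in tags)


def filter_results(results, catalog, tag):
    """Keep only the experiment results for metrics in a tagged collection."""
    wanted = set(metrics_with_tag(catalog, tag))
    return {name: delta for name, delta in results.items() if name in wanted}


# Made-up metric deltas from one experiment.
results = {
    "checkout_conversion": 0.021,
    "page_load_time_ms": 35.0,
    "crash_rate": 0.0,
    "revenue_per_user": 0.4,
}

guardrails = filter_results(results, CATALOG, "guardrail")
print(guardrails)
```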

viewing experiment results in statsig

Data-driven growth is now accessible to everyone

The combination of Statsig and Snowflake makes data-driven growth accessible for businesses of all sizes. Statsig's ability to quickly and easily analyze large amounts of data, combined with Snowflake's powerful data cloud capabilities, allows businesses to gain insights and make data-driven decisions with ease.

This combination not only saves time and resources, but also allows for more accurate and informed decision-making, leading to sustainable growth for businesses. By leveraging the strengths of both Statsig and Snowflake, businesses can stay ahead of the competition and make data-driven growth a reality.

Create a free account

You're invited to create a free Statsig account! Get started today, and ping us if you have questions. No credit card required, of course.