Why should product managers measure shipped features?

Wed Sep 14 2022

Jack Virag

Editor in Chief, Statsig

As a product manager, it's important to constantly assess and evaluate the impact of the features you're shipping to users. By measuring the impact of every single feature, you can gain valuable insights into how your product is being used, what's working well, and what areas may need improvement.

Measuring the impact of shipped features:

Here are just a few of the benefits of measuring the impact of every feature you ship:

Improved product decision-making

By gathering data on how your features are performing, you can make more informed decisions about which features to prioritize and how to improve existing features. This can help you avoid making costly mistakes and ensure that your product is consistently meeting the needs of your users.

Enhanced user experience

By gathering feedback from users and measuring the impact of specific features, you can identify areas where the user experience can be improved. This can help you create a more intuitive and enjoyable product for your users.

Increased user engagement

By measuring the impact of your features, you can gain insight into which features are most popular and which ones are being underutilized. This information can help you create more engaging content and drive higher levels of user engagement.

Better product differentiation

By gathering data on how your features are being used, you can identify unique value propositions and differentiators for your product. This can help you stand out in a crowded market and attract more users.

Using Statsig

Statsig offers multiple ways to measure the impact of experiments by default.

The scorecard

The scorecard panel displays Primary and Secondary experiment metrics for each variant. These metrics are in the context of experiment hypothesis, which is highlighted at the top of the scorecard.

By default, all scorecard metrics have CUPED applied in order to shrink confidence intervals and reduce bias.

All metrics

The all metrics tab shows the metric lifts across all metrics in the metrics catalog.

Significance level

To further adjust the results, significance levels can be tweaked in the following ways:

Apply Bonferroni Correction: This reduces the probability of false positives by adjusting the significance level alpha, which will be divided by the number of test variants in the experiment.

Confidence Interval: Choose lower confidence intervals (e.g.: 80%) when there's higher tolerance for false positives and fast iteration with directional results is preferred over longer/larger experiments with increased certainty.

CUPED: Toggle CUPED on/ off via the inline settings above the metric lifts. NOTE- this setting can only be toggled for Scorecard metrics, as CUPED is not applied to non-Scorecard metrics.

Sequential Testing: This helps mitigate the increased false positive rate associated with the "peeking problem". Toggle Sequential Testing on/ off via the inline settings above the metric lifts. NOTE- this setting is available only for experiments with a set target duration.

In conclusion

Measuring the impact of every single feature you ship to users is a crucial part of the product development process. By gathering data and feedback, you can make more informed decisions, improve the user experience, increase user engagement, and better differentiate your product.

If you're a product manager, make sure to incorporate impact measurement into your workflow and start reaping the benefits.

Build fast?

Subscribe to Scaling Down: Our newsletter on building at startup-speed.

Try Statsig Today

Get started for free. Add your whole team!
We use cookies to ensure you get the best experience on our website.
Privacy Policy