Significance Level

What is significance level?

Significance Level, denoted as alpha (α), is a key concept in hypothesis testing. It serves as the threshold to decide if your test results are statistically significant.

Common significance levels are 0.01 and 0.05. These values correspond to a 1% and 5% chance of rejecting the null hypothesis when it’s actually true. This helps you understand the probability of making a Type I error.

Why use a significance level?

Setting a significance level allows you to control the likelihood of incorrectly rejecting a true null hypothesis. This makes your results more reliable.

  • 0.05: Indicates a 5% risk of concluding a difference exists when there isn't one.

  • 0.01: Indicates a 1% risk, making it more stringent.

How to choose a significance level?

Choosing the right significance level depends on the context of your test. For most experiments, 0.05 is common. However, in more critical fields like medicine, 0.01 might be preferred to minimize errors.

Why significance level matters

Setting a significance level helps you control Type I errors. It prevents you from incorrectly rejecting a true null hypothesis. This makes your experiment results reliable.

Reliability: A clear significance level ensures your findings aren't due to random chance. For more details, check out Statistical Significance.

Trustworthiness: Results become trustworthy, allowing for confident decision-making. By minimizing errors, you enhance the credibility of your study. This is crucial for drawing meaningful conclusions and making informed choices. Learn more about Type 1 Error and Type 2 Error.

Example: medical drug effectiveness

  • Scenario: Testing a new drug against a placebo.

  • Null Hypothesis (H0): The new drug has no effect.

  • Alternative Hypothesis (H1): The new drug is effective.

  • Significance Level: Set at 0.01.

  • Interpretation: If p-value ≤ 0.01, reject H0, indicating the drug is effective.

You test a new drug against a placebo to see if it works. The null hypothesis states the drug has no effect. The alternative hypothesis claims the drug is effective.

A significance level of 0.01 means there's a 1% risk of a false positive. Collect your data and calculate the p-value. If this p-value is ≤ 0.01, you reject the null hypothesis.

This result means the drug likely has a real effect. You can then consider the drug effective based on your data. This ensures your conclusions are reliable.

For more information on hypothesis testing and interpretation of results, you can refer to Statistical Significance and Sequential Testing.

Example: marketing campaign analysis

  • Scenario: Evaluating the impact of a new marketing campaign on sales.

  • Null Hypothesis (H0): The campaign has no effect on sales.

  • Alternative Hypothesis (H1): The campaign increases sales.

  • Significance Level: Set at 0.05.

  • Interpretation: If p-value ≤ 0.05, reject H0, indicating the campaign is effective.

You want to know if a new marketing campaign boosts sales. The null hypothesis suggests there's no sales impact. The alternative hypothesis suggests the campaign increases sales.

Set your significance level at 0.05. This means a 5% chance of wrongly rejecting the null hypothesis. Collect your sales data and calculate the p-value.

If the p-value is ≤ 0.05, reject the null hypothesis. This indicates the campaign likely increased sales. Your decision will be backed by data, ensuring reliability.

Loved by customers at every stage of growth

See what our users have to say about building with Statsig
OpenAI
"At OpenAI, we want to iterate as fast as possible. Statsig enables us to grow, scale, and learn efficiently. Integrating experimentation with product analytics and feature flagging has been crucial for quickly understanding and addressing our users' top priorities."
Dave Cummings
Engineering Manager, ChatGPT
SoundCloud
"We evaluated Optimizely, LaunchDarkly, Split, and Eppo, but ultimately selected Statsig due to its comprehensive end-to-end integration. We wanted a complete solution rather than a partial one, including everything from the stats engine to data ingestion."
Don Browning
SVP, Data & Platform Engineering
Recroom
"Statsig has been a game changer for how we combine product development and A/B testing. It's made it a breeze to implement experiments with complex targeting logic and feel confident that we're getting back trusted results. It's the first commercially available A/B testing tool that feels like it was built by people who really get product experimentation."
Joel Witten
Head of Data
"We knew upon seeing Statsig's user interface that it was something a lot of teams could use."
Laura Spencer
Chief of Staff
"The beauty is that Statsig allows us to both run experiments, but also track the impact of feature releases."
Evelina Achilli
Product Growth Manager
"Statsig is my most recommended product for PMs."
Erez Naveh
VP of Product
"Statsig helps us identify where we can have the most impact and quickly iterate on those areas."
John Lahr
Growth Product Manager
"The ability to easily slice test results by different dimensions has enabled Product Managers to self-serve and uncover valuable insights."
Preethi Ramani
Chief Product Officer
"We decreased our average time to decision made for A/B tests by 7 days compared to our in-house platform."
Berengere Pohr
Team Lead - Experimentation
"Statsig is a powerful tool for experimentation that helped us go from 0 to 1."
Brooks Taylor
Data Science Lead
"We've processed over a billion events in the past year and gained amazing insights about our users using Statsig's analytics."
Ahmed Muneeb
Co-founder & CTO
SoundCloud
"Leveraging experimentation with Statsig helped us reach profitability for the first time in our 16-year history."
Zachary Zaranka
Director of Product
"Statsig enabled us to test our ideas rather than rely on guesswork. This unlocked new learnings and wins for the team."
David Sepulveda
Head of Data
Brex
"Brex's mission is to help businesses move fast. Statsig is now helping our engineers move fast. It has been a game changer to automate the manual lift typical to running experiments and has helped product teams ship the right features to their users quickly."
Karandeep Anand
President
Ancestry
"We only had so many analysts. Statsig provided the necessary tools to remove the bottleneck. I know that we are able to impact our key business metrics in a positive way with Statsig. We are definitely heading in the right direction with Statsig."
Partha Sarathi
Director of Engineering
"Statsig has enabled us to quickly understand the impact of the features we ship."
Shannon Priem
Lead PM
Ancestry
"I know that we are able to impact our key business metrics in a positive way with Statsig. We are definitely heading in the right direction with Statsig."
Partha Sarathi
Director of Engineering
"Working with the Statsig team feels like we're working with a team within our own company."
Jeff To
Engineering Manager
"[Statsig] enables shipping software 10x faster, each feature can be in production from day 0 and no big bang releases are needed."
Matteo Hertel
Founder
"We use Statsig's analytics to bring rigor to the decision-making process across every team at Wizehire."
Nick Carneiro
CTO
Notion
"We've successfully launched over 600 features behind Statsig feature flags, enabling us to ship at an impressive pace with confidence."
Wendy Jiao
Staff Software Engineer
"We chose Statsig because it offers a complete solution, from basic gradual rollouts to advanced experimentation techniques."
Carlos Augusto Zorrilla
Product Analytics Lead
"We have around 25 dashboards that have been built in Statsig, with about a third being built by non-technical stakeholders."
Alessio Maffeis
Engineering Manager
"Statsig beats any other tool in the market. Experimentation serves as the gateway to gaining a deeper understanding of our customers."
Toney Wen
Co-founder & CTO
"We finally had a tool we could rely on, and which enabled us to gather data intelligently."
Michael Koch
Engineering Manager
Notion
"At Notion, we're continuously learning what our users value and want every team to run experiments to learn more. It's also critical to maintain speed as a habit. Statsig's experimentation platform enables both this speed and learning for us."
Mengying Li
Data Science Manager
Whatnot
"Excited to bring Statsig to Whatnot! We finally found a product that moves just as fast as we do and have been super impressed with how closely our teams collaborate."
Rami Khalaf
Product Engineering Manager
"We realized that Statsig was investing in the right areas that will benefit us in the long-term."
Omar Guenena
Engineering Manager
"Having a dedicated Slack channel and support was really helpful for ramping up quickly."
Michael Sheldon
Head of Data
"Statsig takes away all the pre-work of doing experiments. It's really easy to setup, also it does all the analysis."
Elaine Tiburske
Data Scientist
"We thought we didn't have the resources for an A/B testing framework, but Statsig made it achievable for a small team."
Paul Frazee
CTO
Whatnot
"With Warehouse Native, we add things on the fly, so if you mess up something during set up, there aren't any consequences."
Jared Bauman
Engineering Manager - Core ML
"In my decades of experience working with vendors, Statsig is one of the best."
Laura Spencer
Technical Program Manager
"Statsig is a one-stop shop for product, engineering, and data teams to come together."
Duncan Wang
Manager - Data Analytics & Experimentation
Whatnot
"Engineers started to realize: I can measure the magnitude of change in user behavior that happened because of something I did!"
Todd Rudak
Director, Data Science & Product Analytics
"For every feature we launch, Statsig saves us about 3-5 days of extra work."
Rafael Blay
Data Scientist
"I appreciate how easy it is to set up experiments and have all our business metrics in one place."
Paulo Mann
Senior Product Manager
We use cookies to ensure you get the best experience on our website.
Privacy Policy