Design of Experiments

The goal of an experiment is to determine a causal relationship between the factors changed during the experiment and an outcome. Determining a causal relationship through observational studies alone is generally not possible because of confounding. Depending on the industry, this process is also referred to as A/B testing, hypothesis testing, or simply testing. These types of experiments support human decision-making; model-based design of experiments (DoE) is covered elsewhere on this site.

To remove the influence of confounding factors, an experimenter changes a relatively small number of factors for a portion of the overall population, while the rest of the population receives the "normal" treatment (control). The performance and behavior of the two groups are then compared using statistics to determine whether the differing treatment had an effect on the outcome. The most basic approach is to change one factor at a time; while this is not strictly necessary, it usually produces the most interpretable results and carries less risk of executing the experiment incorrectly.
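Random assignment is what keeps the two groups comparable. As a minimal sketch (the function name and parameters are illustrative, not a prescription), assignment can be done by shuffling the units and splitting them:

```python
import random

def assign_groups(units, treatment_fraction=0.5, seed=42):
    """Randomly split units into treatment and control groups."""
    rng = random.Random(seed)          # fixed seed for reproducibility
    shuffled = list(units)
    rng.shuffle(shuffled)
    cutoff = int(len(shuffled) * treatment_fraction)
    return shuffled[:cutoff], shuffled[cutoff:]  # (treatment, control)

# 20% of 100 units receive the treatment; the rest form the control group.
treatment, control = assign_groups(range(100), treatment_fraction=0.2)
print(len(treatment), len(control))  # 20 80
```

Because every unit has the same chance of landing in either group, any confounding factor is (on average) balanced across the two groups.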

For the output of the experiment to be actionable, the experimenter needs to collect enough samples to be confident in the results. The more samples in the experiment, the lower the risk of declaring an effect when in reality there was none (Type I error) or of missing an effect that is really there (Type II error). Please feel free to use the sample size and minimum detectable effect calculator to the right.

Sample Size Calculator

[Interactive calculator: computes sample size or minimum detectable effect for one-tailed and two-tailed tests.]

Definitions

Sample Size Calculations

alpha: The probability of a false positive (Type I) error, i.e. the experimenter declares the effect met their thresholds when in reality noise drove the outcome. Useful to think of it as 1 - confidence level. Commonly set to 0.05 or 0.1.

beta: The probability of a false negative (Type II) error, i.e. the experimenter declares the effect did not meet their thresholds when in reality noise suppressed it. Useful to think of it as 1 - power. Commonly set to 0.2 (i.e. 80% power).

variance: A statistical measure of how variable the population is: the average of the squared differences between the individual observations and the mean. (A sample estimate divides by n - 1 rather than the number of observations.)

delta: The absolute difference between the treatment and control group means that the experiment is designed to detect.
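Putting these definitions together under the standard two-sample z-test approximation (an assumption; the calculator on this page may differ in details such as rounding), the required sample size per group can be sketched in Python using only the standard library:

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_group(alpha, beta, variance, delta, two_tailed=True):
    """n per group = 2 * (z_alpha + z_beta)^2 * variance / delta^2."""
    tail = alpha / 2 if two_tailed else alpha
    z_alpha = NormalDist().inv_cdf(1 - tail)  # critical value for alpha
    z_beta = NormalDist().inv_cdf(1 - beta)   # critical value for power
    return ceil(2 * (z_alpha + z_beta) ** 2 * variance / delta ** 2)

# Two-tailed test, alpha = 0.05, power = 80% (beta = 0.2),
# variance = 1, smallest effect of interest delta = 0.5:
print(sample_size_per_group(0.05, 0.2, 1.0, 0.5))  # 63 per group
```

Note how the sample size scales: halving delta quadruples the required n, since delta enters the formula squared.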

Minimum Detectable Effect Calculations

alpha: The probability of a false positive (Type I) error, i.e. the experimenter declares the effect met their thresholds when in reality noise drove the outcome. Useful to think of it as 1 - confidence level. Commonly set to 0.05 or 0.1.

beta: The probability of a false negative (Type II) error, i.e. the experimenter declares the effect did not meet their thresholds when in reality noise suppressed it. Useful to think of it as 1 - power. Commonly set to 0.2 (i.e. 80% power).

variance: A statistical measure of how variable the population is: the average of the squared differences between the individual observations and the mean. (A sample estimate divides by n - 1 rather than the number of observations.)

proportion: The fraction of the total sample that receives the treatment.

n: The total sample size, i.e. the sum of the sizes of all of the treatment and control groups.
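Inverting the same relationship gives the minimum detectable effect for a fixed total sample size n, with a fraction `proportion` of the sample in the treatment group. This sketch assumes the same two-sample z-test approximation (the function name is illustrative):

```python
from math import sqrt
from statistics import NormalDist

def minimum_detectable_effect(alpha, beta, variance, proportion, n,
                              two_tailed=True):
    """MDE = (z_alpha + z_beta) * sqrt(variance / (n * p * (1 - p)))."""
    tail = alpha / 2 if two_tailed else alpha
    z_alpha = NormalDist().inv_cdf(1 - tail)
    z_beta = NormalDist().inv_cdf(1 - beta)
    # Variance of the difference in means is
    # variance/(n*p) + variance/(n*(1-p)) = variance/(n*p*(1-p)).
    return (z_alpha + z_beta) * sqrt(variance / (n * proportion * (1 - proportion)))

# 126 total samples split evenly between treatment and control,
# alpha = 0.05 (two-tailed), 80% power, variance = 1:
print(round(minimum_detectable_effect(0.05, 0.2, 1.0, 0.5, 126), 2))  # 0.5
```

The p * (1 - p) term in the denominator is largest at p = 0.5, which is why an even split between treatment and control detects the smallest effect for a given total n.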