Standard Error

category_specifier : "Statistics"

Reference Docs: Hypothesis Testing|Using Control Variables

Context

In A/B test, we want to improve the precision of the estimator for treatment effect (coefficient of the treatment variable)

→ Understand what affects the precision

Standard Error is useful in understanding how reliable our sample statistic is as an estimator of the population parameter
Used to construct a confidence interval, which provides margin of error in our estimator
Used in the Hypothesis Testing

Standard error measures standard deviation of the sampling distribution of a statistic (such as mean)
It explains the accuracy of a sample statistic in representing the population parameter
Lower standard error → More precise sample statistic

Example: Standard Error of the Mean (SEM)

\[ SEM = \frac{s}{\sqrt{n}} \]

Standard Error of the Regression Coefficient

\[ \widehat{SE}(\hat{\beta}_1) = \sqrt{\frac{s^2}{\sum_{i=1}^n (x_i - \bar{x})^2}} = \sqrt{\frac{s^2}{(N-1)s_x^2}} \]

Measures the variability of the estimated regression coefficients if we were to repeatedly sample and run regressions
It explains how close the estimated treatment effect (reflected in the regression coefficient) is to the true treatment effect
Relies on the three factors:
To improve the precision of the coefficient estimator, we control these factors
1. \(N\) : Sample Size
2. \(s_x^2\) : Variability in X
3. \(s^2\) : Variance of regression residual