Sampling Methods
Sampling methods enable efficient analysis of large populations by studying representative subsets. At MathMultiverse, we explore simple random sampling and stratified sampling, with clear formulas, examples, visualizations, and applications in data science and statistics.
Simple Random Sampling
Every unit has an equal chance of selection:
Sample mean and variance:
Example: \(N = 1000\), \(n = 100\), \(\sigma = 10\), \(\text{Var}(\bar{x}) \approx 0.901\).
Stratified Sampling
Divides population into strata, samples proportionally:
Optimal (Neyman) allocation:
Mean and variance:
Example: \(N = 1000\), \(n = 100\), \(n_1 = 60\), \(n_2 = 40\), \(\text{Var}(\bar{x}_{\text{st}}) \approx 1.352\).
Examples
Simple Random Sampling
Population: 10,000 employees, sample 500. Variance of proportion:
Stratified Sampling
Proportional: \(n_1 = 300\), \(n_2 = 200\). Lower variance ensures balance.
Visualizations
Sample Mean Variance Comparison
Applications
- Election Polling: Margin of error \( \approx 0.031 \) for \( n = 1000 \).
- Quality Control: Defect rate variance \( \approx 0.000471 \).
- Health Studies: Stratified sampling for accurate prevalence.
- Data Science: Sampling 1% of 1TB data saves time.