Central Limit Theorem: Simulation-Based Exploration

Theoretical Foundation

The Central Limit Theorem (CLT) states that, given a sufficiently large sample size, the sampling distribution of the mean of any independent random variable will be approximately normally distributed, regardless of the variable's original distribution.

Key Formula:

If $X_1, X_2, \dots, X_n$ are independent and identically distributed (i.i.d.) random variables with mean $\mu$ and variance $\sigma^2$, then the sample mean:

\[ \bar{X}_n = \frac{1}{n} \sum_{i=1}^{n} X_i \]

has an approximate normal distribution:

\[ \bar{X}_n \sim \mathcal{N}\left(\mu, \frac{\sigma^2}{n}\right) \quad \text{as } n \to \infty \]

Analysis of Range

We explore the following distributions:

1. Uniform Distribution $\sim \mathcal{U}(a, b)$

Mean: $\mu = \frac{a + b}{2}$
Variance: $\sigma^2 = \frac{(b - a)^2}{12}$

2. Exponential Distribution $\sim \text{Exp}(\lambda)$

Mean: $\mu = \frac{1}{\lambda}$
Variance: $\sigma^2 = \frac{1}{\lambda^2}$

3. Binomial Distribution $\sim \text{Bin}(n, p)$

Mean: $\mu = np$
Variance: $\sigma^2 = np(1 - p)$

As $n$ increases, the sample mean distribution approaches:

\[ \bar{X}_n \approx \mathcal{N}(\mu, \frac{\sigma^2}{n}) \]

Practical Applications

Statistical inference: Confidence intervals and hypothesis tests rely on normal approximations.
Quality control: Monitoring product variation via sample averages.
Economics & finance: Portfolio returns modeled via CLT for large $n$.

Implementation with Python Simulation

alt text

1. Simulating Sampling Distributions

🔢 Objective: Simulate three types of populations to explore how their shapes influence the behavior of sample means in the Central Limit Theorem (CLT).

alt text

Python Code: CLT in Quality Control

alt text Diagram Explanation:

The histogram shows the distribution of the means of 1000 random samples, each with 30 product weights.
The red dashed line represents the average of these sample means (≈ population mean).
The bell-shaped curve demonstrates that sample means approximate a normal distribution, validating the Central Limit Theorem.
Explanation: With increasing samples, discrete distributions yield smooth, approximately normal sample mean distributions.

Conclusion

The CLT holds for all tested distributions regardless of their shape.
As sample size $n$ increases, the sample mean distribution converges to a normal distribution with mean $\mu$ and variance $\sigma^2/n$.
Practical experiments confirm theoretical expectations, reinforcing the foundational importance of the CLT in statistical analysis.