TN Online TestSamacheer Kalvi · 1–12

Probability Distributions — Study Notes

Probability Distributions — Study Notes

This chapter is about turning the random outcomes of an experiment into numbers and describing how those numbers are spread out. The two core skills are (a) moving between a probability function and its cumulative form, and (b) computing the mean and variance that summarise a distribution.

1. Random variables: discrete vs. continuous

A random variable \(X\) is a rule that attaches a real number to every outcome of a random experiment. We write capital letters \(X, Y, Z\) for the variable and small letters \(x, y, z\) for the values it can take.

2. Discrete variables: pmf and cdf

The probability mass function (pmf) lists the probability of each value: \(f(x_k)=P(X=x_k)\). A valid pmf must satisfy

\[ f(x_k)\ge 0 \quad\text{for every } k, \qquad \sum_k f(x_k)=1. \]

The cumulative distribution function (cdf) accumulates probability up to a point:

\[ F(x)=P(X\le x)=\sum_{x_k\le x} f(x_k). \]

For a discrete variable the cdf is a step function: flat between successive values, with a jump at each \(x_k\). The size of the jump at \(x_k\) is exactly the probability there, so the pmf can be recovered from the cdf by

\[ f(x_i)=F(x_i)-F(x_{i-1}). \]

3. Continuous variables: pdf and cdf

For a continuous variable the pmf is replaced by a probability density function (pdf) \(f(x)\), which must satisfy

\[ f(x)\ge 0 \quad\text{for all } x, \qquad \int_{-\infty}^{\infty} f(x)\,dx = 1. \]

Probability is now the area under the curve: \(P(a\le X\le b)=\displaystyle\int_a^b f(x)\,dx\). The cdf is the running integral of the density,

\[ F(x)=\int_{-\infty}^{x} f(u)\,du, \]

and, conversely, the density is the derivative of the cdf wherever that derivative exists:

\[ f(x)=\dfrac{dF(x)}{dx}=F'(x). \]

Because single points have zero probability, the symbols \(\lt\) and \(\le\) may be interchanged freely for a continuous variable.

4. Mean and variance

The mean (expected value) locates the centre of a distribution:

\[ \mu=E(X)=\sum_k x_k\,f(x_k)\ \ \text{(discrete)}, \qquad \mu=E(X)=\int_{-\infty}^{\infty} x\,f(x)\,dx\ \ \text{(continuous)}. \]

More generally \(E\big(g(X)\big)=\sum_k g(x_k)f(x_k)\) or \(\displaystyle\int g(x)f(x)\,dx\). The variance measures the spread about the mean:

\[ \operatorname{Var}(X)=E\big[(X-\mu)^2\big]=E(X^2)-\big(E(X)\big)^2, \qquad \sigma=\sqrt{\operatorname{Var}(X)}. \]

The shortcut \(E(X^2)-\mu^2\) is almost always faster than working from the definition. Two linear rules are used constantly:

\[ E(aX+b)=aE(X)+b, \qquad \operatorname{Var}(aX+b)=a^2\operatorname{Var}(X). \]

Adding a constant shifts the mean but leaves the variance unchanged, while a multiplier \(a\) scales the variance by \(a^2\) (not by \(a\)).

5. Special discrete distributions

\[ P(X=x)=\binom{n}{x}p^x q^{\,n-x}, \quad x=0,1,2,\dots,n, \qquad \mu=np,\quad \sigma^2=npq. \]

The three conditions for a binomial model are: a fixed number \(n\) of independent trials, exactly two outcomes per trial, and a constant success probability \(p\).

6. Common exam traps

Solved MCQs → Practice test →

More for this chapter

Solved MCQsAnswers + explanations
Practice TestInteractive · instant score
Formula SheetAll key formulas
Book Back AnswersQuick answer key