7 Continuous Probability Distributions

7.1 Continuous Probability Distributions and Their Properties

“Statistics is the grammar of science.” – Karl Pearson

Guiding question: How do probability distributions work for continuous random variables?

So far, we have focused on discrete outcomes: counts of patients, numbers of mutated alleles, and so on. In those settings we could list the possible values, assign a probability to each one, and check that the probabilities summed to one. Many measurements in medicine and biology, however, can take any value within a range rather than a handful of distinct values. A person’s height could be 170.23 cm or 170.231 cm; the concentration of a hormone in blood plasma might be 2.7 or 2.701 ng/mL. When a random quantity can assume any of the infinitely many values in an interval, we call it a continuous random variable.

Because there are infinitely many possible values, we cannot assign a positive probability to each exact value the way we did for discrete random variables. Instead of assigning probabilities to single points, we assign probabilities to intervals: for example, the chance that a drug’s plasma concentration is between 2.5 and 3.5 ng/mL.

Probability Distribution for Continuous Random Variables

Recall that in Chapter 3 we discussed histograms in which the width of each bar is an interval of values and the height is either the frequency or the relative frequency of the observations that fall in that interval. We could find the relative frequency of the data that fall between any two values by adding the relative frequencies of the bars in that interval. For example, suppose we are looking for the relative frequency of the data shaded in the histogram below.

Thus, the area of these bars is the relative frequency in the interval of interest. In Chapter 5, we stated that we are using the relative frequency interpretation of probability. Therefore, the area shaded in the histogram will estimate the probability of the random variable being in that interval.

Now, let’s think of all of the possible data in the population. In this case, we can make the bars as narrow as we wish. As the bar widths shrink toward zero, we end up with a smooth curve like the one below.

The smooth curve (probability distribution of a continuous random variable) is denoted by the symbol \(f(x)\) and is often called the probability density function (pdf). We can still view the area of the shaded region as the probability.

Because the area under the pdf represents probability, we have \[ P(X=x)=0 \] In other words, the probability at any single point is zero, since there is no area under the curve at a point.

Instead, we find the probability that a continuous random variable falls in an interval: \[ P(a< X< b) \] How do we do this? We find the area under the curve between \(a\) and \(b\) by taking the integral \[ P(a < X < b) = \int_a^b f(x) \; dx. \]

The cumulative distribution function (cdf) of \(X\), denoted \(F(x)\), gives the probability that \(X\) is less than or equal to \(x\); it is the area under the density to the left of \(x\).

In summary:
The probability distribution of a continuous random variable \(X\)
* is represented by a smooth curve
* the curve is called the probability density function (pdf)
* the probability \(P(a<X<b)=P(a\le X\le b)\) is the area under the curve between \(a\) and \(b\)
* the cumulative distribution function (cdf) gives the area to the left of some value: \(F(x)=P(X\le x)\)
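
To make the area interpretation concrete, the following Python sketch approximates \(P(a<X<b)\) by adding up many thin rectangles under a density, which is the histogram idea carried to very narrow bars. The density used here (an exponential curve with rate 1/10), the bounds, and the function names are illustrative assumptions rather than part of the examples in this section.

```python
import math

def density(x, rate=0.1):
    """Illustrative pdf: an exponential density with the given rate (assumed example)."""
    return rate * math.exp(-rate * x) if x >= 0 else 0.0

def area_under_curve(f, a, b, n_bars=10_000):
    """Approximate the area under f between a and b with n_bars thin rectangles (midpoint rule)."""
    width = (b - a) / n_bars
    return sum(f(a + (i + 0.5) * width) for i in range(n_bars)) * width

# P(5 < X < 15) as an area under the density
p_interval = area_under_curve(density, 5, 15)
print(f"P(5 < X < 15) is about {p_interval:.4f}")

# Shrinking the interval around the single point x = 10 drives the probability toward zero
for half_width in (1.0, 0.1, 0.001):
    p = area_under_curve(density, 10 - half_width, 10 + half_width)
    print(f"half-width {half_width}: probability is about {p:.6f}")
```

As the interval around 10 narrows, the printed probabilities shrink toward zero, which is the numerical counterpart of \(P(X=x)=0\).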

Examples from biology and medicine

Drug metabolism. After an oral dose, the amount of a drug in the bloodstream rises and then falls over time. If we pick a random patient and record their peak plasma concentration, that value could be any number within a physiological range. The probability that the peak is exactly 3.000 µg/mL is zero; but we can talk meaningfully about the probability it lies between 2.8 and 3.2 µg/mL, which is the area under the density between those points.
Plant heights. The heights of genetically identical plants grown under controlled conditions will vary due to micro‑environmental factors. Those heights are modeled as a continuous random variable. We might ask, for example, how likely it is for a plant to be taller than 15 cm; again, we look at the area under the density to the right of 15.

When interpreting a pdf, remember that taller regions of the curve correspond to higher probability density, not to the probability of a specific value. Probability comes from the area, not the height at a point.

Mean, Variance, and Standard Deviation of a Continuous Random Variable

Recall that the mean of a discrete random variable is \[ \mu = \sum_x xP(X=x) \]

and the variance is \[ \sigma^2= \sum_x (x-\mu)^2P(X=x) \]

When working with continuous RVs, the sums in the formulas above no longer make sense, since there are infinitely many values to sum over in any given interval.

Instead, we will use integration. The expected value for a continuous RV is \[ { \mu = \int_{-\infty}^{\infty}x f(x) dx } \]

and the variance is \[ { \sigma^2= \int_{-\infty}^{\infty} (x-\mu)^2f(x) dx } \]
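
These integrals can be approximated numerically. The sketch below does so for an assumed exponential density with rate 1/10, chosen only because its mean (10) and variance (100) are known and give something to compare against. The density, the integration range, and the helper names are illustrative assumptions.

```python
import math

RATE = 0.1  # assumed rate for an illustrative exponential density

def f(x):
    """Exponential pdf on x >= 0 (an assumed example density)."""
    return RATE * math.exp(-RATE * x)

def integrate(g, lower, upper, n=200_000):
    """Midpoint-rule approximation of the integral of g from lower to upper."""
    h = (upper - lower) / n
    return sum(g(lower + (i + 0.5) * h) for i in range(n)) * h

# This density is zero below 0 and negligible beyond 400, so we integrate
# over [0, 400] instead of the whole real line.
LOWER, UPPER = 0.0, 400.0

total_area = integrate(f, LOWER, UPPER)
mu = integrate(lambda x: x * f(x), LOWER, UPPER)
sigma_sq = integrate(lambda x: (x - mu) ** 2 * f(x), LOWER, UPPER)
sigma = math.sqrt(sigma_sq)

print(f"area = {total_area:.4f}, mean = {mu:.4f}, variance = {sigma_sq:.4f}, sd = {sigma:.4f}")
```

The total area serves as a sanity check: if it is not close to 1, either the function is not a valid density or the integration range is too short.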

Working in JMP Pro 17

JMP can help you explore continuous distributions experimentally. Here is a general workflow using an exponential example, but you can adapt it to other distributions:

Simulate continuous data. Create a new data table and use Rows → Add Rows to add, say, 1 000 rows. Add a new column and choose Column → Formula. In the formula editor search for Random Exponential(rate) and specify a rate (e.g., 1/10). Each cell will then contain a simulated lifetime.
Visualize the distribution. Use Analyze → Distribution and select your simulated column. JMP produces a histogram and summary statistics. You can overlay a smooth density by clicking the red triangle ▸ next to the variable name and choosing Continuous Fit → Exponential.
Compute probabilities. JMP’s distribution calculator (found under Add‑ins → Calculators → Distribution Calculator in JMP Pro 17) lets you choose a distribution, enter parameter values, and compute the probability that a continuous random variable lies between two values. For the exponential example, choose Exponential, set the rate, and enter the lower and upper bounds to find \(P(a ≤ X ≤ b)\).
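
For readers without JMP at hand, the same simulate-then-compare idea can be sketched in Python. This is only an analogue of the workflow above, not a description of what JMP does internally. The bounds 5 and 15 are illustrative, and the exact value uses the exponential cdf \(F(x)=1-e^{-\text{rate}\cdot x}\), which is a standard result not derived in this section.

```python
import math
import random

random.seed(1)          # fixed seed so the sketch is reproducible
RATE = 1 / 10           # same rate as in the JMP example
N_ROWS = 1000           # number of simulated lifetimes

# random.expovariate takes the rate (not the mean) as its argument
lifetimes = [random.expovariate(RATE) for _ in range(N_ROWS)]

a, b = 5.0, 15.0        # illustrative bounds, not from the text

# Empirical estimate: share of simulated values that fall between a and b
simulated = sum(a <= x <= b for x in lifetimes) / N_ROWS

# Exact value from the exponential cdf F(x) = 1 - exp(-rate * x)
exact = math.exp(-RATE * a) - math.exp(-RATE * b)

print(f"simulated: {simulated:.3f}, exact: {exact:.3f}")
```

With more simulated rows, the simulated proportion should settle closer to the exact area.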

Recap

Keyword	Definition
continuous random variable	A random variable that can take any value in an interval; probabilities are assigned to ranges of values rather than individual points.
probability density function (PDF)	A non‑negative function \(f(x)\) such that \(P(a ≤ X ≤ b)\) equals the area under \(f(x)\) between \(a\) and \(b\) and the total area under the curve of \(f(x)\) is one.
cumulative distribution function (CDF)	The function \(F(x)=P(X ≤ x)\) giving the area under the PDF to the left of \(x\). It increases from 0 to 1 as \(x\) goes from \(-∞\) to \(∞\).

Check your understanding

Explain in your own words why the probability that a continuous random variable equals exactly 5 is zero. How, then, do we assign probabilities for continuous variables?
Sketch or describe the shape of a PDF that would model serum cholesterol levels in a population. Why can’t a PDF ever dip below the horizontal axis?

Solutions

A continuous random variable can take infinitely many values within any interval. Because the PDF spreads probability continuously across these values, the probability of landing on any single point is zero. We obtain meaningful probabilities by integrating the density over an interval to find the area under the curve between the limits.
Serum cholesterol tends to cluster around an average value with fewer extremely low or high values. A plausible PDF would be unimodal and right‑skewed: low near 0, rising to a peak near the typical cholesterol level, and gradually decreasing. The density must always stay at or above zero because probabilities cannot be negative.

7.2 The Uniform Distribution

“Don’t mistake possibilities for probabilities. Anything is possible. It’s the probabilities that matter” – Ray Dalio

Guiding question: How does an equally likely continuous random variable work?

The simplest continuous distribution is the uniform distribution. Imagine selecting a time uniformly at random within a two‑hour window; any minute in that window is just as likely as any other. More formally, a continuous random variable \(X\) has a Uniform\((c,d)\) distribution if its PDF is constant on the interval \((c,d)\) and zero elsewhere.

Since all values in the interval \((c,d)\) are equally likely, the pdf of a uniform random variable appears as a horizontal line:

Note that the area under the curve must equal 1 (since the area corresponds to probability). Therefore, the rectangle between \(c\) and \(d\) must satisfy \[ \begin{align*} \text{area of rectangle} = \text{base}\times \text{height} &\Longrightarrow{ 1 = (d-c) \times \text{height}}\\ &\Longrightarrow \text{height} = \frac{1}{d-c} \end{align*} \]

So, the pdf of a uniform random variable is \[ f(x) = \frac{1}{d-c} \qquad \text{for } c\le x\le d \] and \(f(x)=0\) elsewhere.

The expected value is \[ \begin{align*} E(X)=\int_{c}^d xf(x)dx &= \int_c^dx\left(\frac{1}{d-c}\right)dx\\ & {= \left(\frac{1}{d-c}\right)\int_c^dxdx}\\ & {= \left(\frac{1}{d-c}\right)\left(\frac{1}{2}\right)x^2\Big\vert^d_c} \\ & {= \left(\frac{1}{d-c}\right)\left(\frac{1}{2}\right)\left(d^2-c^2\right)} \\ & {= \left(\frac{1}{d-c}\right)\left(\frac{1}{2}\right)\left(d-c\right)\left(d+c\right)} \\ & {= \frac{c+d}{2}} \end{align*} \]

We will not show the steps here but we could find the variance in a similar fashion to get \[ \sigma^2 = \frac{\left(d-c\right)^2}{12} \] The standard deviation is then \[ \sigma = \frac{\left(d-c\right)}{\sqrt{12}} \]
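
For readers who want to see the omitted steps, one possible route is sketched here. It relies on the shortcut \(\sigma^2 = E(X^2)-\mu^2\), a standard identity that has not been introduced in this chapter, together with \(\mu=\frac{c+d}{2}\) from above:

\[ \begin{align*} E(X^2) &= \int_c^d x^2\left(\frac{1}{d-c}\right)dx = \frac{d^3-c^3}{3(d-c)} = \frac{c^2+cd+d^2}{3}\\ \sigma^2 &= E(X^2)-\mu^2 = \frac{c^2+cd+d^2}{3}-\frac{(c+d)^2}{4} = \frac{c^2-2cd+d^2}{12} = \frac{(d-c)^2}{12} \end{align*} \]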

For a uniform random variable \(X\) and values \(c\le a<b\le d\), what is the probability \(P(a<X<b)\)? \[ \begin{align*} P(a<X<b)=\int_{a}^b f(x)dx & = \int_{a}^b \frac{1}{d-c} dx\\ & {= \left(\frac{1}{d-c}\right)x\Big\vert_a^b}\\ &{= \left(\frac{b-a}{d-c}\right)} \end{align*} \]

Summary of Uniform RVs:

the pdf is \(f(x)=\frac{1}{d-c}\qquad c\le x\le d\)
the mean is \(\mu = \frac{c+d}{2}\) and the standard deviation is \(\sigma=\frac{d-c}{\sqrt{12}}\)
\(P(a<X<b)=\frac{b-a}{d-c}\) for \(c\le a<b\le d\)
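
The summary formulas translate directly into a few small Python functions. The function names are hypothetical, and clipping the interval to \([c,d]\) in `uniform_prob` is an added convenience (so that bounds outside the support are handled), not something stated in the text.

```python
import math

def uniform_pdf(x, c, d):
    """Height of the Uniform(c, d) density at x: 1/(d - c) inside [c, d], 0 outside."""
    return 1 / (d - c) if c <= x <= d else 0.0

def uniform_mean(c, d):
    """Mean of a Uniform(c, d) random variable."""
    return (c + d) / 2

def uniform_sd(c, d):
    """Standard deviation of a Uniform(c, d) random variable."""
    return (d - c) / math.sqrt(12)

def uniform_prob(a, b, c, d):
    """P(a < X < b) for X ~ Uniform(c, d); the interval is clipped to [c, d] first."""
    lo, hi = max(a, c), min(b, d)
    return max(hi - lo, 0.0) / (d - c)

# Illustrative values: X ~ Uniform(0, 2)
print(uniform_mean(0, 2), uniform_sd(0, 2), uniform_prob(0.5, 1, 0, 2))
```

Because the density is flat, only the length of the interval matters: `uniform_prob(0.5, 1, 0, 2)` and `uniform_prob(1.5, 2, 0, 2)` give the same answer.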

Examples from medicine and biology

Patient arrival time. Suppose a clinic accepts blood samples from 8 am to 10 am and the phlebotomist expects donors to arrive at random. Let \(T\) be the arrival time after 8 am (in hours). If arrivals are equally likely at any moment, \(T \sim \text{Uniform}(0,2)\). The probability that a randomly arriving donor comes between 8:30 and 9:00 am (i.e., \(0.5 ≤ T ≤ 1\)) is \[ \begin{align*} P(0.5 ≤ T ≤ 1)=&\frac{1-0.5}{2-0}\\ =&0.25 \end{align*} \]
Randomized drug administration. In a study, participants are randomly assigned to take a dose of medication at any time between noon and 3 pm. The time of ingestion \(T\), measured in hours after noon, is Uniform\((0,3)\). If we want the probability that a dose is taken in the first half‑hour, we compute \[ \begin{align*} P(0 ≤ T ≤ 0.5)=&\frac{0.5-0}{3-0}\\ \approx&0.167 \end{align*} \]

Working in JMP Pro 17

Uniform simulations are straightforward in JMP:

Generate uniform random values. In a new data table, choose Rows → Add Rows to add your desired number of observations. Use Column → Formula, find the function Random Uniform, and specify the lower and upper bounds \(c\) and \(d\).
Visualize and compute probabilities. Use Analyze → Distribution to produce a histogram. Since the density is flat, the histogram should approximate a rectangle when you use many bins. To compute \(P(a ≤ X ≤ b)\) without simulation, use the distribution calculator: select Uniform from the list, enter \(c\) and \(d\), and set the lower and upper limits. JMP will report the probability \((b-a)/(d-c)\).

Recap

Keyword	Definition
uniform distribution	A continuous distribution on \((c,d)\) whose PDF is constant at height \(1/(d-c)\). All intervals of equal length within \((c,d)\) have equal probability.

Check your understanding

Suppose \(X\sim\text{Uniform}(0,10)\). What is \(P(3 ≤ X ≤ 7)\)? Explain your reasoning.
A researcher measures the pH of soil samples collected uniformly at random along a transect from 0 to 100 m. What is the probability that a randomly selected soil sample comes from between 20 m and 35 m? Express your answer numerically.
If \(Y\sim\text{Uniform}(c,d)\) and you know that \(P(Y ≤ 5) = 0.5\), what relationship does this imply between \(c\), \(d\) and 5?

Solutions

The interval from 3 to 7 has length 4. Since the distribution is Uniform\((0,10)\), the probability of any subinterval equals its length divided by the total length: \(4/10=0.4\).
The transect is 100 m long. The segment from 20 to 35 m is 15 m long, so \(P(20 ≤ X ≤ 35) = 15/100 = 0.15\).
For a uniform distribution, \(P(Y ≤ y) = (y-c)/(d-c)\) for \(c ≤ y ≤ d\). Setting \(P(Y≤5)=0.5\) implies \((5 - c)/(d - c) = 0.5\). Equivalently, \(5\) is the midpoint of the interval and \(5 = (c + d)/2\).

7.3 The Normal Distribution

“the normal distribution is seldom, if ever, observed in nature.” – Louis Guttman

Guiding question: Why is the normal distribution so commonly used?

The normal distribution (also called the Gaussian distribution) is the most celebrated continuous distribution in statistics. It is the foundation for much of statistical inference.

A \(N(\mu,\sigma)\) distribution is symmetric, bell‑shaped and centered at its mean \(\mu\). In a normal distribution, the mean, median and mode all coincide, and the curve extends infinitely in both directions. The two parameters \(\mu\) and \(\sigma\) control the center and spread of the distribution:

\(\mu\) is the location of the peak and
\(\sigma\) is the standard deviation (the “width” of the bell).

The density function of a \(N(\mu,\sigma)\) random variable can be written as

\[ f(x) = \frac{1}{\sigma\sqrt{2\pi}} \exp\left( - \frac{1}{2\sigma^2}(x - \mu)^2 \right). \]

You do not need to memorize this formula to use the normal model. More important are its qualitative properties: it is unimodal, symmetric, and tails off smoothly; the total area under the curve is one.
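
The qualitative properties can be checked numerically. The sketch below codes the density formula as written, then confirms symmetry, the location of the peak, and a total area of about one. The parameter values 175 and 7 are illustrative, and `statistics.NormalDist` from Python's standard library is used only as a cross-check.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Normal density, coded term by term from the formula above."""
    return (1 / (sigma * math.sqrt(2 * math.pi))) * math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))

MU, SIGMA = 175, 7  # illustrative parameter values

# Symmetry: points equally far from the mean have the same density
left = normal_pdf(MU - 10, MU, SIGMA)
right = normal_pdf(MU + 10, MU, SIGMA)

# Peak: the density is highest at the mean
peak = normal_pdf(MU, MU, SIGMA)

# Total area: a midpoint-rule sum over mu +/- 10 sigma should be very close to 1
n, lo, hi = 100_000, MU - 10 * SIGMA, MU + 10 * SIGMA
h = (hi - lo) / n
total_area = sum(normal_pdf(lo + (i + 0.5) * h, MU, SIGMA) for i in range(n)) * h

# Cross-check against the standard library's normal distribution
builtin_peak = NormalDist(MU, SIGMA).pdf(MU)

print(left, right, peak, builtin_peak, total_area)
```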

Examples from medicine and biology

Adult heights. Within a homogeneous population, adult heights tend to cluster around an average and taper off symmetrically. For example, men’s heights might follow a \(N(175,7)\) distribution, measured in centimetres. Values far below or far above 175 cm are increasingly rare.
Gene expression levels. In microarray experiments, the log‑transformed expression levels of many genes approximate normality. This allows researchers to use statistical tests that assume normally distributed data.

Working in JMP Pro 17

To explore normal distributions in JMP:

Simulate normal data. Create a new column with Column → Formula and use Random Normal(µ, σ) to generate values. For example, Random Normal(175,7) will simulate heights in centimetres with mean 175 and standard deviation 7.
Visualize the distribution. Use Analyze → Distribution to generate a histogram and overlay a fitted normal curve. Click the red triangle ▸ next to the variable and choose Continuous Fit → Normal. JMP displays parameter estimates and a density overlay.

Recap

Keyword	Definition
normal distribution	A continuous, symmetric bell‑shaped distribution defined by its mean \(\mu\) and standard deviation \(\sigma\); mean = median = mode.
standard normal distribution	The special case Normal\((0,1)\); its values are often called z‑scores, and any normal distribution can be standardized via \(z = (x - \mu)/\sigma\).

Check your understanding

What does it mean that the mean, median and mode of a normal distribution are equal? How is this reflected in the shape of the curve?
For a \(N(150,20)\) distribution (representing, say, birth weights in grams), approximately what percentage of babies weigh between 110 g and 190 g?
Explain why extreme values (more than 3 standard deviations from the mean) are considered unusual under the normal model.

Solutions

A normal curve is perfectly symmetric about its mean; the highest point occurs at \(\mu\) and the curve declines equally on both sides. Because of this symmetry, the most typical value (the mode), the point dividing the distribution in half (the median) and the arithmetic average (the mean) coincide.
Two standard deviations on either side of the mean cover about 95% of the data. The interval from \(\mu-2\sigma = 150 - 40 = 110\) to \(\mu+2\sigma = 190\) therefore captures roughly 95% of birth weights.
Under the normal model, only about 0.3% of observations lie beyond three standard deviations from the mean by the empirical rule. Thus values outside that range are rare and often signal measurement error or a departure from normality.
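
The percentages quoted in these solutions can be checked with a short Python sketch. It uses `statistics.NormalDist` from the standard library, and the helper name `area_within` is hypothetical.

```python
from statistics import NormalDist

Z = NormalDist(0, 1)  # standard normal distribution

def area_within(k):
    """Area under the standard normal curve within k standard deviations of the mean."""
    return Z.cdf(k) - Z.cdf(-k)

for k in (1, 2, 3):
    inside = area_within(k)
    print(f"within {k} sd: {inside:.4f}, beyond: {1 - inside:.4f}")
```

The areas depend only on the number of standard deviations, not on \(\mu\) or \(\sigma\), so the same percentages apply to any normal distribution.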

7.4 Finding Probability for a Normal Distribution

“I once worked with a guy for three years and never learned his name. Best friend I ever had. We still never talk sometimes.” -Ron Swanson

Guiding question: How can we find probabilities using the normal model?

Once we determine the normal model for a variable, we can compute probabilities by standardizing the variable to a z‑score. Given \(X\sim N(\mu,\sigma)\), the z‑score corresponding to a value \(x\) is

\[ z = \frac{x - \mu}{\sigma}. \]

This transformation rescales and recenters \(X\) so that \(Z \sim N(0,1)\). We then look up the area under the standard normal curve up to \(z\) (or beyond) using tables or software such as JMP. Because the normal distribution is continuous, we always compute probabilities for intervals, not exact points.

Step‑by‑step procedure

State the distribution. Identify \(\mu\) and \(\sigma\) for your normal variable.
Draw a sketch. Label the mean and the point(s) of interest on a bell curve. Shading the region corresponding to the probability helps visualize whether you need the area to the left, right or between two points.
Compute z‑scores. For each boundary \(x\) compute \(z=(x-\mu)/\sigma\).
Use a table or software. For the standard normal distribution, tables give \(P(Z ≤ z)\) for many \(z\) values. For probabilities of the form \(P(X ≥ x)\) or \(P(a ≤ X ≤ b)\), convert to z‑scores and use the fact that \(P(Z > z) = 1 - P(Z ≤ z)\) and \(P(a ≤ X ≤ b) = P(z_a ≤ Z ≤ z_b)\).
Interpret in context. State your answer in terms of the original problem.
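
The computational steps of this procedure can be sketched in Python. Here the standard normal cdf is obtained from the error function, \(P(Z\le z)=\tfrac12\left(1+\operatorname{erf}(z/\sqrt2)\right)\), a standard identity not covered in the text. The function names and the \(N(100,15)\) values are illustrative assumptions.

```python
import math

def phi(z):
    """Standard normal cdf, P(Z <= z), computed from the error function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

def z_score(x, mu, sigma):
    """Number of standard deviations that x lies from the mean."""
    return (x - mu) / sigma

def prob_left(x, mu, sigma):
    """P(X <= x)"""
    return phi(z_score(x, mu, sigma))

def prob_right(x, mu, sigma):
    """P(X >= x) = 1 - P(X <= x)"""
    return 1 - prob_left(x, mu, sigma)

def prob_between(a, b, mu, sigma):
    """P(a <= X <= b) = P(Z <= z_b) - P(Z <= z_a)"""
    return phi(z_score(b, mu, sigma)) - phi(z_score(a, mu, sigma))

# Illustrative values (assumed): X ~ N(100, 15)
print(prob_left(85, 100, 15), prob_right(130, 100, 15), prob_between(85, 115, 100, 15))
```

Sketching the curve first, as in the second step, remains the best guard against picking the wrong one of these three functions.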

Example: antihypertensive drug

Suppose the reduction in systolic blood pressure (SBP) after taking a new antihypertensive follows a \(N(10,4)\) distribution, measured in mm Hg. What is the probability that a randomly selected treated patient experiences a reduction of at least 15 mm Hg?

Let \(X\) be the reduction in SBP. We want \(P(X ≥ 15)\).

Compute the z‑score.

\[ z=\frac{15-10}{4}=1.25 \]

Find the area to the right. Using the Normal Probability Calculator in JMP 18 Student Edition (Student → Applets → Distribution Calculator):

From this calculator we have: \[ P(Z ≥ 1.25)=0.1056 \]

Using the Normal Calculator also allows us to find the probability without taking a z-score first. We just need to change the values of Mean and Std. Dev. in the applet.

Interpretation. About 10.6% of patients have a reduction of at least 15 mm Hg.
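
The same number can be reproduced outside JMP. The sketch below uses `statistics.NormalDist` from Python's standard library and shows both routes: working directly on the original scale, and standardizing first. The variable names are illustrative.

```python
from statistics import NormalDist

reduction = NormalDist(mu=10, sigma=4)   # X ~ N(10, 4) from the example

# Route 1: work directly on the original scale
p_direct = 1 - reduction.cdf(15)

# Route 2: standardize first, then use the standard normal distribution
z = (15 - 10) / 4
p_via_z = 1 - NormalDist().cdf(z)

print(f"z = {z}, P(X >= 15) = {p_direct:.4f} (direct), {p_via_z:.4f} (via z)")
```

Both routes agree because standardizing only relabels the horizontal axis; the shaded area is unchanged.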

Working in JMP Pro 17

JMP’s distribution calculator makes these computations straightforward:

Open the distribution calculator. In JMP Pro 17 go to Add‑ins → Calculators → Distribution Calculator. Choose Normal from the list of distributions.
Enter parameters. Enter the mean and standard deviation (e.g., 10 and 4) and specify whether you want the area Left, Right or Between two values. For a “greater than” probability like \(P(X ≥ 15)\), choose Right and enter 15. JMP will display the area to the right.
Visual check. The calculator shows a graph of the normal curve with the relevant region shaded. Use this to verify that you selected the correct tail or interval.

Recap

Keyword	Definition
z‑score	The number of standard deviations a value \(x\) is from the mean: \(z=(x-\mu)/\sigma\). Converting to z‑scores allows probabilities from any normal distribution to be found using the standard normal distribution.

Check your understanding

Cholesterol reductions after a dietary intervention follow a \(N(20,5)\) distribution. What is the probability that a randomly selected participant’s reduction is less than 12 mg/dL? Show the z‑score and compute the probability.
Serum calcium levels in a normal population have mean 9.5 mg/dL and standard deviation 0.4 mg/dL. What proportion of individuals have calcium levels between 9.1 and 9.9 mg/dL? Sketch the problem and find the probability.

Solutions

Compute \(z=(12-20)/5=-1.6\). From a standard normal table, \(P(Z ≤ -1.6)\approx0.0548\). Therefore \(P(X ≤ 12)=0.0548\).
First convert the endpoints to z‑scores: \(z_1=(9.1-9.5)/0.4=-1.0\) and \(z_2=(9.9-9.5)/0.4=1.0\). Using the calculator, we have \(P(-1\le Z \le 1)=0.6827\), so about 68% of individuals have calcium levels between 9.1 and 9.9 mg/dL.

7.5 Finding a Quantile for a Normal Distribution

“Two things are infinite: the universe and human stupidity; and I’m not sure about the universe.” – Albert Einstein

Guiding question: How can we find quantiles using the normal model?

In many applications we know a desired probability and wish to find the corresponding value of \(x\) such that \(P(X ≤ x)=p\). This value is called a quantile or percentile of the distribution. Mathematically, the quantile function \(F^{-1}(p)\) is the inverse of the cumulative distribution function. For a normal distribution, the quantile function returns the \(x\) value whose cumulative probability is \(p\).

Relationship between the CDF and quantiles

The cumulative distribution function \(F(x)\) gives the probability that a random variable \(X\) is less than or equal to \(x\). The quantile function does the reverse: it takes a probability \(p\) and returns the threshold \(x\) such that \(P(X ≤ x)=p\). For example, the 0.5 quantile is the median. Because the normal CDF does not have a simple algebraic inverse, quantiles are typically obtained from tables or software.
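
One way software can reverse the CDF is by a numerical search. The sketch below uses bisection, which is a simple illustrative choice and not necessarily the method any particular package uses. The function name and the search bounds of \(\pm 10\) are assumptions.

```python
from statistics import NormalDist

def quantile_by_bisection(cdf, p, lo, hi, tol=1e-10):
    """Find x with cdf(x) = p by repeatedly halving an interval [lo, hi] that brackets it."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if cdf(mid) < p:
            lo = mid      # too little area to the left of mid: search to the right
        else:
            hi = mid      # enough area to the left of mid: search to the left
    return (lo + hi) / 2

Z = NormalDist()  # standard normal distribution

median = quantile_by_bisection(Z.cdf, 0.5, -10, 10)
q975 = quantile_by_bisection(Z.cdf, 0.975, -10, 10)

print(f"0.5 quantile: {median:.4f}; 0.975 quantile: {q975:.4f}")
print(f"library inverse cdf for comparison: {Z.inv_cdf(0.975):.4f}")
```

The search works because the CDF never decreases: if the area to the left of a trial value is too small, the quantile must lie to its right.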

Certain quantiles of the standard normal distribution show up repeatedly in statistical methods that we will discuss later. We denote them by \(z_p\), defined by \[ p = P(Z> z_p) \] In other words, \(z_p\) is the value of the standard normal distribution that has area \(p\) to its right. For example, \(z_{0.05}\) is the value that has 0.05 area to the right.

Here the value is \[ z_{0.05}=1.645 \]
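
Since \(z_p\) has area \(p\) to its right, it has area \(1-p\) to its left, so it can be obtained from an inverse CDF evaluated at \(1-p\). A minimal Python sketch follows, with a hypothetical helper name `z_upper` and using `statistics.NormalDist` from the standard library:

```python
from statistics import NormalDist

def z_upper(p):
    """Return z_p, the standard normal value with area p to its right."""
    # Area p to the right means area 1 - p to the left.
    return NormalDist().inv_cdf(1 - p)

for p in (0.10, 0.05, 0.025):
    print(f"z_{p} = {z_upper(p):.4f}")
```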

Procedure for finding quantiles

Specify the probability \(p\). Decide whether \(p\) is the area to the left of the quantile (a lower‑tail quantile such as the 5th percentile), the area to the right (an upper‑tail area, as in \(z_p\)), or the central area between two bounds.
Find the corresponding z‑score. For the standard normal distribution, tables and software can give you the quantile.
Transform back to \(x\). If \(X\sim N(\mu,\sigma)\), then \(x=\mu + z\sigma\) gives the desired quantile. Some software can find the quantile of \(X\) directly from \(\mu\) and \(\sigma\), in which case there is no need to find the \(z\)‑score first.

Example: therapeutic drug monitoring

Suppose therapeutic blood levels of a drug after dosing follow a \(N(50,10)\) distribution. Physicians want to define the upper control limit beyond which a concentration is considered dangerously high. If they choose the 97.5th percentile as the cut‑off, what value should they use?

Find the 97.5th percentile for \(z\). The 97.5th percentile has area 0.025 to its right, so in the special quantile notation this is \(z_{0.025}\). Using the Normal Probability Calculator in JMP 18 Student Edition (Student → Applets → Distribution Calculator), we find \(z_{0.025}\approx 1.96\).
Transform back.
\[ \begin{align*} x=&\mu + z\sigma\\ =& 50 + 1.96\times10\\ =& 69.6 \end{align*} \]
Interpretation. Only 2.5% of patients are expected to have concentrations above 69.6 ng/mL. Concentrations exceeding this threshold may warrant intervention.
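
The cut‑off in this example can be reproduced with a short Python sketch using `statistics.NormalDist` from the standard library. It shows the route through \(z\), the direct route, and a check with the CDF. The variable names are illustrative.

```python
from statistics import NormalDist

MU, SIGMA = 50, 10   # X ~ N(50, 10) from the example
P = 0.975            # the 97.5th percentile

# Route 1: find the z quantile, then transform back with x = mu + z * sigma
z = NormalDist().inv_cdf(P)
x_from_z = MU + z * SIGMA

# Route 2: ask for the quantile of X directly
x_direct = NormalDist(MU, SIGMA).inv_cdf(P)

# Check with the cdf: the area to the left of the cut-off should be 0.975
check = NormalDist(MU, SIGMA).cdf(x_direct)

print(f"z = {z:.2f}, cut-off = {x_from_z:.1f} (via z), {x_direct:.1f} (direct), check = {check:.3f}")
```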

Working in JMP Pro 17

To find quantiles in JMP:

Use the distribution calculator. Open the distribution calculator and select Normal. Switch to the Percentile (or Inverse) mode. Enter the probability \(p\) (e.g., 0.975) and the distribution parameters \(\mu\) and \(\sigma\). JMP returns the corresponding \(x\) value.
Check with the CDF. You can verify your result by switching back to the probability mode and entering the value \(x\) you found. The calculator should return \(p\).

Recap

Keyword	Definition
quantile (percentile)	For a continuous distribution with CDF \(F\), the value \(x=F^{-1}(p)\) such that \(P(X ≤ x)=p\).

Check your understanding

Birth weights in a population follow a \(N(3.5,0.4)\) distribution (in kg). What is the weight corresponding to the 90th percentile? Show your calculation.
An assay has measurement errors that are \(N(0,2)\) distributed. What cut‑offs define the central 80% of the error distribution (i.e., the range from the 10th to the 90th percentile)?
Explain the relationship between the CDF and the quantile function in your own words. Why do we need tables or software to find quantiles for the normal distribution?

Solutions

The 90th percentile has area 0.10 to its right, so the z‑score is \(z_{0.10}\approx1.2816\). Then \(x=3.5 + 1.2816\times0.4 \approx 4.0126\) kg. Therefore, about 10% of babies weigh more than approximately 4.0 kg.
The central 80% corresponds to the interval between the 10th and 90th percentiles. From a standard normal table \(z_{0.90}\approx-1.2816\) and \(z_{0.10}\approx1.2816\). Multiply by \(\sigma=2\) and add the mean 0: the interval is from \(-1.2816×2≈-2.5632\) to \(1.2816×2≈2.5632\).
The CDF gives the probability that a random variable is less than or equal to \(x\). The quantile function reverses this: given a probability \(p\), it returns the \(x\) for which the CDF equals \(p\). The normal CDF has no simple algebraic inverse, so we rely on tables or software to compute quantiles.