A dataset follows a perfect bell-shaped distribution. Approximately what proportion of observations falls between the mean minus 3 standard deviations and the mean plus 3 standard deviations?
a) 67.8%
b) 94.8%
c) 99.7%
d) 97.9%
e) 91.5%
Answer: c) 99.7%
Detailed explanation
This is based on the empirical rule for a normal distribution, also called the Gaussian distribution. The normal distribution is symmetric and centered around the mean. In such a distribution:
- About 68.3% of values lie within 1 standard deviation of the mean
- About 95.4% of values lie within 2 standard deviations of the mean
- About 99.7% of values lie within 3 standard deviations of the mean
So when the question asks for the percentage within 3 standard deviations, the answer is 99.7%.
This rule is extremely important in medicine, biostatistics, and exam questions because it helps you quickly estimate how data are spread around the mean. For example, if a lab value is more than 2 or 3 standard deviations away from the mean, it may be considered unusual or statistically rare.
Why the other values are wrong:
- 68.3% corresponds to within 1 SD, not 3 SD
- 95.4% corresponds to within 2 SD, not 3 SD
- Numbers like 97.9% or 91.5% are distractors and are not standard empirical-rule values for the normal distribution
Also remember:
- Standard deviation tells you how spread out the data are around the mean
- Variance = SD²
- SD = square root of variance
Another high-yield point is that 95% of values lie within about 1.96 SD of the mean, which is why mean ± 1.96 SD is linked to the 95% interval in many statistical contexts.
Cheat sheet for exam
Normal distribution / bell curve
- Symmetrical distribution
- Mean = median = mode
- Central values are most common
- Extreme values become progressively rarer
Empirical rule
- 1 SD → 68.3%
- 2 SD → 95.4%
- 3 SD → 99.7%
Ultra-high-yield memory trick
- Think: 68 → 95 → 99.7
- This is the classic normal distribution rule
Standard deviation
- Measures spread around the mean
- Small SD = values tightly clustered
- Large SD = values widely scattered
- SD = √variance
Exam pearl
- 95% of values ≈ mean ± 1.96 SD
- This is often linked with the 95% confidence interval
Clinical use
- Interpreting lab ranges
- Understanding outliers
- Assessing variation in biological data
- Basis of many statistical tests and confidence intervals
Flash cards
Q: What percentage of values in a normal distribution lies within 1 standard deviation of the mean?
A: 68.3%
Explanation: This is the first part of the empirical rule and represents the central bulk of the distribution.
Q: What percentage of values in a normal distribution lies within 2 standard deviations of the mean?
A: 95.4%
Explanation: This covers most observations and is a very commonly tested number.
Q: What percentage of values in a normal distribution lies within 3 standard deviations of the mean?
A: 99.7%
Explanation: Nearly all values lie within this range in a bell-shaped distribution.
Q: In a normal distribution, what is the relationship among mean, median, and mode?
A: They are equal
Explanation: Because the distribution is perfectly symmetric, all three central tendency measures coincide.
Q: What does standard deviation measure?
A: The spread or dispersion of values around the mean
Explanation: It tells you how tightly or loosely the data cluster around the center.
Q: How is standard deviation related to variance?
A: Standard deviation is the square root of variance
Explanation: Variance is the average squared deviation from the mean, and SD brings it back to the original units.
Q: What value of SD on either side of the mean contains about 95% of values?
A: About 1.96 SD
Explanation: This is the basis for the familiar 95% interval in statistics.
Q: Why is the normal distribution important in medicine?
A: Many biological variables approximately follow it
Explanation: This makes it essential for interpreting measurements, ranges, and statistical analyses.
MCQs to cover the most important points
MCQ 1
In a truly normal distribution, the proportion of observations located between the mean minus 2 SD and mean plus 2 SD is closest to:
a) 68.3%
b) 95.4%
c) 99.7%
d) 90.0%
Answer: b) 95.4%
Explanation: The empirical rule states that about 68.3% of observations lie within 1 SD, 95.4% within 2 SD, and 99.7% within 3 SD. This is a classic statistics fact and frequently tested.
MCQ 2
Which of the following best describes the standard deviation?
a) The average of all data points
b) The most frequent value in the dataset
c) The square of the mean
d) A measure of dispersion around the mean
Answer: d) A measure of dispersion around the mean
Explanation: Standard deviation quantifies the spread of data around the mean. It does not tell you the average itself, nor the most frequent value. It reflects variability, which is central to interpreting data distributions.
MCQ 3
Which of the following is false regarding a normal distribution?
a) It is symmetric
b) Mean, median, and mode are equal
c) About 68.3% of values lie within 1 SD of the mean
d) About 99.7% of values lie within 2 SD of the mean
Answer: d) About 99.7% of values lie within 2 SD of the mean
Explanation: This is false because 99.7% corresponds to 3 SD, not 2 SD. Within 2 SD, the correct value is about 95.4%. The other statements are true properties of the normal distribution.
MCQ 4
If the variance of a dataset is 49, the standard deviation is:
a) 7
b) 14
c) 24.5
d) 49
Answer: a) 7
Explanation: Standard deviation is the square root of variance. Since √49 = 7, the SD is 7. This is a direct formula-based question and often appears in simple biostatistics items.
MCQ 5
A clinician says that a value is very unusual because it lies more than 3 SD away from the mean in a normally distributed variable. This is because:
a) Fewer than 1% of values lie beyond 3 SD on either side combined
b) Exactly 5% of values lie beyond 3 SD
c) Half of all values lie beyond 3 SD
d) Values beyond 3 SD are the most common observations
Answer: a) Fewer than 1% of values lie beyond 3 SD on either side combined
Explanation: Since 99.7% of values lie within 3 SD, only about 0.3% lie outside that range in total. Therefore, a value beyond 3 SD is statistically rare and often considered an outlier.
MCQ 6
Which one of the following percentages is correctly matched with the number of standard deviations from the mean in a normal distribution?
a) 1 SD → 95.4%
b) 2 SD → 68.3%
c) 3 SD → 99.7%
d) 3 SD → 95%
Answer: c) 3 SD → 99.7%
Explanation: The correct empirical sequence is 68.3%, 95.4%, 99.7% for 1, 2, and 3 SD respectively. The incorrect options are deliberately reversed or distorted.
MCQ 7
Which of the following is false?
a) The normal distribution is also called the Gaussian distribution
b) The bell curve is asymmetric
c) Standard deviation is expressed in the same units as the original variable
d) Variance is the square of the standard deviation
Answer: b) The bell curve is asymmetric
Explanation: This is false because the normal distribution is symmetric. That symmetry is why mean, median, and mode coincide. Standard deviation indeed remains in the original units, and variance is SD squared.
MCQ 8
The statement “95% of observations lie within mean ± 1.96 SD” is most closely related to:
a) The full definition of variance
b) The 95% interval under a normal distribution
c) The calculation of the mode
d) A skewed distribution only
Answer: b) The 95% interval under a normal distribution
Explanation: The value 1.96 is the key z-score that captures the central 95% of a normal distribution. This is foundational in confidence intervals and hypothesis testing.
MCQ 9
In a perfectly normal distribution, which measure of central tendency differs from the others?
a) Mean
b) Median
c) Mode
d) None of them differ
Answer: d) None of them differ
Explanation: In a normal distribution, the central tendency measures are identical because of perfect symmetry. This is an essential structural property of the Gaussian curve.
MCQ 10
A laboratory value lies 1 standard deviation above the mean in a normal distribution. Which statement is most accurate?
a) It is extremely rare
b) It lies within the central majority of observations
c) It is definitely abnormal
d) It lies outside the 95% range
Answer: b) It lies within the central majority of observations
Explanation: About 68.3% of values lie within 1 SD of the mean, so being 1 SD above the mean is very common and not inherently abnormal. Clinical abnormality depends on context, not merely statistical distance.
MCQ 11
If data are tightly clustered around the mean, the standard deviation will be:
a) Large
b) Negative
c) Small
d) Equal to the mean
Answer: c) Small
Explanation: A small SD means the values do not deviate much from the mean. SD can never be negative, and it does not have to equal the mean.
MCQ 12
Which one of the following sequences is correct for a normal distribution?
a) 68.3%, 95.4%, 99.7% for 1 SD, 2 SD, 3 SD
b) 95.4%, 68.3%, 99.7% for 1 SD, 2 SD, 3 SD
c) 99.7%, 95.4%, 68.3% for 1 SD, 2 SD, 3 SD
d) 50%, 75%, 100% for 1 SD, 2 SD, 3 SD
Answer: a) 68.3%, 95.4%, 99.7% for 1 SD, 2 SD, 3 SD
Explanation: This is the exact empirical rule sequence and must be memorized cold for exams. It is one of the most frequently tested statistical facts in medicine.
Summary for quick exam revision
The normal distribution, also called the Gaussian or bell-shaped distribution, is a symmetric distribution in which the mean, median, and mode are equal. Its most important exam rule is the empirical rule: about 68.3% of values lie within 1 standard deviation of the mean, 95.4% lie within 2 standard deviations, and 99.7% lie within 3 standard deviations. Therefore, when asked what proportion lies within 3 standard deviations, the answer is 99.7%. Standard deviation is a measure of spread around the mean and is calculated as the square root of the variance. A small standard deviation means values are tightly clustered, while a large one means the data are more spread out. In medicine, this helps interpret biological variation, laboratory distributions, and statistical abnormality. Values lying more than 2 or 3 standard deviations from the mean are progressively less common. Another key fact is that about 95% of values lie within mean plus or minus 1.96 standard deviations, which is closely tied to the 95% confidence interval. Distractor answers often use 68.3% and 95.4%, so you must match them correctly to 1 SD and 2 SD rather than 3 SD. For rapid recall, memorize the sequence: 68, 95, 99.7.