AP Syllabus focus:
‘Understanding that a statistic provides a numerical summary of sample data.
- Learning to calculate the mean (x-bar) as the sum of all data values divided by the number of values, showing the formula x−bar=n1∑i=1nxi, where xi represents the i-th data point and n represents the total number of data points.
- Defining the median as the middle value of an ordered dataset and explaining the calculation method for even and odd number of data points.
- Skill 2.C: Mastery in calculating and interpreting the mean and median for a set of quantitative data.’
Measures of center provide essential insight into where quantitative data tend to cluster. Understanding how to calculate and interpret these statistics helps reveal meaningful patterns within a distribution.
Understanding Measures of Center
A measure of center describes a typical or central value within a dataset. In AP Statistics, the two primary measures of center are the mean and the median, each offering a different perspective on the distribution. A statistic is a numerical summary of sample data, and measures of center play a critical role in interpreting the dataset’s overarching behavior.
Because center describes the “typical” value, it is fundamental for characterizing data distributions and comparing across groups. The syllabus emphasizes competence in calculating and interpreting the mean and median, ensuring students can apply these measures in varied data contexts.
The Mean
The mean is the arithmetic average of a set of quantitative data. When introduced, the term requires a precise statistical definition.
Mean: The arithmetic average found by dividing the sum of all data values by the number of observations in the dataset.
The mean uses every data value in its calculation, making it highly informative but also sensitive to extreme values. This sensitivity means the mean may shift considerably when unusually large or small data points are present. Understanding this behavior is crucial for interpreting results and selecting appropriate summary statistics.
The AP specification requires students to recognize and use the notation , pronounced “x-bar,” for the sample mean. Students must also understand the meaning of the formula components, including individual data values and the total number of observations.
EQUATION
= Sample mean
= Number of data values
= The th data value
Because the mean incorporates all values, it provides a powerful numerical summary by balancing the dataset. The balancing interpretation highlights that the mean is the point at which the dataset would balance if each value were placed on a number line as a weight. This conceptualization helps students understand the mean’s behavior and its role in comparing distributions.
The Median
The median is the middle value of an ordered dataset and represents another essential perspective on center.
Median: The middle value in an ordered dataset; if the number of observations is even, it is the average of the two middle values.
To calculate the median, students must know how to properly order the data and determine the location of the central point. This process varies depending on whether the dataset has an odd or even number of observations.

A visual comparison of how the median is identified in datasets with odd versus even numbers of observations, highlighting the different procedures for locating the central value. Source.
After ordering the data, the median’s position identifies the value that divides the dataset into two equal halves. This characteristic makes the median especially useful in distributions with strong skewness or notable outliers, where the mean may fail to represent a “typical” value appropriately.
Comparing Mean and Median
Understanding when the mean or the median provides a more meaningful description is essential for statistical reasoning. Because the mean is influenced by every value and the median is resistant to extremes, the two measures can differ significantly in certain distributions.

A skewed distribution showing how the mean lies toward the long tail, while the median remains between the mode and the mean, illustrating their differing sensitivity to extreme values. Source.
A key instructional focus is recognizing that both measures summarize center but emphasize different aspects of the distribution’s structure. Students should be able to articulate why the two measures might diverge and how each reflects the underlying data.
Interpreting Measures of Center in Context
AP Statistics places strong emphasis on contextual interpretation. Measures of center are not meaningful in isolation; they must be explained in terms of the variable and population under study. Students must learn to phrase interpretations clearly, using the context to describe what the mean or median suggests about a “typical” value in the dataset.
When interpreting:
Refer directly to the variable (e.g., “typical response time,” “typical weight”).
Use units consistently and accurately.
Understand how variability and unusual data points affect these measures.
Because center reflects the central tendency rather than an exact value that must occur in the data, students must communicate that the mean and median describe general tendencies, not specific, guaranteed outcomes.
Processes for Calculating Measures of Center
To promote accuracy and consistency, students should follow structured steps:
Process for Calculating the Mean
Identify all data values in the sample.
Compute the sum of these values.
Count the number of observations.
Divide the total sum by this count to obtain .
Interpret the result in context using appropriate units.
Process for Determining the Median
Arrange the data in ascending order.
Determine whether the number of observations is odd or even.
Locate the middle position using the appropriate method.
State the median value clearly.
Interpret the result within the context of the dataset.
FAQ
The mean incorporates every value in the data set, so any change to one observation directly alters the overall sum and therefore the calculated average.
By contrast, the median depends only on the relative ordering of values. Unless a change is large enough to shift an observation across the middle position, the median remains unchanged.
Large extreme values therefore have a proportionally greater effect on the mean than on the median.
In a skewed distribution, the mean can be pulled towards the tail, placing it away from where most observations lie.
This means the mean may not accurately reflect the central cluster of data, even though it remains mathematically valid.
Interpreting it as ‘typical’ requires awareness of skewness and any extreme values that distort its location.
Comparing these two measures helps identify the presence and direction of skewness.
A large difference suggests that the distribution is not symmetric and may contain outliers.
This comparison is helpful as a diagnostic tool before selecting the most appropriate summary statistic for centre.
Yes. When the data set contains an even number of observations, the median is the average of the two central values.
This averaged value might not match any actual observation in the data.
This is acceptable, as the median represents the midpoint of the ordered distribution rather than a value that must occur in the set.
The median is defined based on the position of values within an ordered list. Without ordering, the identified ‘middle’ value is meaningless.
Failing to sort the data leads to incorrect medians and can misrepresent the centre of the distribution.
Correct ordering is therefore a crucial procedural step in any median calculation.
Practice Questions
Question 1 (1–3 marks)
A data set contains the following five values representing the number of hours students studied for a test:
4, 7, 9, 10, 15
(a) State the median of this data set.
(b) Explain why the median is an appropriate measure of centre for this particular set of values.
Question 1
(a) Median = 9
• 1 mark for correct identification of the median.
(b) Explanation
• 1 mark for noting that the data include a large value or potential outlier (15).
• 1 mark for stating that the median is resistant to extreme values or is less affected by outliers.
Question 2 (4–6 marks)
A researcher records the daily number of customers visiting a small café over a seven-day period. The numbers of customers recorded are:
32, 28, 30, 120, 35, 29, 31
(a) Calculate the mean number of customers per day.
(b) Calculate the median number of customers per day.
(c) Comment on the most appropriate measure of centre for this data set, justifying your answer with reference to the values given.
Question 2
(a) Mean calculation
• 1 mark for correct summation (32 + 28 + 30 + 120 + 35 + 29 + 31 = 305).
• 1 mark for correct division by 7.
• 1 mark for correct final mean (approximately 43.6).
(b) Median calculation
• 1 mark for correctly ordering the data.
• 1 mark for identifying the central value (31).
(c) Interpretation
• 1 mark for recognising 120 as an outlier or unusually large value.
• 1 mark for correctly stating that the mean is affected by this value and thus inflated.
• 1 mark for concluding that the median is the more appropriate measure of centre due to its resistance to extreme values.
