Expert reviewed • 08 January 2025 • 13 minute read
Quartiles in the HSC mathematics advanced course are values that divide a data set into four equal parts, each representing a quarter (25%) of the data points of a sample. They are key measures in statistics that provide insight into the distribution and spread of the data. The quartiles of a dataset are defined below:
When determining the quartiles of a dataset, we must first establish if the dataset has an even or odd number of values. If there is an even number or scores, we must follow the listed steps below:
To find the quartiles of a data set with an odd number of scores, we must adhere to the following steps:
Determine the quartiles of the following dataset.
As we can see from the data set, the middle score is simply as there is an odd number of scores. By omitting this value, we are given two lists on either side of the median.
To find we must find the median of the list . From this, we see that the value is .
To find we must find the median of the list . From this, we see that the value is .
The interquartile range or measures the spread of the middle 50% of the data. Determining the is useful, as it is less affected by outliers than the range. It is calculated as the difference between the third and first quartiles.
Outliers are data points that significantly differ from other observations in a dataset. They are unusual values that can be much higher or lower than the rest of the data. We can determine outliers using the quartiles. A score of a dataset is considered an outlier when the following occurs:
Determine any outliers contained in the following data set. The quartiles of the data set are as follows: , ,
Before we find an outliers, we must first determine the interquartile range.
To find any outliers in the lower part of the data set, we must use the formula given above to determine the lower bound.
As there are no values less than in the dataset, there are no outliers in the bottom half of the dataset.
To find any outliers in the part of the data set with larger values, we must use the formula given above to determine the upper bound.
As we can see in the dataset, there are two scores larger than a value of . Thus, the scores of and are outliers.
A box and whisker plot is a graphical representation used to display the distribution of a data set. It provides a visual summary of the quartiles of a dataset, highlighting the median, spread, and potential outliers. A box and whisker plot looks like the following:
As seen in the graph, the box represents the scores that lie within the interquartile range. The lines within the box represent the quartiles of the data. The left edge of the box is the first quartile . The middle line of the box is the second quartile . The right edge of the box is the third quartile . The whiskers (the lines leading to the dots) indicate the range in which points of the data set are not considered outliers. The lower whisker extends to the smallest data point within 1.5 times the interquartile range below the first quartile. The upper whisker extends to the largest data point within 1.5 times the interquartile range above the third quartile. Any points located outside the whiskers are generally considered outliers.