When you hear the word spread, you probably think of food, where you try to spread something like jam across each inch of your bread. This could be of great importance to you, some may prefer a thin, and some may prefer a thick. This is also important to researchers. Researchers use the measures of dispersion to identify the spread of their data and if it meets psychology's preference. You'll understand this better as you read on.
Explore our app and discover over 50 million learning materials for free.
Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken
Jetzt kostenlos anmeldenNie wieder prokastinieren mit unseren Lernerinnerungen.
Jetzt kostenlos anmeldenWhen you hear the word spread, you probably think of food, where you try to spread something like jam across each inch of your bread. This could be of great importance to you, some may prefer a thin, and some may prefer a thick. This is also important to researchers. Researchers use the measures of dispersion to identify the spread of their data and if it meets psychology's preference. You'll understand this better as you read on.
The measure of dispersion is the measure of the spread of scores in a data set. It is the extent to which the values vary around the central or average value. Now let's take a look at an example.
Imagine that you are a first-year university student, and a friend asks you about the ages of people in your psychology course. You'll say: 'Well, most people are 18, a few in their 20s and two or three over 40.'
In this example, the dispersion of the age groups of people in the course is described, as the scenario described the variations/ dispersion of age groups. I.e. how much they varied from the average age of 18 (a few in their 20s, two or three over 40.)
The values in a low dispersion data set do not have much variation, e.g., 20, 22, 23, 24, 25, 27, 28. In a high dispersion data set, there is a lot of variation, e.g., 9, 10, 14, 26, 35, 37, 39. Researchers aim to gather data that has low rather than high variation.
Measures of dispersion are necessary because if we don't know the dispersion, a mean value can be misleading.
Suppose there are two companies, and analysts compared their employees' wages. The average wage might be the same, but the variation or dispersion of the wages might be very different.
In Company A, all workers get a similar amount of wages. However, Company B has a large variation between the lowest-paid and the highest-paid employees.
Additionally, from the measures of dispersion, it is easier to understand if there are many outliers. If multiple figures in a dataset are largely varied from the average, then in some cases, this can be an issue. In the instance of research testing the effectiveness of interventions, if there is a lot of variation in participants' results, it suggests the intervention may not be effective.
The example highlights the practicality of the measures of dispersion and how it can help a researcher understand more about their findings.
The range is the easiest way to calculate dispersion. The range is calculated by subtracting the lowest number from the highest number in a data set.
If the highest value in a dataset is 50 and the lowest value is 12. Then the range would be calculated by completing the calculation 50 - 12. Therefore, it is 38.
The advantage of calculating the range is that the calculation accounts for extreme outliers, and is extremely easy to calculate.
However, it does come with disadvantages, such as the inclusion of extreme scores can cause researchers to establish a distorted measure of dispersion. Additionally, the range does not tell us much information about the dispersion of values between the highest and lowest scores.
The standard deviation (SD) is normally used when the mean is the measure of central tendency. The SD is a measure that calculates the distance of the individual scores from the mean of the dataset.
Large SD: the scores are widely spread above and below the mean. It indicates the mean is not representative of the data set.
Small SD: the mean is a good representation of the scores in the data set.
Normally, statistics programs can calculate the SD, but it is good to see the maths and understand how the SD is calculated; this is the formula for calculating SD:
SD = standard deviation
∑ = sum of
X = each value in the data set
x̅ = the mean
n = number of values in the sample
Let's take a look and simplify how the standard deviation can be calculated.
Find the mean of the data set (x̅).
Subtract the mean from each value in the data set; this is the deviation from the mean (x - x̅).
Square each deviation.
Find the sum of the squared deviations (∑).
Divide this number by n-1 (the total number of values in the data set minus 1).
Find the square root of this number.
Let us try this with a data set. Suppose we have a data set of 48, 71, 34, 62, 54, and 43.
Find the mean: x̅ = (48 + 71 + 34 + 62 + 54 + 43) ÷ 6 = 52
Subtract the mean from each value in the data set:
47-52 = -5
70-52 = 18
33-52 = -19
61-52 = 9
53-52 = 1
42-52 = -10
Square each deviation: (-5) ² = 25, 18² = 324, (-19) ² = 361, 9² = 81, 1² = 1, (-10) ² = 100
Find the sum of the squared deviations: 25 + 324 + 361 + 81 + 1 + 100 = 892
Divide this number by n-1: 892 / 6-1 = 892/5 = 178.4
Find the square root of this number: √178.4 = 13.36
Thus the SD is 13.36.
For A-Level psychology, you won't be asked to calculate the SD. However, you might be asked to interpret and explain the SD for a data set.
The advantages of calculating the standard deviation are that the SD can be used to make estimations regarding the population. And the SD is the most sensitive measure of dispersion as all values in the data set are considered. Therefore, the researcher can get a more accurate representation of the dataset's measure of dispersion compared to the range.
However, the SD value can be easily distorted by extreme outliers, and when calculated manually, it is not always easy, especially on a large dataset.
We have mentioned the mean, but what happens when we can't measure the mean? Research that collects ordinal data usually uses the median to calculate a dataset's centre point/ average.
First, let's recap on what ordinal data is.
Ordinal data is categorical, meaning there is an order, but we don't know the exact distance/ differences between each category.
Let's consider socioeconomic status to help you understand the concept of ordinal data. An example of ordinal data is a questionnaire asking whether respondents are from the working, middle or upper class. We know that a person from the upper class has higher status and money than the working class, but we can't tell by how much.
The mean can only be established in interval and ratio data as we can identify the numerical differences between responses. Therefore, either the range or standard deviation can be calculated.
However, the mean cannot be established from ordinal data. Therefore, the range is usually used to calculate the measure of dispersion in the dataset.
Measures of central tendency and dispersion both tell us vital information about a data set. Central tendency is a data set's average or central value, and dispersion is how the values in a data set vary around the average (central tendency) value.
Three measures of dispersion examples include the range, interquartile range and standard deviation.
The range is calculated by subtracting the lowest number from the highest number in a data set.
And the standard deviation is calculated using the formula:
The standard deviation is the most sensitive measure of dispersion as all values in the data set are taken into account. Unlike the range which only considers the largest and smallest value.
The measure of dispersion is the measure of the spread of scores in a data set. It is the extent to which the values vary around the central or average value.
To understand how dispersed data is, we should note in a low dispersion data set, and the values do not have much variation. In a high dispersion data set, there is a lot of variation. The dispersion of data can be calculated using the range and standard deviation.
What are measures of dispersion?
The measure of dispersion is the measure of the spread of scores in a data set. It is the extent to which the values vary around the central or average value.
Why are measures of dispersion important?
If we don’t know the dispersion, a mean value can be misleading. E.g., two datasets have the same mean, but there is a large difference in the datasets' variation of values.
How do you calculate the range?
The range is the difference between the highest and lowest values of a data set. For example, if the highest value is 50, and the lowest value is 12, the range would be 50-12 = 38.
What are the advantages of using the range?
We are able to include extreme values (outliers) when calculating the range.
It is easy to calculate
What are the disadvantages of using the range?
As extreme scores are included, the range could be distorted.
The range does not tell us much information about the dispersion of values between the top and bottom scores.
It does not give information about whether the values are close to the mean or more spaced out.
What is the standard deviation a measure of?
The standard deviation is a measure of the mean distance of scores in a data set from the mean.
Already have an account? Log in
Open in AppThe first learning app that truly has everything you need to ace your exams in one place
Sign up to highlight and take notes. It’s 100% free.
Save explanations to your personalised space and access them anytime, anywhere!
Sign up with Email Sign up with AppleBy signing up, you agree to the Terms and Conditions and the Privacy Policy of StudySmarter.
Already have an account? Log in
Already have an account? Log in
The first learning app that truly has everything you need to ace your exams in one place
Already have an account? Log in