WEBVTT
00:00:00.480 --> 00:00:10.560
In this video, weโre going to learn how to calculate the expected value or the mean of discrete random variables from both their table and a graph.
00:00:11.560 --> 00:00:17.120
Letโs begin though by recalling what we mean by a discrete random variable of a probability distribution.
00:00:17.640 --> 00:00:24.640
A probability distribution describes the likelihood of obtaining the possible values that a random variable can assume.
00:00:25.160 --> 00:00:30.880
This can be given as a function, a table of values, or even in graph form.
00:00:31.640 --> 00:00:38.800
Then a discrete variable is a variable which can only take a countable number of values.
00:00:39.600 --> 00:00:49.480
In this example, ๐ฅ is a discrete variable as it only takes the values one, two, three, four, five, and six.
00:00:50.200 --> 00:01:01.440
In this video, weโre interested in finding a formula that can help us to find the expected value, denoted as ๐ธ of ๐ฅ, in other words to find the mean of a discrete random variable.
00:01:02.080 --> 00:01:05.680
To help us develop this formula, weโre going to look at an example.
00:01:06.640 --> 00:01:13.480
An experiment produces the discrete random variable ๐ that has the probability distribution shown.
00:01:14.040 --> 00:01:20.200
If a very high number of trials were carried out, what would be the likely mean of all the outcomes?
00:01:21.320 --> 00:01:29.320
Letโs imagine the experiment is spinning a spinner with the numbers two, three, four, and five on it.
00:01:29.960 --> 00:01:35.440
The table tells us the probability of achieving each score on a single spin.
00:01:35.800 --> 00:01:44.000
And so we see itโs much more likely, for instance, that the spinner lands on the five than it does on the two.
00:01:44.720 --> 00:01:53.680
Letโs add another table, showing the number of times we spin the spinner versus the number of times we expect it to land on each number.
00:01:54.760 --> 00:02:05.520
Letโs say we were to spin the spinner 10 times; 0.1 of those times we would expect the spinner to land on two.
00:02:06.080 --> 00:02:13.400
Well, 0.1 of 10 โ in other words 0.1 times 10 โ is one.
00:02:14.280 --> 00:02:20.280
Then 0.3 of the times, we would expect the spinner to land on three.
00:02:20.520 --> 00:02:25.800
0.3 of 10 or 0.3 times 10 is three.
00:02:26.400 --> 00:02:32.960
0.2 of the times, we expect the spinner to land on four, so thatโs twice.
00:02:33.480 --> 00:02:40.200
And 0.4 of the times, thatโs four times, we would expect the spinner to land on five.
00:02:40.800 --> 00:02:45.600
Next, letโs think about what would happen if we were to spin it 20 times.
00:02:46.280 --> 00:02:52.480
0.1 of those times, we would expect it to land on two, so thatโs twice.
00:02:53.160 --> 00:03:00.480
Six times we expect the spinner to land on three, thatโs 0.3 times 20.
00:03:01.200 --> 00:03:07.640
Four times, which is 0.2 times 20, weโd expect it to land on four.
00:03:08.280 --> 00:03:15.120
And 0.4 times 20 which is eight would be the number of times weโd expect it to land on five.
00:03:15.720 --> 00:03:22.040
But letโs imagine there are a very high number of trials, say 1000 trials.
00:03:22.800 --> 00:03:24.920
0.1 times 1000 is 100.
00:03:25.440 --> 00:03:30.400
So weโd roughly expect the spinner to land on two 100 times.
00:03:31.040 --> 00:03:41.840
Weโd expect it to land on three 300 times, weโd expect it to land on four 200 times, and weโd expect it to land on five 400 times.
00:03:42.560 --> 00:03:51.840
Now, this is really useful as we can use these values to calculate the mean by using the rules for finding the mean from a frequency table.
00:03:52.440 --> 00:03:59.960
The formula we use to calculate the mean from a frequency table is the sum of ๐ times ๐ฅ divided by the sum of ๐ฅ.
00:04:00.640 --> 00:04:02.480
The sum of ๐ฅ is 1000.
00:04:02.960 --> 00:04:04.840
Weโve carried out 1000 trials.
00:04:05.200 --> 00:04:14.040
๐ times ๐ฅ will be two times 100, three times 300, four times 200, and five times 400.
00:04:14.040 --> 00:04:17.520
And so the sum of ๐๐ฅ is the sum of all these products.
00:04:17.880 --> 00:04:24.480
And so weโre able to calculate the mean, which we call the expected value ๐ธ of ๐ฅ, as shown.
00:04:25.120 --> 00:04:30.760
Now we will calculate this value in a moment, but weโre looking to find a rule for finding the expected value.
00:04:31.320 --> 00:04:33.080
So weโre going to split the fraction up a little bit.
00:04:33.720 --> 00:04:44.480
By reversing the process we perform when adding fractions, we can write it as two times 100 over 1000 plus three times 300 over 1000 and so on.
00:04:45.040 --> 00:04:46.920
And then we notice something.
00:04:47.320 --> 00:04:49.880
100 divided by 1000 is 0.1.
00:04:50.200 --> 00:04:53.360
300 divided by 1000 is 0.3.
00:04:53.640 --> 00:04:56.080
200 divided by 1000 is 0.2.
00:04:56.440 --> 00:04:59.840
And 400 divided by 1000 is 0.4.
00:05:00.600 --> 00:05:17.280
And so another way to write our calculation for the mean is as two times 0.1 plus three times 0.3 plus four times 0.2 plus five times 0.4, and thatโs equal to 3.9.
00:05:17.840 --> 00:05:26.880
And so, if a very high number of trials were carried out, the likely mean of all of our outcomes would actually be 3.9.
00:05:26.880 --> 00:05:31.040
And by this stage, you might be spotting a pattern.
00:05:31.520 --> 00:05:37.360
Two times 0.1 is the product of ๐ฅ and its corresponding ๐ of ๐ฅ.
00:05:37.960 --> 00:05:44.760
Three times 0.3 is also the product of ๐ฅ and its corresponding ๐ of ๐ฅ.
00:05:45.240 --> 00:05:51.840
And so we notice that this is simply the sum of the product of the numbers in each column.
00:05:52.960 --> 00:05:54.680
And so weโre able to generalize.
00:05:55.480 --> 00:06:00.160
The expected value, sometimes called the mean of ๐ฅ, is denoted ๐ธ of ๐ฅ or ๐ or ๐ sub ๐ฅ.
00:06:00.680 --> 00:06:09.480
It can be found by calculating the sum of the product of the variable ๐ฅ and the probability of that variable occurring, ๐ of ๐ฅ equals ๐ฅ.
00:06:10.160 --> 00:06:11.800
We write this as shown.
00:06:12.160 --> 00:06:16.840
๐ธ of ๐ฅ is the sum of ๐ฅ times ๐ of ๐ฅ equals ๐ฅ.
00:06:17.520 --> 00:06:28.280
Now that we have a formula, letโs see how we can apply this to finding the expected value of a discrete random variable given a graph.
00:06:29.480 --> 00:06:34.960
Work out the expected value of the random variable ๐ whose probability distribution is shown.
00:06:35.960 --> 00:06:42.400
The expected value denoted ๐ธ of ๐ฅ can be found by calculating the sum of the product of the variable ๐ and the probability of that variable occurring.
00:06:42.920 --> 00:06:45.000
Thatโs represented as shown.
00:06:45.640 --> 00:06:53.560
And so a nice way to work out the expected value when given a graph is actually to construct a table.
00:06:54.240 --> 00:07:02.880
We see by looking at the ๐ฅ-axis that the random variable ๐ can take the values one, two, three, four, and five.
00:07:03.440 --> 00:07:10.480
We also see that every single one of the bars in our diagram has a height of 0.2.
00:07:10.800 --> 00:07:16.240
So the associated probability for each variable is in fact 0.2.
00:07:16.240 --> 00:07:25.200
Now, a quick way that we can check whether what weโve done is likely to be correct is to check that the sum of the probabilities is indeed one.
00:07:26.080 --> 00:07:36.280
And 0.2 plus 0.2 plus 0.2 plus 0.2 plus 0.2 is one, and so we can move on.
00:07:36.720 --> 00:07:42.360
To find the expected value then, we find the sum of the products of the numbers in each column.
00:07:42.920 --> 00:07:51.560
So thatโs one times 0.2 plus two times 0.2 plus three times 0.2 plus four times 0.2.
00:07:52.160 --> 00:07:55.000
And finally, we add five times 0.2.
00:07:55.640 --> 00:08:08.680
Evaluating each of our products, and we get 0.2 plus 0.4 plus 0.6 plus 0.8 plus one, which is equal to three.
00:08:09.240 --> 00:08:12.200
And so the expected value ๐ธ of ๐ฅ is equal to three.
00:08:12.880 --> 00:08:14.880
Now, in fact, this makes a lot of sense.
00:08:15.480 --> 00:08:21.160
We saw that the probability of each variable occurring was equal; it was 0.2 every time.
00:08:21.680 --> 00:08:28.960
And so the expected value and the likely mean would actually be the mean of all of our numbers.
00:08:29.480 --> 00:08:36.680
Thatโs five plus four plus three plus two plus one divided by five, which is also equal to three.
00:08:37.720 --> 00:08:41.680
Now, this only worked because the probabilities were equal.
00:08:42.280 --> 00:08:45.560
It wouldnโt be a general rule that we could follow.
00:08:46.200 --> 00:08:52.280
Letโs look at an example where we have a graph and the probabilities are not equal.
00:08:53.040 --> 00:08:58.400
Work out the expected value of the random variable ๐ whose probability distribution is shown.
00:08:59.360 --> 00:09:06.600
The formula we use to calculate the expected value of a discrete random variable ๐ is shown.
00:09:07.360 --> 00:09:10.480
Itโs the sum of the product of ๐ and the probability that ๐ฅ occurs.
00:09:11.160 --> 00:09:21.640
And so a nice way to calculate the expected value when given a probability distribution in graph form is to transfer that into a table.
00:09:22.320 --> 00:09:28.880
The ๐ฅ-axis on our graph tells us the values our discrete random variable can take.
00:09:29.480 --> 00:09:32.280
They are one, two, three, and four.
00:09:32.840 --> 00:09:35.160
The first bar then has a height of 0.1.
00:09:35.640 --> 00:09:39.680
So the probability that ๐ is equal to one is 0.1.
00:09:40.240 --> 00:09:44.040
We see that our second bar has a height of 0.3.
00:09:44.040 --> 00:09:48.240
So the probability that ๐ is equal to two is 0.3.
00:09:48.800 --> 00:09:50.840
And then we continue in this manner.
00:09:51.280 --> 00:09:56.760
The height of our third bar, and thatโs the probability that ๐ is equal to three, is 0.4.
00:09:57.360 --> 00:10:03.240
And the height of our fourth bar, which tells us the probability that ๐ is equal to four, is 0.2.
00:10:03.760 --> 00:10:10.760
To find the expected value from our graph then, we need to find the sum of the products of the numbers in each column.
00:10:11.760 --> 00:10:20.480
So thatโs one times 0.1 plus two times 0.3 plus three times 0.4 plus four times 0.2.
00:10:21.200 --> 00:10:31.280
This becomes 0.1 plus 0.6 plus 1.2 plus 0.8, which is equal to 2.7.
00:10:32.080 --> 00:10:37.960
And so the expected value of the random variable ๐ is 2.7.
00:10:38.680 --> 00:10:46.240
Now, we can always check whether our answer is likely to be correct, or at least in the right range.
00:10:46.920 --> 00:10:50.880
When weโre finding the expected value, weโre finding the weighted mean.
00:10:51.240 --> 00:10:57.840
According to our table and our graph, itโs much more likely that ๐ฅ is equal to three than it is to one.
00:10:58.520 --> 00:11:01.520
The mean is more likely to be weighted then in this direction.
00:11:01.960 --> 00:11:12.360
And since 2.7 is roughly between one and four, though not exactly, we know that weโre likely to have performed the calculations correctly.
00:11:13.440 --> 00:11:23.760
In our next example, weโll look at how we can apply some of the other rules for working with probabilities to find the expected value of a discrete random variable.
00:11:24.560 --> 00:11:30.560
The function in the given table is the probability function of a discrete random variable ๐.
00:11:31.240 --> 00:11:33.360
Find the expected value of ๐.
00:11:34.080 --> 00:11:43.880
We know that we can find the expected value of a discrete random variable by calculating the sum of the products of the variable ๐ and the probability of that variable occurring.
00:11:44.520 --> 00:11:47.760
And we write it using this ๐ด notation as shown.
00:11:48.440 --> 00:11:51.760
Now in this case, ๐ is a probability function.
00:11:52.240 --> 00:11:58.040
So we can say that this is like saying the probability that ๐ฅ is equal to ๐ฅ sub ๐.
00:11:58.480 --> 00:12:04.800
And so to find the expected value, weโre going to begin by finding the products of the numbers in each column.
00:12:05.680 --> 00:12:08.040
But of course, there is a number missing.
00:12:08.440 --> 00:12:10.000
And thatโs this value here.
00:12:10.600 --> 00:12:14.600
Weโre told that the probability that ๐ฅ is equal to one is ๐.
00:12:15.120 --> 00:12:17.480
So how do we calculate ๐?
00:12:18.160 --> 00:12:28.200
Well, we know that the sum of the probabilities in our table must be equal to one, and so we can set up and solve an equation for ๐.
00:12:28.760 --> 00:12:36.800
Our equation is 0.1 plus ๐ plus 0.1 plus 0.4 plus 0.2 equals one.
00:12:37.560 --> 00:12:42.160
In other words, weโve added the respective probabilities and set it equal to one.
00:12:42.960 --> 00:12:50.320
0.1 plus 0.1 plus 0.4 plus 0.2 is 0.8.
00:12:50.960 --> 00:12:55.800
So our equation becomes ๐ plus 0.8 equals one.
00:12:56.480 --> 00:13:01.920
If we subtract 0.8 from both sides, we find ๐ is equal to 0.2.
00:13:02.440 --> 00:13:06.200
And so weโre ready to calculate the expected value of ๐.
00:13:06.720 --> 00:13:12.480
Itโs zero times 0.1 plus one times 0.2 plus two times 0.1.
00:13:13.080 --> 00:13:17.200
And we repeat this process with the numbers in our final two columns.
00:13:18.040 --> 00:13:28.800
This gives us zero plus 0.2 plus 0.2 plus 1.2 plus 0.8, which is equal to 2.4.
00:13:29.240 --> 00:13:34.040
The expected value of ๐ then in this case is 2.4.
00:13:35.000 --> 00:13:40.960
In our final example, weโre going to look at how to use the expected value formula to find missing values.
00:13:41.720 --> 00:13:47.560
The function in the given table is a probability function of a discrete random variable ๐ฅ.
00:13:48.320 --> 00:13:56.240
Given that the expected value of ๐ฅ is 254 over 57, find the value of ๐ต.
00:13:57.080 --> 00:14:01.560
And then we have a table with values for ๐ฅ sub ๐ and ๐ of ๐ฅ sub ๐.
00:14:02.160 --> 00:14:07.400
We begin by recalling how we calculate the expected value of a discrete random variable.
00:14:08.160 --> 00:14:13.640
Itโs the sum of the values that the variable can take multiplied by the probability of that variable occurring.
00:14:14.200 --> 00:14:20.600
Now, in this question, weโre told that the function is a probability function of the discrete random variable.
00:14:21.240 --> 00:14:27.760
So weโre essentially saying that ๐ of ๐ฅ sub ๐ is the same as the probability that ๐ฅ is equal to ๐ฅ sub ๐.
00:14:28.560 --> 00:14:32.680
And so eventually, weโre going to multiply the values in each of our columns.
00:14:33.480 --> 00:14:35.320
But we do have a bit of a problem.
00:14:35.760 --> 00:14:39.480
At the moment, our values for probability are in terms of a variable ๐.
00:14:40.080 --> 00:14:45.120
And so weโre going to use the fact that we know that the sum of these probabilities must be equal to one.
00:14:45.800 --> 00:14:53.440
In other words, eight ๐ plus three ๐ plus a third plus eight ๐ must be equal to one.
00:14:54.120 --> 00:14:59.440
Eight ๐ plus three ๐ plus eight ๐ is 19๐.
00:15:00.080 --> 00:15:05.040
So we get 19๐ plus a third equals one.
00:15:05.760 --> 00:15:13.480
We can solve for ๐ by first subtracting a third from both sides to get 19๐ equals two-thirds.
00:15:14.040 --> 00:15:20.560
And then when we divide both sides by 19, we get ๐ is equal to two over 57.
00:15:21.320 --> 00:15:28.240
So now that we know the value of ๐, we need to go back to our table and calculate the relevant probabilities.
00:15:28.920 --> 00:15:32.080
The probability that ๐ฅ is equal to one is eight ๐.
00:15:32.520 --> 00:15:39.960
So thatโs eight times two over 57, which is 16 over 57.
00:15:40.520 --> 00:15:43.960
Then the probability that ๐ฅ is equal to two is three ๐.
00:15:44.560 --> 00:15:51.360
So thatโs three times two over 57, which is six over 57.
00:15:52.080 --> 00:15:55.280
Weโre told the probability ๐ฅ is equal to ๐ต is a third.
00:15:55.760 --> 00:16:00.720
And finally, the probability that ๐ฅ is equal to seven is eight ๐ again.
00:16:01.320 --> 00:16:04.200
So thatโs 16 over 57.
00:16:04.640 --> 00:16:15.680
Now of course, a quick check we could do here would be to check that each of our probabilities does indeed sum to one, and it does, so we can move on.
00:16:16.520 --> 00:16:21.280
Our next job is to find the sum of the products of the numbers in each column.
00:16:21.680 --> 00:16:36.160
The expected value then is given by one times 16 over 57 plus two times six over 57 plus ๐ต times a third plus seven times 16 over 57.
00:16:36.760 --> 00:16:42.640
But actually, we were told the expected value is 254 over 57.
00:16:43.320 --> 00:16:46.600
So weโre going to replace ๐ธ of ๐ฅ with this number.
00:16:46.600 --> 00:16:48.640
And then we simplify the right-hand side.
00:16:49.280 --> 00:16:58.400
Now, all these fractions are making life a little bit awkward, so weโre going to multiply every single number in our equation by 57.
00:16:59.120 --> 00:17:08.840
When we do, our equation becomes 254 equals 16 plus 12 plus 19๐ต plus 112.
00:17:09.680 --> 00:17:14.080
19 is essentially 57 divided by three.
00:17:14.560 --> 00:17:20.800
And then, by adding the numerical parts, we get 140 plus 19๐ต on the right-hand side.
00:17:21.520 --> 00:17:29.680
Next, we subtract 140 from both sides, and our equation becomes 114 equals 19๐ต.
00:17:30.280 --> 00:17:33.360
Finally, we divide through by 19.
00:17:33.840 --> 00:17:41.480
And we find ๐ต is 114 divided by 19, but 114 divided by 19 is six.
00:17:41.840 --> 00:17:44.920
And so the value of ๐ต is six.
00:17:45.640 --> 00:17:49.160
Letโs now recap the key points from this lesson.
00:17:50.000 --> 00:17:53.960
In this video, we recapped what we meant by a discrete random variable.
00:17:54.320 --> 00:17:58.400
Itโs a variable that can only take a countable number of values.
00:17:59.320 --> 00:18:03.800
We saw that we denote the expected value, or sometimes called the mean of ๐ฅ, as ๐ธ of ๐ฅ or ๐ or ๐ sub ๐ฅ.
00:18:04.480 --> 00:18:10.560
And then the expected value is found by calculating the sum of the products of the variable ๐ฅ and the probability of that variable occurring.
00:18:11.120 --> 00:18:15.600
And we use the ๐ด notation to represent this as shown.