Course Notes For Unit 2 of The Udacity Course ST101 Introduction To Statistics
Contents
Probability
Conditional Probability
Bayes Rule
Programming Bayes Rule (optional)
Correlation vs. Causation
Answers
Probability
In this unit we will be talking about probability. In a sense, probability is just the opposite of statistics. Put differently, in statistics we are given data and try to infer possible causes, whereas in probability we are given a description of the causes and we try to predict the data.
The reason that we are studying probability rather than statistics is that it will give us a language to describe the relationship between data and the underlying causes.
Flipping coins
Flipping a coin creates data. Each flip of the coin will result in either a head or a tail result. A fair coin is one that has a 50% chance of coming up heads and a 50% chance of coming up tails. Probability is a method for describing the anticipated outcomes of these coin flips.
Fair Coin
The probability of a coin coming up heads is written using the notation P(Heads). For a fair coin, the chance of a flip coming up heads is 50%. In probability, this is given as a probability of 0.5:

P(Heads) = 0.5

A probability of 1 means that the outcome will always happen. A probability of 0 means that it will never happen. For a fair coin, the probability that a flip will come up tails is:

P(Tails) = 0.5

The sum of the probabilities of all possible outcomes is always 1, so:

P(Heads) + P(Tails) = 1
Loaded Coin
A loaded coin is one that comes up with one outcome much more frequently than the other.
Complementary Outcomes
If we know the probability of an outcome, A, then the probability of the opposite outcome, ¬A (not A), is given by:

P(¬A) = 1 - P(A)

This is a very basic law of probability.
Two Flips
So what happens when we flip the same, unbiased, coin twice? What is the probability of getting two heads in a row, assuming that P(H) = 0.5? We can derive the answer to this type of problem using a truth table. A truth table enumerates every possible outcome of the experiment. In this case:

Flip-1:       H     H     T     T
Flip-2:       H     T     H     T
Probability:  0.25  0.25  0.25  0.25
In the case of two coin flips, there are four possible outcomes, and because heads and tails are equally likely, each of the four outcomes is equally likely. Since the total probability must equal 1, the probability of each outcome is 0.25. Another way to consider this is that the probability that we will see a head, followed by another head is the product of the probabilities of the two events: P(H, H) = P(H) x P(H) = 0.5 x 0.5 = 0.25 So what happens if the coin is loaded? Well, if the probability of getting a head, P(H), is 0.6, then the probability that we will see a tail, P(T), is going to be 0.4, and the truth table will be:
Flip-1:       H     H     T     T
Flip-2:       H     T     H     T
Probability:  0.36  0.24  0.24  0.16

Notice that the total probability is still 1: 0.36 + 0.24 + 0.24 + 0.16 = 1. The truth table lists all possible outcomes, so the sum of the probabilities will always be 1.
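As a quick check, here is a minimal Python sketch (not part of the original notes) that enumerates this two-flip truth table for the loaded coin and confirms that the four probabilities come from the product rule and sum to 1:

# A minimal sketch: enumerate the truth table for two independent flips
# of the loaded coin described above (P(H) = 0.6, P(T) = 0.4).
from itertools import product

p = {'H': 0.6, 'T': 0.4}

# Each two-flip outcome has probability P(flip1) * P(flip2), by independence.
table = {(f1, f2): p[f1] * p[f2] for f1, f2 in product('HT', repeat=2)}

print(table[('H', 'H')])    # 0.36
print(sum(table.values()))  # 1.0 (up to floating-point rounding)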
One Head
The truth table can get more interesting when we ask different questions. Suppose we flip the coin twice, but what we care about is whether exactly one of the two flips comes up heads. For a fair coin, where P(H) = 0.5, the probability is:

P(Exactly one H) = 0.5

We can see this from the truth table: there are exactly two outcomes with exactly one head (HT and TH), each with probability 0.25, so the answer is 0.25 + 0.25 = 0.5.

Flip-1:       H     H     T     T
Flip-2:       H     T     H     T
Probability:  0.25  0.25  0.25  0.25
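A small sketch of the same calculation, assuming the fair coin above: enumerate the truth table and add up the probabilities of the outcomes containing exactly one head.

# Sum the probabilities of the two-flip outcomes with exactly one head.
from itertools import product

p = {'H': 0.5, 'T': 0.5}
outcomes = list(product('HT', repeat=2))

p_one_head = sum(p[f1] * p[f2] for f1, f2 in outcomes
                 if (f1, f2).count('H') == 1)
print(p_one_head)  # 0.5 (the outcomes HT and TH, each with probability 0.25)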
Doubles Quiz
Suppose we throw a fair die twice. What is the probability that we throw the same number on each throw (i.e. a double)?
Summary
In this section we learned that if we know the probability of an event, P(A), the probability of the opposite event is just 1 - P(A). We also learned about composite events, where the probability of n independent repetitions of the event is the product:

P(A) x P(A) x ... x P(A)

Technically, treating composite events this way assumes independence. This just means that the outcome of the second coin flip does not depend on the outcome of the first. In the next section we will look at dependence.
Conditional Probability
In real life, things depend on each other. For example, people can be born smart or dumb. For simplicity, let's assume that whether they're born smart or dumb is just nature's equivalent of the flip of a coin. Now, whether they become a Stanford professor is not entirely independent of their intelligence. In general, becoming a Stanford professor is not very likely; the probability may only be 0.0001, but it also depends on their intelligence. If they are born smart, the probability may be higher.
In the previous section, subsequent events like coin tosses were independent of what had happened before. We are now going to look at some more interesting cases where the outcome of the first event does have an impact on the probability of the outcome of the second.
Cancer Example
Let's suppose that there is a patient who may be suffering from cancer. Let's say that the probability of a person getting this cancer is 0.1:

P(Cancer) = 0.1
P(¬Cancer) = 0.9

Now, we don't know whether the person actually has cancer, but there is a blood test that we can give. The outcome of the test may be positive, or it may be negative, but like any good test, it tells us something about the thing we really care about, in this case whether or not the person has cancer. Let's say that the probability of a positive test when a person has cancer is 0.9:

P(Positive | Cancer) = 0.9
P(Negative | Cancer) = 0.1

The sum of the probabilities of the possible test outcomes, for a person with cancer, is always equal to 1. P(Positive | Cancer) is called the sensitivity of the test. This notation says that the result of the test depends on whether or not the person has cancer; this is known as a conditional probability. In order to fully specify the test, we also need to specify the probability of a positive test in the case of a person who doesn't have cancer. In this case, we will say that this is 0.2:

P(Positive | ¬Cancer) = 0.2
P(Negative | ¬Cancer) = 0.8

P(Negative | ¬Cancer) = 0.8 is the specificity of the test. We now have all the information we need to derive the truth table:

Cancer   Test       Probability
Y        Positive   0.1 x 0.9 = 0.09
Y        Negative   0.1 x 0.1 = 0.01
N        Positive   0.9 x 0.2 = 0.18
N        Negative   0.9 x 0.8 = 0.72
                    Total     = 1.0
We can now use the truth table to find the probability that we will see a positive test result:

P(Positive) = 0.09 + 0.18 = 0.27
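Here is a minimal Python sketch (not part of the original notes) that rebuilds the truth table above and recovers the same total probability of a positive test:

# Joint probabilities for the cancer example: P(C) = 0.1, sensitivity 0.9,
# false-positive rate P(+ | not C) = 0.2.
p_cancer = 0.1
p_pos_given_cancer = 0.9
p_pos_given_no_cancer = 0.2

joint = {
    ('Y', 'Positive'): p_cancer * p_pos_given_cancer,
    ('Y', 'Negative'): p_cancer * (1 - p_pos_given_cancer),
    ('N', 'Positive'): (1 - p_cancer) * p_pos_given_no_cancer,
    ('N', 'Negative'): (1 - p_cancer) * (1 - p_pos_given_no_cancer),
}

p_positive = joint[('Y', 'Positive')] + joint[('N', 'Positive')]
print(p_positive)  # 0.27 (up to rounding): the total probability of a positive test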
Total Probability
Let's put this into mathematical notation. We were given the probability of having cancer, P(C), from which we were able to derive the probability of not having cancer:

P(¬C) = 1 - P(C)

We also had the two conditional probabilities, P(+ | C) and P(+ | ¬C), and from these we were able to derive the probabilities of a negative test:

P(- | C) = 1 - P(+ | C)
P(- | ¬C) = 1 - P(+ | ¬C)

Then, the probability of a positive test result was:

P(+) = P(C) x P(+ | C) + P(¬C) x P(+ | ¬C)

This is known as total probability. Let's consider another example.
Two Coins
Imagine that we have a bag containing two coins. We know that coin 1 is fair and coin 2 is loaded, so that:

P1(H) = 0.5 and P1(T) = 0.5
P2(H) = 0.9 and P2(T) = 0.1

We now pick a coin from the bag; each coin has an equal probability of being picked. We then flip the coin once.
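By total probability, the chance that this single flip comes up heads combines the two coins, weighted by the chance of picking each. The sketch below is my own illustration of that sum, not taken from the notes:

# Total probability of heads when a coin is picked at random and flipped once:
# P(H) = P(coin1) * P1(H) + P(coin2) * P2(H)
p_coin1, p_coin2 = 0.5, 0.5
p1_heads, p2_heads = 0.5, 0.9

p_heads = p_coin1 * p1_heads + p_coin2 * p2_heads
print(p_heads)  # 0.7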
Bayes Rule
In this section, we introduce what may be the Holy Grail of probabilistic inference: Bayes Rule. The rule is based on work by Reverend Thomas Bayes, who used the principle to infer the existence of God. In doing so, he created a new family of methods that have vastly influenced artificial intelligence and statistics. Let's think about the cancer example from the previous section. Say that there is a specific cancer that occurs in 1% of the population. There is a test for this cancer that has a 90% chance of a positive result if the person has cancer. The specificity of the test is 90%, i.e. there is a 90% chance of a negative test result if the person doesn't have cancer:

P(C) = 0.01
P(+ | C) = 0.9
P(- | ¬C) = 0.9

So, here is the question. What is the probability that a person has cancer, given that they have had a positive test?
Only 1% of the people have this cancer; 99% are cancer-free. The test catches 90% of those who have the cancer, which is 90% of the cancer circle. But the test can also give a positive result when the person doesn't have cancer: in our case a false positive occurs in 10% of the cancer-free cases. The remaining area represents people who don't have the cancer and get a negative result from the test. In fact, the region of people who have cancer and test positive is only about 8.3% of the total area representing a positive test result. So a positive test has only raised the probability that the person has cancer by a factor of about 8. This is the basis of Bayes Rule. We start with some prior probability before we run the test, and then we get some evidence from the test itself, which leads us to what is known as a posterior probability.
In our example, we have the prior probability, P(C), and we obtain the posterior probabilities as follows. First we calculate what are known as the joint probabilities:

P(C, pos) = P(C) x P(pos | C)
P(¬C, pos) = P(¬C) x P(pos | ¬C)

Given the values in our example we get:

P(C, pos) = 0.01 x 0.9 = 0.009
P(¬C, pos) = 0.99 x 0.1 = 0.099

These values are non-normalised: they do not sum to 1. In terms of our diagram above, they are the absolute areas of the regions representing a positive test.
We obtain the posterior probabilities by normalising the joint probabilities. To do this, we divide each of the joint probabilities by the probability of a positive test result:

P(pos) = P(C, pos) + P(¬C, pos)

So the posterior probabilities are:

P(C | pos) = P(C) x P(pos | C) / P(pos)
P(¬C | pos) = P(¬C) x P(pos | ¬C) / P(pos)
Let's say we get a positive test. We have a prior probability, and a test with a given sensitivity and specificity:

Prior: P(C)
Sensitivity: P(pos | C)
Specificity: P(neg | ¬C)

We multiply the prior P(C) by the sensitivity, and the prior P(¬C) by P(pos | ¬C), which is 1 minus the specificity. This gives us, for each of the cases (cancer or non-cancer), a number that combines the cancer hypothesis with the test result. We add these numbers (normally, they do not add up to 1) to get the total probability of a positive test. Now all we need to do to obtain the posterior probabilities is to normalise the two numbers by dividing each by the total probability, P(pos).
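The sketch below is my own illustration of the algorithm just described (not taken from the course materials), packaged as a small Python function and applied to the example at the start of this section:

# Bayes Rule for a positive test result, with two hypotheses (cancer / no cancer).
def bayes_positive(prior, sensitivity, specificity):
    """Return (P(C | pos), P(not C | pos)) after a positive test."""
    joint_c = prior * sensitivity                   # P(C, pos)
    joint_not_c = (1 - prior) * (1 - specificity)   # P(not C, pos)
    normaliser = joint_c + joint_not_c              # P(pos)
    return joint_c / normaliser, joint_not_c / normaliser

# Example from the start of the section: P(C) = 0.01, sensitivity 0.9,
# specificity 0.9, so P(pos | not C) = 0.1.
print(bayes_positive(0.01, 0.9, 0.9))  # approximately (0.083, 0.917)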
This is our algorithm for Bayes Rule, and we can follow an exactly analogous procedure for a negative test result, as shown below.
Let's work through an example. We start with our prior probability, sensitivity and specificity:

P(C) = 0.01
P(pos | C) = 0.9
P(neg | ¬C) = 0.9
We need the two joint probabilities:

P(C, neg) = P(C) x P(neg | C)        (the combined probability of having cancer and getting a negative test result)
P(¬C, neg) = P(¬C) x P(neg | ¬C)     (the combined probability of being cancer-free and getting a negative test result)
Normaliser Quiz
Calculate the normaliser, P(neg)
What is remarkable about the result is what the posterior probabilities actually mean. Before the test, we had a 1% chance of having cancer. After a negative test result this has gone down by about a factor of 9. Conversely, before the test there was a 99% chance that we were cancer-free. That number has now gone up to 99.89%, greatly increasing our confidence that we are cancer-free. Let's consider another example. In this case the prior probability, sensitivity and specificity are:

P(C) = 0.1
P(pos | C) = 0.9
P(neg | ¬C) = 0.5

So the sensitivity is high, but the specificity is much lower.
Bayes Rule then applies the algorithm we saw earlier to calculate the posterior probabilities for the variable given a test outcome:

Positive test: P(C, pos) = 0.1 x 0.9 = 0.09 and P(¬C, pos) = 0.9 x 0.5 = 0.45, so P(pos) = 0.54, giving P(C | pos) = 0.09 / 0.54 ≈ 0.167 and P(¬C | pos) ≈ 0.833.
Negative test: P(C, neg) = 0.1 x 0.1 = 0.01 and P(¬C, neg) = 0.9 x 0.5 = 0.45, so P(neg) = 0.46, giving P(C | neg) = 0.01 / 0.46 ≈ 0.022 and P(¬C | neg) ≈ 0.978.
Robot Sensing
Let's practice using Bayes Rule with a different example. Consider a robot living in a world that has exactly two places: a red place, R, and a green place, G.
Initially, the robot has no idea of its location, so the prior probabilities are P(R) = P(G) = 0.5. The robot has sensors that allow it to see its environment, but these sensors are somewhat unreliable:

P(see R | in R) = 0.8
P(see G | in G) = 0.8
Now suppose the world has three places, A, B and C, so the hidden variable has three states. We will assume that each place has the same prior probability:

P(A) = P(B) = P(C) = 1/3

The robot sees red, and we know that:

P(R | A) = 0.9
P(G | B) = 0.9
P(G | C) = 0.9

so P(R | B) = P(R | C) = 0.1. We can solve for the posterior probabilities exactly as before:

P(A, R) = P(A) x P(R | A) = 1/3 x 0.9 = 0.3
P(B, R) = P(B) x P(R | B) = 1/3 x 0.1 = 0.0333
P(C, R) = P(C) x P(R | C) = 1/3 x 0.1 = 0.0333

So, the normaliser is P(R) = 0.3 + 0.0333 + 0.0333 = 0.3667
which gives the posterior probabilities:

P(A | R) = 0.3 / 0.3667 ≈ 0.82
P(B | R) = 0.0333 / 0.3667 ≈ 0.09
P(C | R) = 0.0333 / 0.3667 ≈ 0.09
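A small sketch of my own (not from the notes) of the same normalisation step, written so that it works for any number of hidden states:

# Posterior probabilities over an arbitrary number of hidden states.
def posteriors(priors, likelihoods):
    """priors[i] = P(state i); likelihoods[i] = P(observation | state i)."""
    joints = [p * l for p, l in zip(priors, likelihoods)]
    normaliser = sum(joints)          # total probability of the observation
    return [j / normaliser for j in joints]

priors = [1/3, 1/3, 1/3]              # P(A), P(B), P(C)
likelihoods = [0.9, 0.1, 0.1]         # P(R | A), P(R | B), P(R | C)
print(posteriors(priors, likelihoods))  # approximately [0.82, 0.09, 0.09]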
Generalising
The last example showed that there may be more than two states of the hidden variable that we are interested in. There may be 3, 4, 5 or any other number. We can solve these cases using exactly the same methods, but we have to keep track of more values. In fact, there can be more than just two outcomes of the test. For example, the robot may see red, green or blue. This means that our measurement probabilities will be more elaborate, but the actual method for calculating the posterior probabilities will remain the same. We can now deal with very large problems, that have many possible hidden causes, by applying Bayes Rule to determine the posterior probabilities.
Correlation vs. Causation

The statement "The chances of dying in hospital are 40 times greater than dying at home" shows that there is a correlation between whether or not you die and whether or not you are in hospital. Whereas the statement "Being in a hospital increases your probability of dying by a factor of 40" is a causal statement: it says that being in hospital causes you to die, not just that being in hospital coincides with the fact that you die. People frequently get this wrong. They observe a correlation, but they suggest that the correlation is causal. To understand why this could be wrong, let's look a little deeper into our example.
Considering Health
Let's say that of the people in the hospital, 36 were sick and 4 of these died. The other 4 people in the hospital were actually healthy, and they all survived. Of the people who were at home, 40 were actually sick and 20 of these people died. The remaining 7960 were healthy, but 20 of these people also died (perhaps due to accidents etc.). These statistics are consistent with our earlier statistics; we have just added another variable, whether a person is sick or healthy. The numbers and percentages of people who died are tabulated below:

            In Hospital          At Home
            Sick     Healthy     Sick     Healthy
Total       36       4           40       7960
Died        4        0           20       20
Died (%)    11.11%   0%          50%      0.2513%
Now, if you are sick, your chances of dying at home are 50% compared with about 11% in the hospital, so you should really make your way to the hospital.
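As a quick check (my own sketch, not from the notes), the death rates in the table can be recomputed directly from the counts:

# Recompute the death rate in each (location, health) group.
groups = {
    ('hospital', 'sick'):    (36, 4),
    ('hospital', 'healthy'): (4, 0),
    ('home', 'sick'):        (40, 20),
    ('home', 'healthy'):     (7960, 20),
}
for group, (total, died) in groups.items():
    print(group, f"{100 * died / total:.2f}% died")
# hospital/sick 11.11%, hospital/healthy 0.00%, home/sick 50.00%, home/healthy 0.25%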
Correlation
So why does the hospital example lead us to draw such a wrong conclusion? We looked at two variables, being in hospital and the chance of dying, and we rightfully observed that these two things are correlated. If we made a scatter-plot with two categories, whether a person was in hospital and whether or not that person died, we would see an increased occurrence of data points as shown below:
This shows that the data is correlated. So what is correlation? Well, in any plot, data is correlated if knowledge about one variable tells us something about the other.
Correlation Quiz
Are the following data plots (A, B, C and D) correlated?
Causation Structure
In the example above, there is clearly a correlation between whether a person is in hospital, and whether or not they die. But we initially left out an important variable: whether or not a person was sick. In fact, it was the sickness that caused people to die. If we add arcs of causation to our diagram, we find that sickness causes death, and that sickness also causes people to go into hospital:
In fact, in our example, once a person knew that they were sick, being in a hospital negatively correlated with them dying. That is, being in a hospital made it less likely that they would die, given that they were sick. In statistics, we call this a confounding variable. It can be very tempting to just omit this from your data, but if you do, you might find correlations that have absolutely nothing to do with causation.
Fire Correlation
Suppose we study a number of fires, recording the number of fire-fighters sent and the size (surface area) of the fire:

# Fire-fighters:   10     40     200     70
Size of fire:      100    400    2000    700
Clearly, the number of fire-fighters is correlated with the size of the fire. But fire-fighters don't cause the fires! Getting rid of all the fire-fighters would not get rid of all the fires. This is actually a case of reverse causation: the size of the fire determines the number of fire-fighters that will be sent to deal with it.
However, it is impossible to know this just from the data. We only know that this is a case of reverse causation because we already know something about fire and firefighters.
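To make "correlated" concrete, here is a small sketch of my own (not from the notes) that computes the Pearson correlation coefficient for the fire data above; in this data set the two columns are perfectly correlated, even though one does not cause the other:

# Pearson correlation between the number of fire-fighters and the fire size.
firefighters = [10, 40, 200, 70]
fire_size = [100, 400, 2000, 700]

def pearson(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5

print(pearson(firefighters, fire_size))  # 1.0, but correlation is not causation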
Assignment
Check out old news articles, in newspapers or online, and find some that take data showing a correlation and then suggest causation from that data, or tell you what to do based on it. You will find that the news is full of such abuses of statistics.
Answers
Loaded Coin Quiz
P(Tails) = 1 - P(Heads) = 0.25
Doubles Quiz
The truth table has 36 possible outcomes, each with probability 1/36. Six of these outcomes are doubles, so the probability of a double is: 6 x 1/36 = 1/6 ≈ 0.167
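For completeness, a tiny sketch (not from the notes) that checks this by enumerating all 36 outcomes:

# Count the doubles among all outcomes of two dice throws.
from itertools import product

outcomes = list(product(range(1, 7), repeat=2))
doubles = [o for o in outcomes if o[0] == o[1]]
print(len(doubles) / len(outcomes))  # 0.1666... = 1/6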
Two Coins Quiz
P(T, T) = 0.08
Normaliser Quiz
P(neg) = 0.01 x 0.1 + 0.99 x 0.9 = 0.892
Correlation Quiz
A. Yes   B. No   C. No   D. Yes