Sigma heat

In probability theory, the chain rule permits the calculation of any member of the joint distribution of a set of random variables using only conditional probabilities. The rule is useful in the study of Bayesian networks, which describe a probability distribution in terms of conditional probabilities.

Consider an indexed set of sets $A_{1}, \dots, A_{n}$ . To find the value of this member of the joint distribution, we can apply the definition of conditional probability to obtain:

P (A_{n}, \dots, A_{1}) = P (A_{n} | A_{n - 1}, \dots, A_{1}) \cdot P (A_{n - 1}, \dots, A_{1})

Repeating this process with each final term creates the product:

P (\cap_{k = 1}^{n} A_{k}) = \prod_{k = 1}^{n} P (A_{k} ∣ \cap_{j = 1}^{k - 1} A_{j})

With four variables, the chain rule produces this product of conditional probabilities:

P (A_{4}, A_{3}, A_{2}, A_{1}) = P (A_{4} ∣ A_{3}, A_{2}, A_{1}) \cdot P (A_{3} ∣ A_{2}, A_{1}) \cdot P (A_{2} ∣ A_{1}) \cdot P (A_{1})

This rule is illustrated in the following example. Urn 1 has 1 black ball and 2 white balls and Urn 2 has 1 black ball and 3 white balls. Suppose we pick an urn at random and then select a ball from that urn. Let event A be choosing the first urn: P(A) = P(~A) = 1/2. Let event B be the chance we choose a white ball. The chance of choosing a white ball, given that we've chosen the first urn, is P(B|A) = 2/3. Event A, B would be their intersection; choosing the first urn and a white ball from it. The probability can be found by the chain rule for probability:

P (A, B) = P (B ∣ A) P (A) = 2 / 3 \times 1 / 2 = 1 / 3

.

References

Template:Russell Norvig 2003, p. 496.
"The Chain Rule of Probability", developerWorks, Nov 3, 2012.

Sigma heat

References

Navigation menu

Search