Probability theory is a powerful tool for inferring the value of missing variables given a set of other variables. As the number of variables in a system increases, the joint probability distribution over these variables becomes overwhelmingly large. In this lecture we examine the implications of factoring one large joint probability distribution into a set of smaller conditional distributions by exploiting independencies between variables and study suitable algorithms for inference.