Joint probability distribution
From Wikipedia, the free encyclopedia
In the study of probability, given two random variables X and Y, the joint distribution of X and Y defines the probability of events defined in terms of both X and Y. In the case of only two random variables, this is called a bivariate distribution, but the concept generalizes to any number random variables, giving a multivariate distribution.
Contents |
[edit] Two dimensions
The joint probability distribution of a pair of random variables is defined by the joint cumulative distribution function;
Similarly, the joint distribution of a multivariate distribution is defined by the joint cumulative distribution function for the set of random variables.
[edit] Discrete case
For discrete random variables, the joint probability mass function is
Since these are probabilities, we have
[edit] Continuous case
Similarly for continuous random variables, the joint probability density function can be written as fX,Y(x, y) and this is
where fY|X(y|x) and fX|Y(x|y) give the conditional distributions of Y given X = x and of X given Y = y respectively, and fX(x) and fY(y) give the marginal distributions for X and Y respectively.
Again, since these are probability distributions, one has
[edit] Mixed case
In some situations X is continuous but Y is discrete. For example, in a logistic regression, one may wish to predict the probability of a binary outcome Y conditional on the value of a continuously-distributed X. In this case, (X, Y) has neither a probability density function nor a probability mass function in the sense of the terms given above. On the other hand, a "mixed joint density" can be defined in either of two ways:
Formally, fX,Y(x, y) is the probability density function of (X, Y) with respect to the product measure on the respective supports of X and Y. Either of these two decompositions can then be used to recover the joint cumulative distribution function:
The definition generalizes to a mixture of arbitrary numbers of discrete and continuous random variables.
[edit] General multidimensional distributions
The joint distribution of two random variables can be extended to many random variables X1, ... Xn by adding them sequentially with the identity
where
and
(notice, that these latter identities can be useful to generate a random variable
with given distribution function
); the density of the marginal distribution is
The joint cumulative distribution function is
and the conditional distribution function is accordingly
Expectation reads
suppose that h is smooth enough and
for
, then, by iterated integration by parts,
[edit] Joint distribution of independent variables
If for discrete random variables
for all x and y, or for continuous random variables
for all x and y, then X and Y are said to be independent.
[edit] See also
- Chow-Liu tree
- Copula (statistics)
- Bayesian networks
- Multivariate statistics
- Multivariate normal distribution
- Statistical interference













![\mathbb{E}\left[h(X_1,\dots X_n) \right]=\int_{-\infty}^\infty \dots \int_{-\infty}^\infty h(x_1,\dots x_n) f_{X_1,\dots X_n}(x_1,\dots x_n) \mathrm{d} x_1 \dots x_n;](http://upload.wikimedia.org/math/6/9/9/699fe2869d9c59fe43208fa3a8e7e607.png)
![\begin{align}\mathbb{E}\left[h(X_1,\dots X_n) \right]=& h(x_1,\dots x_n)+ \\
& (-1)^n \int_{-\infty}^{x_1} \dots \int_{-\infty}^{x_n} F_{X_1,\dots X_n}(u_1,\dots u_n) \frac{\partial^n}{\partial x_1 \dots \partial x_n} h(u_1,\dots u_n) \mathrm{d} u_1 \dots u_n.\end{align}](http://upload.wikimedia.org/math/4/7/b/47b5594b09c547915e801e45eb9f4244.png)

