A distribution function, often referred to as a cumulative distribution function (CDF), is a fundamental concept in probability and statistics. It provides a complete description of the probability distribution of a random variable by detailing the likelihood that the variable will take on a value less than or equal to a specific point. In simpler terms, it allows us to understand how probabilities accumulate over a range of values.
Mathematically, for a random variable X, the distribution function F(x) is defined as:
F(x) = P(X ≤ x)
This equation states that F(x) gives the probability that the random variable X is less than or equal to the value x. The function has several important properties:
- Non-decreasing: As x increases, F(x) does not decrease.
- Limits: F(x) approaches 0 as x approaches negative infinity and approaches 1 as x approaches positive infinity.
- Range: The values of F(x) range from 0 to 1.
Distribution functions can be used in various applications, such as determining probabilities, making predictions, and performing statistical analyses. They are foundational in fields like machine learning, where understanding the distribution of data points is crucial for developing models. Additionally, different types of distribution functions exist, such as normal, binomial, and Poisson distributions, each describing different types of data behavior.