We have already seen an example of the concept of a probability density distribution in the context of the probability of decay in a radioactive sample. It is worth reviewing the main properties of such distributions, since they are cntral to the discussion of these pages.
Properties of Probability Density Functions:
Ifis a probability density distribution defined on the interval
then
![]()
To paraphrase these properties, (1) means that the total probability that the random variabletakes values anywhere in its permissible range is defined to be 1. (We can think of this as a convention, or a "polite agreement".) (2) means that we do not attach any meaning to negative probabilities, and (3) is the working definition of what we mean by the probability density. Among other things, this means that we cannot really attach a meaning to the probabillity that x takes on an exact value, but we can talk about the probability that it is within some range of values.
This is slightly different from the way that discrete probabilities are handled, though the two are strongly related. If we were to talk about a discrete random event, such the tossing of a coin or a dice, we would assign an exact numerical probability to each possible outcome. For example, in a fair dice, each face value (1,2,...6) is equally likely, so that each one can be assigned a discrete probability of 1/6. The sum of the discrete probabilities (1/6+1/6 +...1/6) is 1. When we deal with continuous random variables, and probability densities, we can only consider ranges of values of the outcome of an experiment, as discussed in the following examples.
How far can a group of people jumpTo illustrate the idea of a continuous random variable, suppose we measure the distances that a group of people can jump at a track and field competition. Suppose that the best jumping record so far is a distance ofMeans and Mediansmeters, and the weakest jump is a mere
meters. In fact, suppose that the distance jumped is always between these two values,
![]()
Let us define
![]()
Then
![]()
would be the probability density distribution for jumps of a given distance, and among its properties are that
![]()
The above is the fraction of people that can jump a distance betweenand
. Suppose we keep track of all these jumps and find that
![]()
What can we say about the average performance of this group of people?
To answer this question, we first need to convert the above function to a probability density distribution by finding the value of the constant. This process is called Normalization.
Finding the Constant (Normalization)
To find the constant we use the fact that
![]()
Thus:
![]()
Thus the constant is, and we now have the (normalized) probability density distribution,
![]()
Finding the average jump distance
With this probability distribution, we can now caluclulate the average jump distance. (This is also refered to as the mean and the expectation value of x. We use the notationto denote this average value. Then
![]()
We calculate that
![]()
We calculate that
![]()
Thus, the average jump distance is three meters. This is not too surprising given the fact that the probability density distribution is a function which is symmetric about. We will see later that for this distribution, the average (mean) coincides with another "typical value" called the median.
Probability of Jumping more than 5 meters
We can compute this probability simply by calculating the integral
![]()
The result is:
![]()
For your consideration:
- (1) Sketch the probability density distribution in this example. Show that it is symmetric about the value
.
- (2) Show that the probability of jumping under 3 meters is the same as the probability of jumping between 3 and 6 meters - i.e. that each is equal to 1/2. We say that 3 meters is also a median of the distribution when this is true.
- (3) Suppose you are told that a metal rod of length 6 meters has a total mass 1 kg and a density distribution
. How would you determine its center of mass? What is the connection between the average jump and the center of mass in the two problems?
There is more than one way to define a typical value in the results of an experiment (such as the jumping contest above). We have already discussed the mean or average value. Below, we will also introduce the idea of a median. Essentially, the median is a value "in the middle of the distribution" such that it is equally probable to score better or worse. We will make this idea more precise in the example below. Note that this example will involve a probability distribution that is not symmetric. (When the distribution is symmetric, as in our first example, the mean and the median are at the same place.
Consider the probability density distribution
![]()
Then we might note that
- The distribution has the shape of a straight line.
- It is positive only for
. This will be the range over which we consider the distribution.
- The constant
is not yet specified, but it must have a value such that the area under the curve of
is 1.
The last constraint means that
![]()
You should be able to conclude that.
If we compute the mean of the distribution, we find that
![]()
We find that
![]()
Thus the mean is at![]()
The median is the valuesuch that
![]()
To find the median, we calculate the above integral and then determine m as follows:
![]()
Thus
![]()
We need to solve this equation for tha value of m. Rearranging leads to a quadratic equation in standard form:
![]()
The result is then
![]()
Only one of these values, namely m=0.2929 is inside the required range (in the interval [0,1]), and so is the value we are looking for. The median in this case is not at the same location as the mean, but rather slightly to the left of it. It denotes a value which subdivides the area under the graph ofinto two equal portions, each one equal to 1/2. For a probability distribution as given here, it is equally likely to score a value smaller or larger than 0.2929.
For your consideration:
- (1) Sketch the probability density distribution in this example. Mark the values associated with the mean and the median.
- (2) What is the probability of scoring a result between 0.75 and 1 ?.
- (3) Explain the differnec between the concept of a mean and a median.