Next: About this document ...
Up: No Title
Previous: No Title
Subsections
Improper integrals like the ones we have been considering in class
have many applications, for example in thermodynamics and heat
transfer. In this lab we will consider the role of improper integrals
in probability, which also has many applications in science and
engineering.
The first concept we need is that of a random variable. Intuitively, a
random variable is used to measure an outcome whose value is not
certain. For example, the number of hours that a hard disk can run
before failing is a random variable because it is not the same for
every drive, even if we only consider identical drives from the same
production run. A few other examples of random variables that are important
in science, engineering, or manufacturing are given below.
- The time it takes for a packet of information to travel from one
location to another on the Internet.
- The number of miles that an automobile tire can be driven before
it fails.
- The lengths of supposedly identical bolts manufactured by a
particular production line.
- The speed of a particular gas molecule in a sample of a gas.
You may be more familiar with what are called discrete random
variables, for example the number of heads obtained in ten tosses of a
coin, which can only take a finite number of discrete values. In the
case of a discrete random variable, the probability of a single
outcome can be positive. For example, the probability that a single
flip of a coin produces tails is 50%. The situation is very different
when we consider a random variable like the number of miles a
tire can be driven before failure, which can take any value from
zero to something over 100,000 miles. Since there are an infinite
number of possible outcomes, the probability that the tire
fails at exactly some number of miles, for example 50,000 miles, is
zero. However, we would expect that the probability that the tire
would fail between 40,000 miles and 100,000 miles would not be
zero, but would be a positive number.
A random variable that can take on a continuous range of values is
called a continuous random variable. There turn out to be lots of
applications of continuous random variables in science, engineering,
and business, so a lot of effort has gone into devising mathematical
models. These mathematical models are all based on the following
definition.
Definition
We say that a random variable X is continuous if there is a function
f(x), called the probability density function, such that
- 1.
, for all x
- 2.

- 3.
where
represents the probability that the random variable X is
greater than or equal to a but less than or equal to b.
For example, consider the following function.

This function is non-negative, and also satisfies the second
condition, since

which is pretty easy to show. So this could be a probability density
function for a continuous random variable X.
A lot of the effort involved in modeling a random process, that is, a
process whose outcome is a random variable, is in finding a suitable
probability density function. Over the years, lots of different
functions have been proposed and used. One thing that they all have in
common, though, is that they depend on parameters. For example, the
general exponential probability density function is defined as

where
is a parameter that can be adjusted to get the best
fit to any particular situation. In the exercises, you will be asked
to show that only positive values of
make sense.
The process of deciding what probability density function to use and
how to determine the parameters is very complicated and can involve
very sophisticated mathematics. However, in the simple approach we are
taking here, the problem of determining the parameter value(s) often
depends on quantities that can be determined experimentally, for
example by collecting data on tire failure. For our purposes, the two
most important quantities are the mean,
and the standard
deviation
. The mean is defined by

and the standard deviation is the square root of the variance, V,
which is defined by

In practice, the variance V is often computed as follows,

which can be easily be obtained by expanding
and writing
V as the sum of three integrals.
Probably the most important distribution is the normal distribution,
widely referred to as the bell-shaped curve. The probability density
function for a normal distribution with mean
and standard
deviation
is given by the following equation.

This distribution has a tremendous number of applications in science,
engineering, and business. The exercises provide a few simple ones.
In applications, one generally has to know in advance that the random
variable you want to model has, approximately, a certain kind of
distribution. How one would determine this is way beyond the scope of
this course, so we won't really discuss it. On the other hand, once
you know, for example, that your random variable has a normal
distribution you only need the values of the mean and the standard
deviation to be able to model it. The exponential distribution is even
simpler, since it only has one parameter, and you only need to know
the mean of your random variable to use this distribution to model it.
One thing to keep in mind when you are using the normal distribution
as a model is that calculations can involve values of your random
variable that don't make physical sense. For example, suppose that a
machining operation produces steel shafts whose diameters have
a normal distribution, with a mean of 1.005 inches and a standard
deviation of 0.01 inch. If you were asked to compute the percentage of the
shafts in a certain production run had diameters less than 0.9
inches you would use the following integral

even though negative values for the shaft diameters don't make sense.
- 1.
- Show that the probability density function given for the
exponential distribution,

satisfies the condition

as long as
is a positive number. What would happen if
was negative?
- 2.
- Show that the mean and the standard deviation of the exponential
distribution are both equal to
. - 3.
- In the kinetic theory of gases, the distance X that a
molecule travels before colliding with another molecule is described
by an exponential distribution. The parameter
is called the
mean free path because it represents the average distance a molecule
travels between collisions. For a certain gas at standard temperature
and pressure, the mean free path is given by
. What is the probability that the distance a
molecule travels between collisions is less than half the mean free path?
- 4.
- The median m of a random variable X is the value of the random
variable such that P(X > m) = 1/2. If X has an exponential
distribution with mean
, find the median m.
- 5.
- In actuarial science, one of the simplest models for describing
mortality is

where x represents the age at which a person dies.
- (a)
- Find k.
- (b)
- Find the probability that a person lives past the age of 60.
- (c)
- What value does this model give for the median life expectancy?
- 6.
- The systolic blood pressure of 18 year old women is normally
distributed with a mean of 120 mm Hg and a standard deviation of 12 mm
Hg. What is the probability that the blood pressure of a randomly
selected 18 year old woman will be greater than 150 mm Hg? Between 110
and 130?
- 7.
- A college professor teaches Chemistry 101 each fall to a large
class of first-year students. For tests, she uses standarized exams
that she knows from past experience produce normal grade distributions
with a mean of 70 and a standard deviation of 12. Her grading
philosophy is to impose standards that will yield, in the long run,
14% A's, 20% B's, 32% C's, 20% D's, and 14% F's. Where should the
cutoff be between the A's and B's? Between the B's and C's?
Next: About this document ...
Up: No Title
Previous: No Title
Dina Solitro
1/18/2000