The building blocks

A starting point for our analysis is a probability triple $ \left( \Omega ,%%
\mathscr{F},P\right) $. The ingredients of this triple are defined in the following way:

Armed with a probability triple of the above form we can define a random variable $ Y$ as a mapping from [subsets of] $ \Omega$ to $ \mathbbm{R}\cup \left\{ \pm \infty \right\} $. For example the daily return of a stock price is a random variable. It is a black box that takes the events [for example mergers, strikes, floods, etc.] and gives a real number [the stock price return]. Random experiments have qualitative aspects, like mergers above. A random variable gives a quantitative flavor to those events, in order for one to handle them effectively.

The set $ \Omega$ in this particular case is massive, since it includes everything that affects the asset price is any possible way. Theoretically speaking, the $ \sigma $-algebra would consist of collections of subsets of $ \Omega$, the possible events that are of interest. One can see why the minimal algebra is equivalent of knowing nothing, since the only events that are considered are the impossible [$ \emptyset $] and the certain [$ \Omega$] ones. The maximal algebra is equivalent to knowing everything, since the events considered are as detailed as possible [an event such as the hangover of a temporary clerk in the department of agriculture in the bank of Greenland would be included, since it affects the world markets in a chaotic manner]. In general, a typical event $ \sigma $-algebra would lie somewhere between, for instance a rise of the

US interest rates by fifty basis points would be included, while the hangover of our outgoing clerk would not.

When the sample space is just the real line, $ \Omega
=\mathbbm{R}$, we can define the Borel $ \sigma $-algebra as the smallest $ \sigma $-algebra containing all open intervals in $ \mathbbm{R}$. This would be the case if the stock return itself was considered the random experiment itself, ignoring its underlying causes. In this case, the random variable $ Y$ would just be the identity function. We denote this algebra by $ \mathscr{B}\left( \mathbbm{R}\right) $, and in a similar fashion we can construct $ \mathscr{B}\left( \mathbbm{R}^{k}\right) $. The properties listed above imply that all closed, semiopen and semiclosed intervals, as well as their unions belong to $ \mathscr{B}\left( \mathbbm{R}\right) $. We will also define the Lebesgue measure as the function that maps each interval to its length. We will denote this function with $ \mu
_{0}:\mathscr{B}\left( \mathbbm{R}\right) \rightarrow
\mathbbm{R}_{+}$.

Example 34 (Standard normal distribution)   We start by considering the space $ \left(
\mathbbm{R},\mathscr{B}\left( \mathbbm{R}\right) \right) $. Define the function $ \phi \left( x\right) =\left( 2\pi \right)
^{-1/2}\exp \left\{ -x^{2}/2\right\} $, which is just the standardized normal density. Define the function $ P$ as the Lebesgue integral

$\displaystyle P\left( F\right) =\int_{F}\phi d\mu _{0}$.

If the set $ F=\left[ \alpha ,\beta \right] $ we can then rewrite the above in the more familiar form

$\displaystyle P\left( \left[ \alpha ,\beta \right] \right) =\int_{\alpha }^{\beta }\frac{1}{\sqrt{2\pi }}e^{-\frac{x^{2}}{2}}dx$.

Example 35 (Any distribution)   In the example above we used as $ \phi $ the normal density. In fact, if $ %%
\phi $ is a nonnegative function on $ \left(
\mathbbm{R},\mathscr{B}\left( \mathbbm{R}\right) \right) $ satisfying $ \int_{R}\phi d\mu _{0}=1$, we can call it a density, and we can define the probability measure

$\displaystyle P\left( F\right) =\int_{F}\phi d\mu _{0},\forall F\in
\mathscr{B}\left( \mathbbm{R}\right)$   .

In the remainder of this chapter we will encounter measures that have a relation like the one above, namely the market measure and the risk neutral measure. From this standpoint we call the function $ \phi $ the Radon-Nikodym derivative of $ P$ with respect to $ \mu _{0}$ and we write $ %%
\phi =\frac{dP}{d\mu _{0}}$.

A random variable itself can generate a $ \sigma $-algebra. Suppose that we start with a random variable $ Y:\Omega \rightarrow
\mathbbm{R}$. We will say that a set $ A\subset \Omega $ is determined by $ Y$ if by the value of $ Y\left( \omega
\right) ,\omega \in \Omega $ alone we can determine whether or not $ \omega \in A$. The collection of subsets of $ \Omega$ determined by $ Y$ forms a $ \sigma $-algebra, which is called the $ \sigma $-algebra generated by $ Y$ and is denoted by $ \sigma \left( Y\right)
$. Apparently, just by observing the stock prices one cannot conclude whether the price went down because of a specific strike action (which could be the event $ \omega _{1}$) or because of a particular interest rates increase (which could be the event $ \omega _{2}$): these kind of events alone which belong to $ \Omega$ cannot be determined by the stock price, and therefore will not be members of $ \sigma \left( Y\right)
$, $ \left\{ \omega
_{1}\right\} \notin \sigma \left( Y\right) $ and $ \left\{ \omega
_{2}\right\} \notin \sigma \left( Y\right) $. On the other hand, collections of such events will belong to $ \sigma \left( Y\right)
$: if only $ \omega _{1}$ or $ \omega _{2}$ make the stock return take the observed value, then $ \left\{ \omega _{1},\omega
_{2}\right\} \in \sigma \left( Y\right) $. If $ \mathscr{F}=\mathscr{B}\left( \mathbbm{R}\right) $, then in general $ \sigma \left( Y\right) =\left\{ Y^{-1}\left( B\right)
:B\in \mathscr{B}\left( \mathbbm{R}\right) \right\} $. One can now see the notion of a filtration which is generated by a random variable: by observing the random variable we understand more and more about what its underlying causes have been.

We will now turn to the definition of the conditional expectation. To illustrate the importance of the $ \sigma $-algebra that we defined above we start with an discrete sample space example.

Example 36 (Stock price and coin tosses)   Say that the market decides whether the stock price will go up or down by tossing a coin. If we consider a three period setting, the sample space would be

$\displaystyle \Omega =\left\{ HHH,HHT,HTH,\ldots ,THT,TTH,TTT\right\}$   ,

having $ 3^{2}=9$ elements. Now we write $ S_{k}\left( \omega \right) $ the stock price after $ k$ periods. Apparently
$\displaystyle S_{1}\left( \omega \right)$ $\displaystyle =$ $\displaystyle S_{1}\left( \omega _{1},\omega _{2},\omega
_{3}\right) =S_{1}\left( \omega _{1}\right)$   ,  
$\displaystyle S_{2}\left( \omega \right)$ $\displaystyle =$ $\displaystyle S_{2}\left( \omega _{1},\omega _{2},\omega
_{3}\right) =S_{2}\left( \omega _{1},\omega _{2}\right)$   , and  
$\displaystyle S_{3}\left( \omega \right)$ $\displaystyle =$ $\displaystyle S_{3}\left( \omega _{1},\omega _{2},\omega
_{3}\right)$   .  

Every $ S_{k}$ is a random variable defined on $ \Omega$. Now suppose that $ %%
\mathscr{F}=\mathcal{P}\left( \Omega \right) $; which is the maximal $ \sigma $-algebra. We can easily verify that every $ S_{k}:\Omega \rightarrow \mathbbm{R}$ is $ \mathscr{F}$-measurable: if we know the sequence $ \left( \omega
_{1},\omega _{2},\omega _{3}\right) $ we can be sure about the stock price $ S_{k}$ at every period $ k$. On the other hand, we can observe that in order to be sure about $ S_{2}$ we do not really need the whole sequence $ %%
\left( \omega _{1},\omega _{2},\omega _{3}\right) $; the first two elements are sufficient. We can do something about that, by introducing the notion of a sub-$ \sigma $-algebra.

Example 37 (cont. Stock price and coin tosses)   Let us see now what kind of sets are determined by the random variables $ %%
S_{k}$. If we consider the Cox-Ross-Rubinstein recombining tree $ S_{1}$ will determine the set $ \mathscr{F}_{1}$ with elements the sets

$\displaystyle \begin{tabular}{l}
$F_{H}=\left\{ HHH,HHT,HTH,HTT\right\} $ \\
...
...ft\{ THH,THT,TTH,TTT\right\} $ \\
$\emptyset $ \\
$\Omega $%%
\end{tabular}$

The random variable $ S_{2}$ will determine the set $ \mathscr{F}_{2}$ with elements the sets

$\displaystyle \begin{tabular}{l}
$F_{HH}=\left\{ HHH,HHT\right\} $ \\
$F_{TT}...
...\\
All the unions of the above \\
$\emptyset $ \\
$\Omega $%%
\end{tabular}$

Example 38 (cont. Stock price and coin tosses)   It is very important to illustrate the significance of the random variable when one derives the generated $ \sigma $-algebras. If $ S_{2}\left(
H,T\right) \neq S_{2}\left( T,H\right) $, the generated set $ \mathscr{F}%%
_{2}^{\star }$ would have elements the sets

$\displaystyle \begin{tabular}{l}
$F_{HH}=\left\{ HHH,HHT\right\} $ \\
$F_{TT}...
...\\
All the unions of the above \\
$\emptyset $ \\
$\Omega $%%
\end{tabular}$

Now we can turn into defining the conditional expectation of the stock price. Apparently we have to begin by introducing a probability measure [so far we have only dealt with the space $ \left( \Omega ,\mathscr{F}\right) $ and the subspaces $ \left(
\Omega ,\mathscr{F}_{k}\right) $]. So let's denote with $ p\in
\left( 0,1\right) $ the probability of heads, and let us also assume that the tosses are independent. This implies that $ P\left(
\omega \right) =P\left( \omega _{1},\omega _{2},\omega _{3}\right)
=p^{n}\left( 1-p\right) ^{3-n}$, where $ n$ is the number of heads in the set $ \left\{ \omega _{1},\omega _{2},\omega _{3}\right\} $. Then of course for every subset $ A\subseteq \Omega $ we define

$\displaystyle P\left( A\right) =\sum_{\omega \in A}P\left( \omega \right)$   .

If the set was continuous we generalize by writing

$\displaystyle P\left( A\right) =\int_{A}dP\left( \omega \right)$   .

The expectation is defined for any random variable $ Y\left( \omega \right) $ as

$\displaystyle \mathbf{E}Y$ $\displaystyle =$ $\displaystyle \sum_{\omega \in \Omega }Y\left( \omega \right) P\left(
\omega \right)$   , or  
$\displaystyle \mathbf{E}Y$ $\displaystyle =$ $\displaystyle \int_{\Omega }Y\left( \omega \right) dP\left( \omega \right)$   .  

By denoting $ \mathtt{I}_{A}$ the indicator function we can define the partial averaging as
$\displaystyle \mathbf{E}\mathtt{I}_{A}Y$ $\displaystyle =$ $\displaystyle \sum_{\omega \in A}Y\left( \omega
\right) P\left(
\omega \right)$   , or  
$\displaystyle \mathbf{E}\mathtt{I}_{A}Y$ $\displaystyle =$ $\displaystyle \int_{A}Y\left( \omega \right)
dP\left( \omega \right)$   .  

Example 39 (cont. Stock price and coin tosses)   Now suppose that we want to compute the estimate of $ S_{1}$ given $ S_{2}$. This estimate is denoted by $ \mathbf{E}\left[ S_{1}\vert S_{2}\right] =\mathbf{E}%%
\left[ S_{1}\vert S_{2}\right] \left( \omega \right) $ to illustrate its dependence on $ \omega $. Its value will be equal to

$\displaystyle \mathbf{E}\left[ S_{1}\vert S_{2}\right] \left( \omega \right) =\mathbf{E}\left[
S_{1}\vert S_{2}=S_{2}\left( \omega \right) \right]$   .

We can check that $ \mathbf{E}\left[ S_{1}\vert S_{2}\right] $ is $ \sigma \left(
S_{2}\right) $-measurable: if we know the value of $ S_{2}\left( \omega
\right) $ we can be sure of the value $ \mathbf{E}\left[ S_{1}\vert S_{2}\right] $, the uncertainty comes only through the value of $ S_{2}$.

More formally, let $ \left( \Omega ,\mathscr{F},P\right) $ be a probability space. In addition let $ \mathscr{G}$ be a sub-$ \sigma $-algebra of $ \mathscr{F}$, and $ %%
Y$ a random variable on $ \left( \Omega ,\mathscr{F},P\right) $. Then $ %%
\mathbf{E}\left[ Y\vert\mathscr{G}\right] $ is defined to be a random variable that satisfies:

  1. $ \mathbf{E}\left[ Y\vert\mathscr{G}\right] $ is $ \mathscr{G}$-measurable, and

  2. For every set $ G\in \mathscr{G}$ the partial averaging property

    $\displaystyle \int_{G}\mathbf{E}\left[ Y\vert\mathscr{G}\right] \left( \omega
\...
...P\left( \omega \right) =\int_{G}Y\left( \omega \right)
dP\left( \omega \right)
$

    holds.

Such a random variable always exists and it is unique with probability one. For two random variables $ X$ and $ Y$ we also write $ \mathbf{E}\left[ Y\vert X\right] $ and we mean $ \mathbf{E}\left[ Y\vert\sigma \left( X\right) \right] $. There are two things to remember:

  1. A random experiment is performed, that is to say an element $ \omega $ of $ \Omega$ is selected. The value of $ \omega $ is partially revealed to us, and therefore we cannot compute the exact value of a random variable $ %%
Y\left( \omega \right) $. Nevertheless, based on the partial information that we have we can form the expectation about $ Y\left( \omega \right) $. Since the estimate will depend on the partial information that we have about $ \omega $ it will also depend on $ \omega $, thus making the estimate a random variable [something which is usually ignored].

  2. If the $ \sigma $-algebra $ \mathscr{G}$ contains many sets, there will be the smallest one containing $ \omega $, which we denote with $ G$ [we can easily observe that $ G=\cap \left\{ G^{\star }:\omega \in
G^{\star }\in \mathscr{G}\right\} $]. This is the way that the information has been revealed to us: we know that $ \omega \in G$, but we don't know which exact element it is. How are we going to form our expectation? By partial averaging over the set $ G$.

Example 40 (cont. Stock price and coin tosses)   Say that we want to compute $ \mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] $. Recall that $ \mathscr{F}_{1}=\left\{ \emptyset ,\Omega
,F_{H},F_{T}\right\} $. In addition, $ \mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}%%
\right] $ must be constant on $ F_{H}$ and $ F_{T}$, since it is $ \mathscr{F}%%
_{1}$-measurable. The partial averaging property implies that
$\displaystyle \int_{F_{H}}\mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left(
\omega \right) dP\left( \omega \right)$ $\displaystyle =$ $\displaystyle \int_{F_{H}}S_{2}\left(
\omega \right)
dP\left( \omega \right)$   , and  
$\displaystyle \int_{F_{T}}\mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left(
\omega \right) dP\left( \omega \right)$ $\displaystyle =$ $\displaystyle \int_{F_{T}}S_{2}\left(
\omega \right) dP\left( \omega \right)$   .  

We can compute
$\displaystyle \int_{F_{H}}\mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left(
\omega \right) dP\left( \omega \right)$ $\displaystyle =$ $\displaystyle P\left( F_{H}\right)
\mathbf{E}\left[
S_{2}\vert\mathscr{F}_{1}\right] \left( \omega \right)$  
  $\displaystyle =$ $\displaystyle p\mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left( \omega
\right) ,\forall \omega \in F_{H}$.  

In addition,

$\displaystyle \int_{F_{H}}S_{2}\left( \omega \right) dP\left( \omega \right)
=p^{2}u^{2}S_{0}+p\left( 1-p\right) udS_{0}$.

The above two equations can be solved to yield
$\displaystyle \mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left( \omega
\right)$ $\displaystyle =$ $\displaystyle pu^{2}S_{0}+\left( 1-p\right) udS_{0}$  
  $\displaystyle =$ $\displaystyle \left[ pu+\left( 1-p\right) d\right] uS_{0}$  
  $\displaystyle =$ $\displaystyle \left[ pu+\left( 1-p\right) d\right] S_{1}\left( \omega \right) ,\forall
\omega \in F_{H}$.  

Similarly

$\displaystyle \mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left( \omega
\...
...\left( 1-p\right) d\right] S_{1}\left( \omega
\right) ,\forall \omega \in F_{T}$,

which conclude with the familiar

$\displaystyle \mathbf{E}\left[ S_{2}\vert\mathscr{F}_{1}\right] \left( \omega
\...
...left( 1-p\right) d\right] S_{1}\left( \omega
\right) ,\forall \omega \in \Omega$   .

Kyriakos 2003-03-17