Explicit Solution and Distribution¶
This section covers the explicit solution and distribution of the Vasicek short rate in the context of Chapter 18.
Learning Objectives
By the end of this section, you will be able to:
- Derive the explicit solution to the Vasicek SDE
- Understand the distribution of short rates at any future time
- Calculate expected values and variances
- Interpret the asymptotic behavior of the process
- Use distributional properties for risk management
Overview¶
While the Vasicek SDE is elegant and intuitive, having explicit solutions and understanding the distribution of future rates is crucial for practical implementation. This section derives closed-form expressions for the mean and variance of Vasicek short rates, which can then be used for forecasting, option pricing, and risk measurement.
Solving the Vasicek SDE¶
Method: Integrating Factor¶
The Vasicek SDE is:
This is a linear SDE with a known explicit solution. We can solve it using the integrating factor method.
First, rewrite the equation in standard form:
The integrating factor is \(e^{\alpha t}\). Multiply both sides:
The left side is \(d(e^{\alpha t}r_t)\):
Integrating from \(0\) to \(t\):
The deterministic integral evaluates to:
Therefore:
Solving for \(r_t\):
This is the explicit solution of the Vasicek SDE.
Alternative Form¶
The solution can also be written as:
This form shows that \(r_t\) is a weighted average of: 1. The initial rate \(r_0\) (decaying exponentially) 2. The long-run mean \(\theta\) (approaching with weight \(1 - e^{-\alpha t}\)) 3. Accumulated random shocks weighted by their exponential decay
Distribution of Short Rates¶
Conditional Distribution Given r_0¶
Since the solution is a linear combination of the initial condition, a constant, and a Gaussian integral, \(r_t\) is normally distributed conditional on \(r_0\):
where the conditional mean and variance are:
Derivation of Conditional Mean¶
Taking the expectation of the explicit solution:
The Itô integral has zero expectation:
Therefore:
This can be rewritten as:
Interpretation: The expected rate is a weighted average of the initial rate and the long-run mean, where the weight on the initial rate exponentially decays at rate \(\alpha\).
Derivation of Conditional Variance¶
The variance is determined by the stochastic integral component. Using Itô's isometry:
Substituting \(u = t - s\):
Therefore:
Asymptotic Distribution¶
As t → ∞¶
As time progresses to infinity, the effect of the initial condition \(r_0\) vanishes, and the conditional distribution converges to the stationary distribution:
Therefore, the stationary distribution is:
Interpretation of Stationary Variance¶
The long-run volatility \(\sqrt{\frac{\sigma^2}{2\alpha}}\) depends on two competing effects:
Higher volatility \(\sigma\): Increases long-run variance (more shocks) Higher mean reversion \(\alpha\): Decreases long-run variance (shocks are damped faster)
The ratio \(\frac{\sigma^2}{2\alpha}\) balances these effects. For example: - If we double \(\sigma\) (more turbulent), variance quadruples - If we double \(\alpha\) (stronger mean reversion), variance halves
Sample Paths and Monte Carlo Simulation¶
Discretization Scheme¶
For numerical simulation, the Euler-Maruyama discretization is:
where \(Z \sim \mathcal{N}(0, 1)\) and \(\Delta t\) is the time step.
Alternatively, using the exact solution for discrete times:
The second form is exact (no discretization error) when moving between time steps.
Path Characteristics¶
Sample paths of Vasicek rates typically exhibit:
- Mean-reversion clustering: Paths oscillate around \(\theta\)
- Exponential decay of perturbations: Deviations from equilibrium exponentially decay at rate \(\alpha\)
- Bounded variance growth: Unlike Brownian motion, variance bounded at \(\frac{\sigma^2}{2\alpha}\)
- Continuous paths: No jumps (in the basic model)
Effect of Parameters on Paths¶
Increasing \(\alpha\) (faster mean reversion): - Paths cluster more tightly around \(\theta\) - Faster return to mean after shocks - Less variation from equilibrium
Increasing \(\sigma\) (higher volatility): - More dramatic excursions from \(\theta\) - Greater amplitude of fluctuations - Wider band of typical values
Changing \(\theta\) (different equilibrium): - Shifts the entire path up or down - Mean-reversion target changes - Doesn't affect volatility structure
Transition Density¶
The transition probability density from \(r_s\) at time \(s\) to \(r_t\) at time \(t > s\) is Gaussian:
where:
This transition density is essential for: - Computing likelihood functions for parameter estimation - Filtering and state estimation - Computing conditional probabilities in option pricing
Practical Implications¶
Forecasting¶
For a one-year forecast with \(r_0 = 0.03\), \(\theta = 0.04\), \(\alpha = 0.2\), \(\sigma = 0.01\):
The forecast is between the current rate and the long-run mean, closer to the current rate due to the short horizon.
Confidence intervals can be computed from the variance:
A 95% confidence interval: \([0.0357 - 1.96(0.0048), 0.0357 + 1.96(0.0048)] = [0.0261, 0.0453]\)
Risk Management¶
The stationary variance \(\frac{\sigma^2}{2\alpha}\) provides a measure of long-run interest rate risk. Portfolio managers use this to set: - Duration targets - Hedge ratios - Value-at-risk (VaR) limits
Summary¶
The Vasicek model admits a closed-form solution with explicit distributional properties:
- Conditional mean: Exponentially weighted average of initial rate and long-run mean
- Conditional variance: Grows from 0 to \(\frac{\sigma^2}{2\alpha}\) as time increases
- Stationary distribution: Normal with mean \(\theta\) and variance \(\frac{\sigma^2}{2\alpha}\)
- Transition density: Gaussian between any two times
These explicit formulas enable precise forecasting, efficient simulation, and rigorous risk quantification without resorting to approximations.
Based on: QuantPie Lecture Notes - Interest Rate Modeling
Exercises¶
Exercise 1. For Vasicek parameters \(\kappa = 0.5\) (using \(\alpha = \kappa\)), \(\theta = 0.05\), \(\sigma = 0.02\), and \(r_0 = 0.03\), compute the conditional mean \(\mathbb{E}[r_t | r_0]\) and conditional standard deviation \(\sqrt{\text{Var}(r_t | r_0)}\) at \(t = 1, 5, 10\) years.
Solution to Exercise 1
Using \(\kappa = 0.5\), \(\theta = 0.05\), \(\sigma = 0.02\), \(r_0 = 0.03\).
The conditional mean is \(\mathbb{E}[r_t | r_0] = \theta + (r_0 - \theta)e^{-\kappa t} = 0.05 + (0.03 - 0.05)e^{-0.5t} = 0.05 - 0.02\,e^{-0.5t}\).
The conditional standard deviation is \(\text{SD}(r_t) = \sigma\sqrt{\frac{1 - e^{-2\kappa t}}{2\kappa}} = 0.02\sqrt{\frac{1 - e^{-t}}{1.0}}\).
At \(t = 1\):
At \(t = 5\):
At \(t = 10\):
The mean converges to \(\theta = 0.05\) and the standard deviation converges to \(\sigma/\sqrt{2\kappa} = 0.02/\sqrt{1.0} = 0.02\), both essentially reached by \(t = 10\).
Exercise 2. Derive the stationary distribution by taking \(t \to \infty\) in the conditional mean and variance formulas. Show that \(r_\infty \sim \mathcal{N}(\theta, \sigma^2/(2\kappa))\). For the parameters in Exercise 1, compute the stationary mean and standard deviation.
Solution to Exercise 2
The conditional mean is \(\mathbb{E}[r_t | r_0] = \theta + (r_0 - \theta)e^{-\kappa t}\). As \(t \to \infty\), \(e^{-\kappa t} \to 0\) (since \(\kappa > 0\)), so:
The conditional variance is \(\text{Var}(r_t | r_0) = \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa t})\). As \(t \to \infty\), \(e^{-2\kappa t} \to 0\), so:
Since \(r_t\) is Gaussian for all \(t\) (being a linear functional of Gaussian noise), the limiting distribution is also Gaussian:
For the parameters in Exercise 1 (\(\kappa = 0.5\), \(\theta = 0.05\), \(\sigma = 0.02\)):
In steady state, the short rate fluctuates around 5% with a standard deviation of 2%, so approximately 95% of the time it lies in the interval \([1\%, 9\%]\).
Exercise 3. Using the explicit solution \(r_t = r_0 e^{-\kappa t} + \theta(1 - e^{-\kappa t}) + \sigma\int_0^t e^{-\kappa(t-s)}dW_s\), explain why \(r_t\) is Gaussian. Identify the deterministic part and the stochastic part. What distribution does the stochastic integral have?
Solution to Exercise 3
The explicit solution is:
The deterministic part \(\mu(t) = r_0 e^{-\kappa t} + \theta(1 - e^{-\kappa t})\) is a non-random function of \(t\) that interpolates from \(r_0\) (at \(t = 0\)) to \(\theta\) (as \(t \to \infty\)).
The stochastic part is the Ito integral \(X_t = \sigma\int_0^t e^{-\kappa(t-s)}\,dW_s\). Since the integrand \(\sigma e^{-\kappa(t-s)}\) is a deterministic (non-random) function of \(s\), this Ito integral is Gaussian with:
- Mean: \(\mathbb{E}[X_t] = 0\) (Ito integrals of deterministic integrands have zero mean)
- Variance: \(\text{Var}(X_t) = \sigma^2\int_0^t e^{-2\kappa(t-s)}\,ds = \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa t})\) (by Ito's isometry)
Since \(r_t = \mu(t) + X_t\) is the sum of a constant and a Gaussian random variable, \(r_t\) is itself Gaussian:
The key property is that the Ito integral of a deterministic function against Brownian motion is always normally distributed---this is a fundamental result of stochastic calculus.
Exercise 4. Compute \(\mathbb{P}(r_5 > 0.08 | r_0 = 0.03)\) for \(\kappa = 0.5\), \(\theta = 0.05\), \(\sigma = 0.02\). Also compute \(\mathbb{P}(r_5 < 0 | r_0 = 0.03)\).
Solution to Exercise 4
With \(\kappa = 0.5\), \(\theta = 0.05\), \(\sigma = 0.02\), \(r_0 = 0.03\), at \(t = 5\):
Probability \(r_5 > 0.08\):
There is approximately a 5.6% probability that rates exceed 8% in 5 years.
Probability \(r_5 < 0\):
There is approximately a 0.76% probability of negative rates at the 5-year horizon. This illustrates the well-known feature that the Vasicek model assigns positive probability to negative rates, though for these parameters the probability is small.
Exercise 5. The half-life of mean reversion is \(t_{1/2} = \ln 2/\kappa\). Compute \(t_{1/2}\) for \(\kappa = 0.1, 0.5, 2.0\). After two half-lives, what fraction of the initial deviation \(r_0 - \theta\) remains?
Solution to Exercise 5
The half-life is the time for the expected deviation \(\mathbb{E}[r_t - \theta | r_0] = (r_0 - \theta)e^{-\kappa t}\) to decay to half its initial value:
For the three values:
- \(\kappa = 0.1\): \(t_{1/2} = \ln 2 / 0.1 = 6.93\) years
- \(\kappa = 0.5\): \(t_{1/2} = \ln 2 / 0.5 = 1.39\) years
- \(\kappa = 2.0\): \(t_{1/2} = \ln 2 / 2.0 = 0.347\) years \(\approx\) 4.2 months
After two half-lives (\(t = 2t_{1/2}\)):
So one-quarter (25%) of the initial deviation \(r_0 - \theta\) remains. This is analogous to radioactive decay: after \(n\) half-lives, the fraction remaining is \((1/2)^n\). After two half-lives, only 25% of the original shock persists.
Exercise 6. Show that the conditional variance \(\text{Var}(r_t | r_0) = \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa t})\) is monotonically increasing in \(t\) and bounded above by \(\sigma^2/(2\kappa)\). Sketch the variance as a function of \(t\) for \(\kappa = 0.5\), \(\sigma = 0.02\).
Solution to Exercise 6
The conditional variance is \(V(t) = \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa t})\).
Monotonicity: Taking the derivative with respect to \(t\):
Since \(V'(t) > 0\) for all \(t > 0\), the variance is strictly increasing in \(t\).
Upper bound: As \(t \to \infty\), \(e^{-2\kappa t} \to 0\), so:
The variance is bounded above by \(\sigma^2/(2\kappa)\) and approaches this bound asymptotically but never reaches it for finite \(t\).
Sketch for \(\kappa = 0.5\), \(\sigma = 0.02\): The upper bound is \(V_\infty = 0.0004/1.0 = 0.0004\). The curve \(V(t) = 0.0004(1 - e^{-t})\) starts at 0, rises steeply at first (slope \(\sigma^2 = 0.0004\) at \(t = 0\)), then decelerates and asymptotes to \(0.0004\). By \(t = 3\) (about three time constants \(1/(2\kappa) = 1\)), the variance has reached \(1 - e^{-3} \approx 95\%\) of its limiting value.
Exercise 7. The transition density \(p(r_t | r_0)\) is Gaussian with known mean and variance. Write the log-likelihood function for \(N\) equally spaced observations \(r_0, r_{\Delta t}, r_{2\Delta t}, \ldots\) and show that maximizing it is equivalent to an OLS regression of \(r_{t+\Delta t}\) on \(r_t\).
Solution to Exercise 7
Given \(N\) equally spaced observations \(r_0, r_{\Delta t}, r_{2\Delta t}, \ldots, r_{N\Delta t}\), the transition density is:
where \(\mu_i = \theta + (r_{i\Delta t} - \theta)e^{-\kappa\Delta t}\) and \(v^2 = \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa\Delta t})\).
The log-likelihood is:
Define \(\phi = e^{-\kappa\Delta t}\), \(a = \theta(1 - \phi)\), and \(b = \phi\). Then \(\mu_i = a + b\,r_{i\Delta t}\), and:
This is the log-likelihood of a linear regression model \(Y_i = a + b\,X_i + \epsilon_i\) where \(Y_i = r_{(i+1)\Delta t}\), \(X_i = r_{i\Delta t}\), and \(\epsilon_i \sim \mathcal{N}(0, v^2)\).
Maximizing \(\ell\) with respect to \(a\) and \(b\) (for fixed \(v^2\)) is equivalent to minimizing \(\sum(Y_i - a - bX_i)^2\), which is exactly the ordinary least squares (OLS) criterion. The OLS estimators are:
The residual variance \(\hat{v}^2 = \frac{1}{N}\sum(Y_i - \hat{a} - \hat{b}X_i)^2\) completes the MLE. The Vasicek parameters are then recovered: \(\hat{\kappa} = -\ln\hat{b}/\Delta t\), \(\hat{\theta} = \hat{a}/(1-\hat{b})\), \(\hat{\sigma} = \hat{v}\sqrt{2\hat{\kappa}/(1-e^{-2\hat{\kappa}\Delta t})}\).