Variance Process Moments¶
The moments of the CIR variance process -- both conditional (given \(v_t\)) and unconditional (stationary) -- are essential building blocks in the Heston framework. The conditional mean \(\mathbb{E}[v_T | v_t]\) reveals the mean-reversion dynamics, the conditional variance \(\text{Var}(v_T | v_t)\) quantifies uncertainty about future variance levels, and the integrated variance \(\mathbb{E}[\int_t^T v_s\,ds | v_t]\) determines variance swap fair values. All these moments have closed-form expressions, derivable either from the non-central chi-squared transition density or directly from the SDE.
This section derives the first two conditional moments by two methods (SDE-based and distribution-based), extends to higher moments via a recursive formula, and computes the moments of the integrated variance. We assume familiarity with the CIR transition density.
Learning Objectives
After completing this section, you should be able to:
- Derive \(\mathbb{E}[v_T | v_t]\) and \(\text{Var}(v_T | v_t)\) from the CIR SDE
- Verify these moments using the non-central chi-squared distribution
- Compute higher conditional moments via the recursive ODE method
- Derive \(\mathbb{E}[\int_t^T v_s\,ds | v_t]\) and \(\text{Var}(\int_t^T v_s\,ds | v_t)\)
- State the stationary (unconditional) mean, variance, skewness, and kurtosis
Conditional Mean¶
Derivation from the SDE¶
Proposition: Conditional Mean of v_T
For the CIR process \(dv_t = \kappa(\theta - v_t)\,dt + \sigma_v\sqrt{v_t}\,dW_t\):
where \(\tau = T - t\).
Proof. Define \(m(\tau) = \mathbb{E}[v_{t+\tau} | v_t]\). Taking expectations of the SDE:
The stochastic integral \(\mathbb{E}[\sigma_v\sqrt{v_s}\,dW_s] = 0\) vanishes because it is a martingale. This is a first-order linear ODE with initial condition \(m(0) = v_t\). The solution is:
\(\square\)
Interpretation¶
The conditional mean is a weighted average of the current variance \(v_t\) and the long-run level \(\theta\), with the weight on \(v_t\) decaying exponentially at rate \(\kappa\):
- At \(\tau = 0\): \(\mathbb{E}[v_T | v_t] = v_t\) (trivially)
- As \(\tau \to \infty\): \(\mathbb{E}[v_T | v_t] \to \theta\) (mean reversion)
- At \(\tau = \ln 2/\kappa\) (the half-life): \(\mathbb{E}[v_T | v_t] = \frac{1}{2}(v_t + \theta)\)
Verification from the Non-Central Chi-Squared Distribution¶
From the transition density, \(v_T | v_t \sim c\,\chi^2_\delta(\lambda)\) where \(c = \sigma_v^2(1-e^{-\kappa\tau})/(4\kappa)\), \(\delta = 4\kappa\theta/\sigma_v^2\), and \(\lambda = 4\kappa v_t e^{-\kappa\tau}/(\sigma_v^2(1-e^{-\kappa\tau}))\). The mean of a non-central chi-squared is \(\delta + \lambda\), so:
which matches the SDE-derived formula. \(\square\)
Conditional Variance¶
Proposition: Conditional Variance of v_T
For the CIR process:
Proof. Define \(s(\tau) = \mathbb{E}[v_{t+\tau}^2 | v_t]\). Applying Ito's lemma to \(v^2\):
Taking expectations:
where \(m(\tau) = \mathbb{E}[v_{t+\tau}|v_t] = \theta + (v_t-\theta)e^{-\kappa\tau}\). This is a first-order linear ODE in \(s\) with initial condition \(s(0) = v_t^2\).
Solving via the integrating factor \(e^{2\kappa\tau}\):
After evaluating the integral and simplifying:
Since \(\text{Var}(v_T|v_t) = s(\tau) - m(\tau)^2\) and the first term equals \(m(\tau)^2\), the result follows. \(\square\)
Interpretation¶
The conditional variance has two components:
- Initial-condition-dependent term: \(v_t \cdot \frac{\sigma_v^2 e^{-\kappa\tau}}{\kappa}(1 - e^{-\kappa\tau})\), which is proportional to \(v_t\) and decays with \(\tau\)
- Stationary term: \(\theta \cdot \frac{\sigma_v^2}{2\kappa}(1 - e^{-\kappa\tau})^2\), which grows from zero and saturates at \(\theta\sigma_v^2/(2\kappa)\)
For short \(\tau\): \(\text{Var}(v_T|v_t) \approx v_t\sigma_v^2\tau\) (the variance grows linearly, proportional to the current level).
For large \(\tau\): \(\text{Var}(v_T|v_t) \to \theta\sigma_v^2/(2\kappa)\) (the stationary variance, independent of \(v_t\)).
Higher Conditional Moments¶
Higher conditional moments \(\mathbb{E}[v_T^n | v_t]\) can be computed recursively.
Proposition: Recursive Moment Formula
Define \(m_n(\tau) = \mathbb{E}[v_{t+\tau}^n | v_t]\). Then for \(n \geq 1\):
with initial condition \(m_n(0) = v_t^n\).
Proof. Apply Ito's lemma to \(v^n\):
Taking expectations and noting \(\mathbb{E}[dW_t] = 0\):
\(\square\)
The recursion shows that \(m_n\) depends on \(m_{n-1}\), so moments can be computed sequentially: \(m_0 = 1 \to m_1 \to m_2 \to m_3 \to \cdots\)
Third Conditional Moment
The third moment satisfies:
This ODE can be solved given \(m_2(\tau)\) (already computed), but the closed-form expression is lengthy. In practice, the first two moments suffice for most applications (moment matching in simulation schemes, variance swap pricing).
Integrated Variance Moments¶
The integrated variance \(\int_t^T v_s\,ds\) appears in variance swap pricing and in the Broadie-Kaya exact simulation scheme. Its moments have closed-form expressions.
Proposition: Integrated Variance Moments
Define \(V_\tau = \int_t^{t+\tau} v_s\,ds\). Then:
Proof (mean). Integrate the conditional mean formula:
\(\square\)
Variance Swap Fair Value
The fair strike of a variance swap with maturity \(T\) is:
For short maturities (\(\kappa T \ll 1\)), \(K_{\text{var}} \approx v_0\). For long maturities (\(\kappa T \gg 1\)), \(K_{\text{var}} \approx \theta\). This formula is one of the rare closed-form results for exotic pricing under Heston.
Stationary (Unconditional) Moments¶
As \(\tau \to \infty\), the CIR process reaches its stationary Gamma distribution (see CIR transition density). The unconditional moments are:
| Moment | Formula | Value (typical parameters) |
|---|---|---|
| Mean | \(\theta\) | \(0.04\) |
| Variance | \(\frac{\theta\sigma_v^2}{2\kappa}\) | \(\frac{0.04 \times 0.25}{4} = 0.0025\) |
| Skewness | \(\frac{\sigma_v}{\sqrt{\kappa\theta}} \cdot \sqrt{2}\) | \(\frac{0.5}{\sqrt{0.08}} \cdot 1.414 = 2.5\) |
| Excess kurtosis | \(\frac{3\sigma_v^2}{\kappa\theta}\) | \(\frac{0.75}{0.08} = 9.375\) |
The typical parameters used: \(\kappa = 2\), \(\theta = 0.04\), \(\sigma_v = 0.5\).
Heavy Tails of the Stationary Distribution
The stationary distribution of the CIR process is a Gamma distribution, which has positive skewness and excess kurtosis. With typical calibrated parameters, the skewness exceeds 2 and the kurtosis exceeds 9, indicating a strongly right-skewed, heavy-tailed distribution for the variance. This means variance spikes (e.g., during market crises) are much more likely than a Gaussian model would suggest.
Worked Example: Moment Calculations¶
Full Moment Calculation
Parameters: \(\kappa = 2.0\), \(\theta = 0.04\), \(\sigma_v = 0.5\), \(v_t = 0.06\), \(\tau = 0.25\) years.
Conditional mean:
Conditional variance:
Standard deviation: \(\sqrt{0.002176} \approx 0.0467\).
Coefficient of variation: \(0.0467 / 0.05213 \approx 0.90\). The standard deviation is 90\% of the mean, confirming the high uncertainty in future variance levels.
Integrated variance mean:
The annualized variance swap strike is \(K_{\text{var}} = 0.01394/0.25 = 0.05574\), corresponding to a volatility of \(\sqrt{0.05574} \approx 23.6\%\).
Summary¶
The CIR variance process has closed-form conditional moments at all orders, derivable from a recursive ODE system or from the non-central chi-squared transition density. The conditional mean \(\mathbb{E}[v_T|v_t] = \theta + (v_t - \theta)e^{-\kappa\tau}\) exhibits exponential mean-reversion, while the conditional variance captures both the current-state-dependent and stationary components. The integrated variance \(\int_t^T v_s\,ds\) has a closed-form mean that directly gives the variance swap fair value. The stationary distribution is Gamma with mean \(\theta\), variance \(\theta\sigma_v^2/(2\kappa)\), and substantial positive skewness.
The next section examines the mean-reversion mechanism in detail, including the half-life, the term structure of variance, and the distinction between physical and risk-neutral mean-reversion parameters.
Exercises¶
Exercise 1. Derive \(\mathbb{E}[V_T \mid V_t] = \theta + (V_t - \theta)e^{-\kappa\tau}\) by solving the ODE \(\frac{d}{d\tau}\mathbb{E}[V_{t+\tau}] = \kappa(\theta - \mathbb{E}[V_{t+\tau}])\) with initial condition \(\mathbb{E}[V_t] = V_t\).
Solution to Exercise 1
We solve the ODE \(\frac{d}{d\tau}m(\tau) = \kappa(\theta - m(\tau))\) with initial condition \(m(0) = V_t\), where \(m(\tau) = \mathbb{E}[V_{t+\tau} \mid V_t]\).
Step 1: Rewrite the ODE.
This is a first-order linear ODE with constant coefficients.
Step 2: Find the integrating factor. The integrating factor is \(\mu(\tau) = e^{\kappa\tau}\). Multiplying both sides:
Step 3: Integrate both sides from \(0\) to \(\tau\).
Step 4: Solve for \(m(\tau)\).
Rearranging:
This confirms the conditional mean formula. The derivation uses only the linearity of expectation and the fact that \(\mathbb{E}[\sigma_v\sqrt{V_s}\,dW_s] = 0\) (the stochastic integral is a martingale with zero expectation).
Exercise 2. Compute the conditional variance \(\operatorname{Var}(V_T \mid V_t) = V_t\frac{\sigma_v^2}{\kappa}(e^{-\kappa\tau} - e^{-2\kappa\tau}) + \frac{\theta\sigma_v^2}{2\kappa}(1 - e^{-\kappa\tau})^2\) for \(\kappa = 2\), \(\theta = 0.04\), \(\sigma_v = 0.3\), \(V_t = 0.04\), \(\tau = 1\). Is the conditional standard deviation of \(V_T\) large relative to its mean?
Solution to Exercise 2
With \(\kappa = 2\), \(\theta = 0.04\), \(\sigma_v = 0.3\), \(V_t = 0.04\), \(\tau = 1\):
First compute the needed quantities:
Conditional mean:
Since \(V_t = \theta\), the expected value remains at \(\theta\) for all \(\tau\).
Conditional variance:
Computing each term:
First term:
Second term:
Total:
Conditional standard deviation:
Relative to the mean: The coefficient of variation is:
The conditional standard deviation is about 74% of the mean. This is large, indicating substantial uncertainty about future variance levels even when starting at the long-run mean. This high variability is characteristic of the CIR process with its square-root diffusion and is consistent with the empirically observed leptokurtic (heavy-tailed) distribution of realized variance.
Exercise 3. The integrated variance \(\bar{V} = \frac{1}{\tau}\int_t^{t+\tau} V_s\,ds\) has conditional mean \(\mathbb{E}[\bar{V} \mid V_t] = \theta + (V_t - \theta)\frac{1 - e^{-\kappa\tau}}{\kappa\tau}\). Compute this for \(\tau = 0.25\) (3-month variance swap) with \(V_t = 0.05\), \(\theta = 0.04\), \(\kappa = 2\).
Solution to Exercise 3
The conditional mean of the average variance (variance swap fair strike) is:
With \(\tau = 0.25\), \(V_t = 0.05\), \(\theta = 0.04\), \(\kappa = 2\):
The 3-month variance swap fair strike is \(K_{\text{var}} = 0.04787\), corresponding to a fair volatility strike of \(\sqrt{0.04787} = 21.88\%\).
Note that this is between \(V_t = 0.05\) (current annualized variance, or 22.36% volatility) and \(\theta = 0.04\) (long-run variance, or 20% volatility). The factor \(0.787\) indicates that over 3 months, the average variance is still dominated by the current variance level, with only partial mean reversion toward \(\theta\).
Exercise 4. The stationary variance of \(V_t\) is \(\operatorname{Var}_\infty(V) = \theta\sigma_v^2/(2\kappa)\). Compute the coefficient of variation \(\text{CV} = \sqrt{\operatorname{Var}_\infty(V)}/\theta\) for the parameters in Exercise 2. A CV greater than 1 indicates very heavy-tailed variance dynamics.
Solution to Exercise 4
The stationary variance of \(V_t\) is:
With \(\kappa = 2\), \(\theta = 0.04\), \(\sigma_v = 0.3\):
The coefficient of variation is:
A coefficient of variation of 0.75 means the standard deviation is 75% of the mean. This is less than 1, so the variance dynamics are not extremely heavy-tailed, but they are still substantially more variable than a Gaussian model would suggest.
For comparison, if we increased \(\sigma_v\) to \(0.5\) (keeping other parameters fixed):
With \(\sigma_v = 0.5\), the CV exceeds 1, indicating very heavy-tailed variance dynamics. The threshold CV \(= 1\) corresponds to \(\sigma_v = \sqrt{2\kappa\theta} = \sqrt{0.16} = 0.4\), which is also the Feller condition boundary. When \(\sigma_v > \sqrt{2\kappa\theta}\) (Feller condition violated), the CV exceeds 1 and the variance process can touch zero.
Exercise 5. Show that \(\mathbb{E}[\int_t^T V_s\,ds \mid V_t] = \theta\tau + (V_t - \theta)\frac{1 - e^{-\kappa\tau}}{\kappa}\) by integrating \(\mathbb{E}[V_s \mid V_t]\) from \(t\) to \(T\). This formula gives the fair value of a variance swap starting at \(t\) with maturity \(T\).
Solution to Exercise 5
Starting from the conditional mean \(\mathbb{E}[V_s \mid V_t] = \theta + (V_t - \theta)e^{-\kappa(s-t)}\) for \(s \geq t\):
The interchange of expectation and integration is justified by Fubini's theorem (the integrand is non-negative and integrable).
Substituting the conditional mean:
Using the substitution \(u = s - t\) with \(du = ds\), and the limits changing to \(u \in [0, \tau]\) where \(\tau = T - t\):
This is the variance swap fair value formula. For a variance swap with strike \(K_{\text{var}}\) and maturity \(T\), the annualized fair strike is:
For short maturities (\(\kappa\tau \ll 1\)), the factor \(\frac{1 - e^{-\kappa\tau}}{\kappa\tau} \approx 1\), so \(K_{\text{var}} \approx V_t\). For long maturities (\(\kappa\tau \gg 1\)), the factor vanishes and \(K_{\text{var}} \approx \theta\).
Exercise 6. For the higher moments, the conditional third central moment of \(V_T\) is needed for skewness. Without deriving it fully, explain why the CIR process has positive skewness (heavier right tail than left) and relate this to the square-root diffusion.
Solution to Exercise 6
The CIR process \(dV_t = \kappa(\theta - V_t)\,dt + \sigma_v\sqrt{V_t}\,dW_t\) has positive skewness for two related reasons:
1. The square-root diffusion creates asymmetric volatility. The diffusion coefficient \(\sigma_v\sqrt{V_t}\) is an increasing function of \(V_t\). When \(V_t\) is above its mean \(\theta\), the diffusion is larger, producing larger random fluctuations that can push \(V_t\) even further above the mean. When \(V_t\) is below the mean, the diffusion is smaller, limiting how far below the mean the process can go. This creates a natural asymmetry: upward excursions are amplified while downward excursions are dampened.
2. The non-negativity constraint truncates the left tail. The process satisfies \(V_t \geq 0\) a.s. (and \(V_t > 0\) a.s. under the Feller condition). This provides a hard lower bound but no upper bound, automatically creating a right-skewed distribution.
3. The stationary distribution is Gamma. The CIR process converges to a \(\text{Gamma}(\alpha, \beta)\) stationary distribution with \(\alpha = 2\kappa\theta/\sigma_v^2\) and \(\beta = 2\kappa/\sigma_v^2\). The skewness of the Gamma distribution is \(2/\sqrt{\alpha}\), which is always positive. For typical parameters (\(\alpha\) in the range 1--4), the skewness ranges from 1 to 2, confirming a strongly right-skewed distribution.
Connection to the conditional third central moment: The positive skewness at the distributional level implies that the conditional third central moment \(\mathbb{E}[(V_T - \mathbb{E}[V_T|V_t])^3 | V_t]\) is positive. Heuristically, from the recursive moment ODE, the third moment depends on the second moment and inherits the asymmetry of the square-root diffusion. The detailed derivation (through the \(n=3\) case of the recursive formula) produces a lengthy but positive expression for all positive parameter values.
Financial implication: The positive skewness of the variance process means that variance spikes (large upward moves in \(V_t\)) are more common than variance troughs (large downward moves) relative to a symmetric model. This is consistent with the empirical observation that realized volatility has a long right tail -- extreme volatility spikes during market crises are much more dramatic than periods of unusually low volatility.