SDE Simulation¶

Most SDEs lack closed-form solutions, making numerical simulation essential for practical applications. This section develops simulation methods from first principles, starting with an intuitive discrete approximation and progressing to rigorous numerical schemes.

Every Python code block on this page is fully standalone: a reader can copy any single block into a fresh Python file or notebook cell and run it immediately without relying on earlier blocks.

Learning Goals

After completing this section you should be able to:

understand how discrete random walks approximate continuous SDEs
implement the Euler-Maruyama scheme for general SDEs
explain strong and weak convergence and their practical implications
apply the Milstein scheme when the diffusion derivative is available
use exact simulation for GBM and Ornstein-Uhlenbeck processes
simulate multidimensional SDEs with correlated Brownian motions
apply variance reduction techniques to improve Monte Carlo efficiency

1. From Coin Flips to Brownian Motion¶

Discrete Random Walk¶

Before simulating continuous SDEs, we build intuition through a discrete random walk that approximates Brownian motion.

Setup: Divide the time interval $[0, T]$ into $n$ equal steps of size $\Delta t = T/n$.

\[ \begin{array}{ccccccccccc} S_0 && S_1 && S_2 && \cdots && S_n \\ t_0 & < & t_1 & < & t_2 & < & \cdots & < & t_n \end{array} \]

Discretizing GBM¶

For the geometric Brownian motion SDE

\[ \frac{dS}{S} = \mu\,dt + \sigma\,dB_t \]

we discretize at times $t_0 < t_1 < \cdots < t_n < t_{n+1}$:

\[ \begin{array}{ccccccccccccc} \frac{dS}{S} & = & \mu & dt & + & \sigma & dB_t \\ \downarrow && \downarrow & \downarrow && \downarrow & \downarrow \\ \frac{S_{n+1} - S_n}{S_n} && \mu & \Delta t && \sigma & \Delta B_n \end{array} \]

where $\Delta B_n = B_{t_{n+1}} - B_{t_n} \sim \mathcal{N}(0, \Delta t)$.

Key insight: A simple random walk uses $\pm\sqrt{\Delta t}$ increments. As the step size shrinks, this converges to Brownian motion (Donsker's theorem). The actual Brownian increments are Gaussian, not Bernoulli, but the coin-flip model provides the correct intuition.

Paper-and-Pencil Simulation¶

Example: Simulate GBM with $S_0 = 100$, $\mu = 0.10$, $\sigma = 0.30$, $T = 1$, $n = 10$ (so $\Delta t = 1/10$).

Updating rule:

\[ \begin{array}{ccccccccccccc} \frac{dS}{S} & = & \mu & dt & + & \sigma & dB_t \\ \uparrow && \uparrow & \uparrow && \uparrow & \uparrow \\ \frac{S_{n+1} - S_n}{S_n} && 0.10 & \frac{1}{10} && 0.30 & \pm\sqrt{\frac{1}{10}} \end{array} \]

Simulation table:

\[ \begin{array}{lrrrrrrrrrrrrrr} \text{Time} & 0/10 & 1/10 & 2/10 & 3/10 \\ \text{Coin flip} & - & H & H & T \\ \text{Conversion} & - & 1 & 1 & -1 \\ \text{Cum sum} & 0 & 1 & 2 & 1 \\ B_t & 0 & \frac{1}{\sqrt{10}} & \frac{2}{\sqrt{10}} & \frac{1}{\sqrt{10}} \\ dt & - & \frac{1}{10} & \frac{1}{10} & \frac{1}{10} \\ dB_t = B_t - B_{t-dt} & - & \frac{1}{\sqrt{10}} & \frac{1}{\sqrt{10}} & \frac{-1}{\sqrt{10}} \\ \mu \cdot dt + \sigma \cdot dB_t & - & \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} - \frac{0.3}{\sqrt{10}} \\ S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) & - & 10.4868 & 11.5866 & -10.3602 \\ S_t = S_{t-dt} + S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) & 100 & 110.4868 & 122.0734 & 111.7132 \\ \\ \text{Time} & 4/10 & 5/10 & 6/10 & 7/10 \\ \text{Coin flip} & H & T & T & H \\ \text{Conversion} & 1 & -1 & -1 & 1 \\ \text{Cum sum} & 2 & 1 & 0 & 1 \\ B_t & \frac{2}{\sqrt{10}} & \frac{1}{\sqrt{10}} & \frac{0}{\sqrt{10}} & \frac{1}{\sqrt{10}} \\ dt & \frac{1}{10} & \frac{1}{10} & \frac{1}{10} & \frac{1}{10} \\ dB_t = B_t - B_{t-dt} & \frac{1}{\sqrt{10}} & \frac{-1}{\sqrt{10}} & \frac{-1}{\sqrt{10}} & \frac{1}{\sqrt{10}} \\ \mu \cdot dt + \sigma \cdot dB_t & \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} - \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} - \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} \\ S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) & 11.7152 & -10.4752 & -9.5861 & 10.8399 \\ S_t = S_{t-dt} + S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) & 123.4284 & 112.9532 & 103.3671 & 114.2070 \\ \\ \text{Time} && 8/10 & 9/10 & 10/10 \\ \text{Coin flip} && H & H & T \\ \text{Conversion} && 1 & 1 & -1 \\ \text{Cum sum} && 2 & 3 & 2 \\ B_t && \frac{2}{\sqrt{10}} & \frac{3}{\sqrt{10}} & \frac{2}{\sqrt{10}} \\ dt && \frac{1}{10} & \frac{1}{10} & \frac{1}{10} \\ dB_t = B_t - B_{t-dt} && \frac{1}{\sqrt{10}} & \frac{1}{\sqrt{10}} & \frac{-1}{\sqrt{10}} \\ \mu \cdot dt + \sigma \cdot 
dB_t && \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} + \frac{0.3}{\sqrt{10}} & \frac{0.1}{10} - \frac{0.3}{\sqrt{10}} \\ S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) && 11.9767 & 13.2327 & -11.8320 \\ S_t = S_{t-dt} + S_{t-dt} \cdot (\mu \cdot dt + \sigma \cdot dB_t) && 126.1837 & 139.4164 & 127.5844 \end{array} \]

Result: Starting from $S_0 = 100$, we end at $S_T \approx 127.58$ after 10 coin flips.
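The table can be reproduced programmatically. The sketch below hard-codes the flip sequence from the table (H = +1, T = -1); the variable names are illustrative and not part of any library.

```python
import math

# Parameters from the worked example
S0, mu, sigma, T, n = 100.0, 0.10, 0.30, 1.0, 10
dt = T / n

# Coin flips copied from the table: H -> +1, T -> -1
flips = "HHTHTTHHHT"
signs = [1 if c == "H" else -1 for c in flips]

S = [S0]
for sign in signs:
    dB = sign * math.sqrt(dt)  # coin-flip increment of size sqrt(dt)
    S.append(S[-1] * (1.0 + mu * dt + sigma * dB))

for k, s in enumerate(S):
    print(f"t = {k}/{n}: S = {s:.4f}")
```

Changing the `flips` string gives a different path with the same update rule.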

From Discrete to Continuous¶

As $n \to \infty$ (i.e., $\Delta t \to 0$):

The random walk converges to Brownian motion (by Donsker's theorem)
The discrete stock prices converge to the solution of the SDE
Scaled coin flips $\pm\sqrt{\Delta t}$ are replaced by Gaussian increments $\mathcal{N}(0, \Delta t)$

This limiting process is the foundation of Euler-Maruyama discretization.
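The convergence of the scaled coin-flip walk can be checked empirically. The sketch below is an illustration under simple assumptions: it samples only the terminal value, using the fact that the number of heads in $n$ fair flips is Binomial($n$, 1/2), and reports the sample variance and excess kurtosis (which is 0 for a Gaussian and $-2/n$ for the walk).

```python
import numpy as np

rng = np.random.default_rng(0)
T = 1.0
num_paths = 100_000

var_by_n = {}
for n in (10, 100, 1000):
    dt = T / n
    # Number of heads in n fair flips; the walk ends at (heads - tails) * sqrt(dt)
    heads = rng.binomial(n, 0.5, size=num_paths)
    B_T = (2 * heads - n) * np.sqrt(dt)

    # Standardize and measure how far the fourth moment is from the Gaussian value 3
    z = (B_T - B_T.mean()) / B_T.std()
    excess_kurtosis = np.mean(z**4) - 3.0

    var_by_n[n] = B_T.var()
    print(f"n = {n:5d}: var = {var_by_n[n]:.4f}, "
          f"excess kurtosis = {excess_kurtosis:+.4f}")
```

The variance stays close to $T$ for every $n$, while the excess kurtosis shrinks toward the Gaussian value as $n$ grows.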

2. Euler-Maruyama Scheme¶

Derivation¶

For a general SDE

\[ dX_t = b(t, X_t)\,dt + \sigma(t, X_t)\,dW_t, \quad X_0 = x \]

integrate from $t_n$ to $t_{n+1}$:

\[ X_{t_{n+1}} = X_{t_n} + \int_{t_n}^{t_{n+1}} b(s, X_s)\,ds + \int_{t_n}^{t_{n+1}} \sigma(s, X_s)\,dW_s \]

Euler approximation: Replace $X_s$ with $X_{t_n}$ (constant on $[t_n, t_{n+1}]$):

\[ X_{t_{n+1}} \approx X_{t_n} + b(t_n, X_{t_n})\Delta t + \sigma(t_n, X_{t_n})\Delta W_n \]

where $\Delta t = t_{n+1} - t_n$ and $\Delta W_n = W_{t_{n+1}} - W_{t_n} \sim \mathcal{N}(0, \Delta t)$.

Euler-Maruyama scheme:

\[ X_{n+1} = X_n + b(t_n, X_n)\Delta t + \sigma(t_n, X_n)\Delta W_n \]

Algorithm¶

flowchart LR
A["X_n"] --> B["compute drift b(X_n)"]
A --> C["compute diffusion σ(X_n)"]
D["ΔW ~ N(0,Δt)"] --> E[" "]
B --> E
C --> E
E --> F["X_{n+1} = X_n + b Δt + σ ΔW"]

Euler-Maruyama Algorithm

Set time grid: $t_n = n\Delta t$ for $n = 0, 1, \ldots, N$
Initialize: $X_0 = x$
For $n = 0, 1, \ldots, N-1$:
- Generate $\Delta W_n \sim \mathcal{N}(0, \Delta t)$
- Update: $X_{n+1} = X_n + b(t_n, X_n)\Delta t + \sigma(t_n, X_n)\Delta W_n$

Example: Geometric Brownian Motion¶

```python
import numpy as np
import matplotlib.pyplot as plt


def euler_maruyama(b, sigma, X0, T, N, num_paths=1, seed=None):
    """
    Simulate the SDE dX_t = b(t, X_t) dt + sigma(t, X_t) dW_t
    using the Euler-Maruyama scheme.

    Parameters
    ----------
    b : callable
        Drift function b(t, x).
    sigma : callable
        Diffusion function sigma(t, x).
    X0 : float
        Initial value.
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of simulated sample paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid of shape (N + 1,).
    X : np.ndarray
        Simulated paths of shape (num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()
            X[path, n + 1] = (
                X[path, n]
                + b(t[n], X[path, n]) * dt
                + sigma(t[n], X[path, n]) * dW
            )

    return t, X


# === Geometric Brownian motion ===
# dS_t = mu S_t dt + sig S_t dW_t
mu = 0.10
sig = 0.20
S0 = 100.0
T = 1.0
N = 1000
num_paths = 20


def b(t, S):
    return mu * S


def sigma(t, S):
    return sig * S


t, S = euler_maruyama(b, sigma, S0, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(num_paths):
    ax.plot(t, S[i], alpha=0.7)

ax.axhline(S0, linestyle="--", alpha=0.6, label=r"$S_0$")
ax.set_xlabel("Time $t$")
ax.set_ylabel("Stock price $S_t$")
ax.set_title("Geometric Brownian Motion via Euler-Maruyama")
ax.grid(True, alpha=0.3)
ax.legend()
plt.tight_layout()
plt.show()
```

Figure: Geometric Brownian Motion via Euler-Maruyama

Example: Ornstein-Uhlenbeck Process¶

```python
import numpy as np
import matplotlib.pyplot as plt


def euler_maruyama(b, sigma, X0, T, N, num_paths=1, seed=None):
    """
    Simulate the SDE dX_t = b(t, X_t) dt + sigma(t, X_t) dW_t
    using the Euler-Maruyama scheme.
    """
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()
            X[path, n + 1] = (
                X[path, n]
                + b(t[n], X[path, n]) * dt
                + sigma(t[n], X[path, n]) * dW
            )

    return t, X


# === Ornstein-Uhlenbeck process ===
# dX_t = kappa (theta - X_t) dt + sig dW_t
kappa = 2.0
theta = 1.0
sig = 0.30
X0 = 0.50
T = 2.0
N = 1000
num_paths = 20


def b(t, x):
    return kappa * (theta - x)


def sigma(t, x):
    return sig


t, X = euler_maruyama(b, sigma, X0, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(num_paths):
    ax.plot(t, X[i], alpha=0.7)

ax.axhline(theta, linestyle="--", linewidth=2, label=rf"Long-run mean $\theta={theta}$")
ax.set_xlabel("Time $t$")
ax.set_ylabel(r"$X_t$")
ax.set_title("Ornstein-Uhlenbeck Process via Euler-Maruyama")
ax.grid(True, alpha=0.3)
ax.legend()
plt.tight_layout()
plt.show()
```

Figure: Ornstein-Uhlenbeck Process via Euler-Maruyama

3. Convergence Analysis¶

Strong Convergence¶

Definition: A numerical scheme has strong convergence of order $\gamma$ if

\[ \mathbb{E}[|X_T - X_T^{\Delta t}|] = O(\Delta t^\gamma) \]

where $X_T$ is the true solution and $X_T^{\Delta t}$ is the numerical approximation.

Strong convergence measures pathwise accuracy: how closely the numerical path tracks a specific realization of the SDE when driven by the same Brownian increments.

Theorem: Under Lipschitz and growth conditions on $b$ and $\sigma$, the Euler-Maruyama scheme has strong order $\gamma = 0.5$:

\[ \mathbb{E}[|X_T - X_T^{\Delta t}|] \leq C\sqrt{\Delta t} \]

Interpretation: To reduce the strong error by a factor of 10, we need $\Delta t \to \Delta t / 100$ (100 times more steps).

Weak Convergence¶

Definition: A scheme has weak convergence of order $\beta$ if for all sufficiently smooth functions $g$:

\[ |\mathbb{E}[g(X_T)] - \mathbb{E}[g(X_T^{\Delta t})]| = O(\Delta t^\beta) \]

Weak convergence measures distributional accuracy: how closely the numerical approximation reproduces expected values of functionals of the process. This is the relevant notion for pricing applications, where one computes $\mathbb{E}[g(X_T)]$ (e.g., an option payoff).

Theorem: Under sufficient smoothness and growth conditions on $b$, $\sigma$, and $g$, Euler-Maruyama has weak order $\beta = 1.0$:

\[ |\mathbb{E}[g(X_T)] - \mathbb{E}[g(X_T^{\Delta t})]| \leq C\Delta t \]

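Practical implication: For computing expectations (e.g., option prices), the Euler-Maruyama error shrinks linearly in $\Delta t$, faster than the $\sqrt{\Delta t}$ rate for pathwise accuracy. The two convergence notions are distinct: strong order 0.5 only guarantees weak order at least 0.5 (for Lipschitz $g$), so weak order 1 requires a separate argument.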
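The weak rate can be checked without Monte Carlo noise in one special case. For GBM with test function $g(x) = x$, each Euler-Maruyama step multiplies $S_n$ by $1 + \mu\Delta t + \sigma\Delta W_n$, and $\Delta W_n$ is independent of $S_n$ with mean zero, so $\mathbb{E}[S_N] = S_0(1 + \mu\Delta t)^N$, while the exact mean is $S_0 e^{\mu T}$. The sketch below uses this closed form; the choice of $g$ and the parameter values are assumptions made for illustration.

```python
import math

S0, mu, T = 100.0, 0.10, 1.0
exact_mean = S0 * math.exp(mu * T)  # E[S_T] for GBM

N_values = [10, 20, 40, 80, 160, 320, 640]
weak_errors = []
for N in N_values:
    dt = T / N
    em_mean = S0 * (1.0 + mu * dt) ** N  # E[S_N] under Euler-Maruyama
    weak_errors.append(abs(exact_mean - em_mean))

# Slope of log(error) versus log(dt), using the coarsest and finest grids
estimated_weak_order = (
    math.log(weak_errors[0] / weak_errors[-1])
    / math.log(N_values[-1] / N_values[0])
)

for N, err in zip(N_values, weak_errors):
    print(f"N = {N:4d}: weak error = {err:.6f}")
print(f"Estimated weak order: {estimated_weak_order:.3f}")
```

Halving $\Delta t$ roughly halves the error, consistent with weak order 1. This checks a single test function only; it is not a proof of the general rate.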

Numerical Verification¶

To test strong convergence, we must compare Euler-Maruyama against the exact solution using the same Brownian path. We generate a fine reference path and construct coarser approximations by aggregating increments. The Brownian increments for the coarse grid are constructed by summing the fine-grid increments so that both schemes use the same underlying Brownian path.

```python
import numpy as np
import matplotlib.pyplot as plt


def convergence_test_gbm(num_paths=5000, seed=123):
    """
    Numerically verify strong convergence of Euler-Maruyama for GBM
    by comparing against the exact solution using the same Brownian paths.
    """
    rng = np.random.default_rng(seed)

    mu = 0.10
    sig = 0.20
    S0 = 100.0
    T = 1.0

    N_values = [10, 20, 40, 80, 160, 320, 640]
    N_ref = max(N_values)

    # Generate fine Brownian increments
    dt_ref = T / N_ref
    dW_ref = np.sqrt(dt_ref) * rng.normal(size=(num_paths, N_ref))

    # Exact terminal value from the same Brownian path
    W_T = dW_ref.sum(axis=1)
    S_exact = S0 * np.exp((mu - 0.5 * sig**2) * T + sig * W_T)

    errors = []

    for N in N_values:
        block = N_ref // N
        dt = T / N

        # Aggregate fine increments into coarse increments
        dW_coarse = dW_ref.reshape(num_paths, N, block).sum(axis=2)

        # Euler-Maruyama with coarse increments
        S = np.full(num_paths, S0, dtype=float)
        for n in range(N):
            S = S + mu * S * dt + sig * S * dW_coarse[:, n]

        error = np.mean(np.abs(S - S_exact))
        errors.append(error)

    dt_values = T / np.array(N_values, dtype=float)

    # Estimate convergence order via log-log regression
    coeffs = np.polyfit(np.log(dt_values), np.log(errors), 1)
    estimated_order = coeffs[0]

    fig, ax = plt.subplots(figsize=(8, 6))
    ax.loglog(dt_values, errors, "o-", label="Euler-Maruyama error")
    ax.loglog(
        dt_values,
        errors[0] * (dt_values / dt_values[0]) ** 0.5,
        "--",
        label=r"Reference slope $1/2$",
    )
    ax.set_xlabel(r"Step size $\Delta t$")
    ax.set_ylabel("Mean absolute terminal error")
    ax.set_title("Strong Convergence Test for Euler-Maruyama on GBM")
    ax.grid(True, alpha=0.3)
    ax.legend()
    plt.tight_layout()
    plt.show()

    print("N values:      ", N_values)
    print("Errors:        ", [f"{e:.6f}" for e in errors])
    print(f"Estimated strong order: {estimated_order:.3f}")


if __name__ == "__main__":
    convergence_test_gbm()
```

Figure: Strong Convergence Test for Euler-Maruyama on GBM

4. Milstein Scheme¶

Motivation¶

Euler-Maruyama has strong order 0.5. Can we do better? The idea is to include more terms from the Itô-Taylor expansion.

Itô-Taylor Expansion¶

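To expand $X_{t+\Delta t}$ around $X_t$, apply Itô's lemma to the diffusion coefficient $\sigma(X_t)$ (the coefficients are taken to be time-independent here for simplicity):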

\[ \begin{align} dX_t &= b(X_t)\,dt + \sigma(X_t)\,dW_t \\ d\sigma(X_t) &= \sigma'(X_t)\,dX_t + \frac{1}{2}\sigma''(X_t)(dX_t)^2 \\ &= \sigma'(X_t)[b(X_t)\,dt + \sigma(X_t)\,dW_t] + \frac{1}{2}\sigma''(X_t)\sigma^2(X_t)\,dt \end{align} \]

Keeping terms up to order $\Delta t$:

\[ X_{t+\Delta t} = X_t + b(X_t)\Delta t + \sigma(X_t)\Delta W + \frac{1}{2}\sigma(X_t)\sigma'(X_t)[(\Delta W)^2 - \Delta t] \]

Milstein Scheme¶

\[ X_{n+1} = X_n + b(X_n)\Delta t + \sigma(X_n)\Delta W_n + \frac{1}{2}\sigma(X_n)\sigma'(X_n)[(\Delta W_n)^2 - \Delta t] \]

Key term: $(\Delta W_n)^2 - \Delta t$ captures the quadratic variation correction.
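The origin of this term can be sketched as follows, assuming time-independent coefficients and freezing $\sigma\sigma'$ at $X_{t_n}$. Euler-Maruyama replaces $\sigma(X_s)$ by $\sigma(X_{t_n})$ inside the stochastic integral; to leading order, the neglected part is

\[ \int_{t_n}^{t_{n+1}} \bigl[\sigma(X_s) - \sigma(X_{t_n})\bigr]\,dW_s \approx \sigma(X_{t_n})\,\sigma'(X_{t_n}) \int_{t_n}^{t_{n+1}} (W_s - W_{t_n})\,dW_s \]

Applying Itô's lemma to $(W_s - W_{t_n})^2$ evaluates the remaining integral in closed form:

\[ \int_{t_n}^{t_{n+1}} (W_s - W_{t_n})\,dW_s = \frac{1}{2}\bigl[(\Delta W_n)^2 - \Delta t\bigr] \]

The $-\Delta t$ is the Itô correction; it makes the term mean-zero, so the correction changes the path but not the expected drift.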

Convergence¶

Theorem: The Milstein scheme has:

Strong order $\gamma = 1.0$ (vs. 0.5 for Euler-Maruyama)
Weak order $\beta = 1.0$ (same as Euler-Maruyama)
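The two strong orders can be estimated side by side. The sketch below is one possible check, not a definitive benchmark: it reuses the GBM parameters from this page, drives both schemes and the exact solution with the same Brownian increments, and fits the slope of the log error against $\log \Delta t$. The seed and path count are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(42)
mu, sig, S0, T = 0.10, 0.20, 100.0, 1.0
num_paths = 4000
N_values = [10, 20, 40, 80, 160, 320]
N_ref = max(N_values)

# Fine Brownian increments shared by every grid and by the exact solution
dW_ref = np.sqrt(T / N_ref) * rng.normal(size=(num_paths, N_ref))
S_exact = S0 * np.exp((mu - 0.5 * sig**2) * T + sig * dW_ref.sum(axis=1))

err_em, err_mil = [], []
for N in N_values:
    dt = T / N
    # Sum fine increments in blocks to obtain the coarse-grid increments
    dW = dW_ref.reshape(num_paths, N, N_ref // N).sum(axis=2)

    S_em = np.full(num_paths, S0)
    S_mil = np.full(num_paths, S0)
    for n in range(N):
        S_em = S_em + mu * S_em * dt + sig * S_em * dW[:, n]
        # For GBM, sigma(S) * sigma'(S) = sig**2 * S
        S_mil = (
            S_mil + mu * S_mil * dt + sig * S_mil * dW[:, n]
            + 0.5 * sig**2 * S_mil * (dW[:, n] ** 2 - dt)
        )

    err_em.append(np.mean(np.abs(S_em - S_exact)))
    err_mil.append(np.mean(np.abs(S_mil - S_exact)))

# Fit log(error) = order * log(dt) + const
log_dt = np.log(T / np.array(N_values, dtype=float))
order_em = np.polyfit(log_dt, np.log(err_em), 1)[0]
order_mil = np.polyfit(log_dt, np.log(err_mil), 1)[0]

print(f"Estimated strong order, Euler-Maruyama: {order_em:.3f}")
print(f"Estimated strong order, Milstein:       {order_mil:.3f}")
```

The fitted slopes should land near 0.5 and 1.0 respectively, up to Monte Carlo noise.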

Example: GBM with Euler-Maruyama vs Milstein¶

```python
import numpy as np
import matplotlib.pyplot as plt


def euler_maruyama(b, sigma, X0, T, N, num_paths=1, seed=None):
    """Euler-Maruyama simulation for a scalar SDE."""
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()
            X[path, n + 1] = (
                X[path, n]
                + b(t[n], X[path, n]) * dt
                + sigma(t[n], X[path, n]) * dW
            )

    return t, X


def milstein(b, sigma, sigma_prime, X0, T, N, num_paths=1, seed=None):
    """Milstein simulation for a scalar SDE."""
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()
            x_n = X[path, n]
            correction = 0.5 * sigma(t[n], x_n) * sigma_prime(t[n], x_n) * (dW**2 - dt)
            X[path, n + 1] = (
                x_n + b(t[n], x_n) * dt + sigma(t[n], x_n) * dW + correction
            )

    return t, X


# === GBM parameters ===
mu = 0.10
sig = 0.20
S0 = 100.0
T = 1.0
N = 100
num_paths = 20


def b(t, S):
    return mu * S


def sigma(t, S):
    return sig * S


def sigma_prime(t, S):
    return sig


t_em, S_em = euler_maruyama(b, sigma, S0, T, N, num_paths=num_paths, seed=123)
t_mil, S_mil = milstein(b, sigma, sigma_prime, S0, T, N, num_paths=num_paths, seed=123)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(14, 5))

for i in range(num_paths):
    ax1.plot(t_em, S_em[i], alpha=0.7)
ax1.set_title("Euler-Maruyama")
ax1.set_xlabel("Time $t$")
ax1.set_ylabel(r"$S_t$")
ax1.grid(True, alpha=0.3)

for i in range(num_paths):
    ax2.plot(t_mil, S_mil[i], alpha=0.7)
ax2.set_title("Milstein")
ax2.set_xlabel("Time $t$")
ax2.set_ylabel(r"$S_t$")
ax2.grid(True, alpha=0.3)

plt.tight_layout()
plt.show()
```

Figure: GBM with Euler-Maruyama vs Milstein

5. Exact Simulation¶

When closed-form solutions are available, we can simulate without discretization error. Only Monte Carlo error remains.

Geometric Brownian Motion¶

The exact solution

\[ S_t = S_0 \exp\!\left[\left(\mu - \frac{\sigma^2}{2}\right)t + \sigma W_t\right] \]

can be sampled directly by generating $W_t \sim \mathcal{N}(0, t)$.
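As a minimal sketch of this idea, the block below draws terminal values $S_T$ in a single step and compares the sample mean with the known value $\mathbb{E}[S_T] = S_0 e^{\mu T}$. The parameter values, seed, and sample size are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(7)
mu, sig, S0, T = 0.10, 0.20, 100.0, 1.0
num_paths = 200_000

# One Gaussian draw per path: W_T ~ N(0, T)
W_T = np.sqrt(T) * rng.normal(size=num_paths)
S_T = S0 * np.exp((mu - 0.5 * sig**2) * T + sig * W_T)

sample_mean = S_T.mean()
theoretical_mean = S0 * np.exp(mu * T)  # E[S_T] = S0 * exp(mu * T)
standard_error = S_T.std(ddof=1) / np.sqrt(num_paths)

print(f"Sample mean:      {sample_mean:.4f}")
print(f"Theoretical mean: {theoretical_mean:.4f}")
print(f"Standard error:   {standard_error:.4f}")
```

No time grid is needed, and every sample is strictly positive; the only error left is the Monte Carlo standard error.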

Ornstein-Uhlenbeck Process¶

The conditional distribution of the OU process is Gaussian:

\[ X_{t+\Delta t} \mid X_t \sim \mathcal{N}\!\left(X_t\,e^{-\kappa\Delta t} + \theta(1 - e^{-\kappa\Delta t}),\; \frac{\sigma^2}{2\kappa}(1 - e^{-2\kappa\Delta t})\right) \]

For coding, this is equivalent to:

\[ X_{t+\Delta t} = X_t\,e^{-\kappa\Delta t} + \theta(1 - e^{-\kappa\Delta t}) + \sigma\sqrt{\frac{1 - e^{-2\kappa\Delta t}}{2\kappa}}\;Z, \qquad Z \sim \mathcal{N}(0, 1) \]

This allows exact step-by-step simulation without any approximation error. In the code below, kappa corresponds to $\kappa$, theta to $\theta$, and sigma to $\sigma$.

Comparison: Exact OU vs Euler-Maruyama¶

```python
import numpy as np
import matplotlib.pyplot as plt


def euler_maruyama(b, sigma, X0, T, N, num_paths=1, seed=None):
    """Euler-Maruyama simulation for a scalar SDE."""
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()
            X[path, n + 1] = (
                X[path, n]
                + b(t[n], X[path, n]) * dt
                + sigma(t[n], X[path, n]) * dW
            )

    return t, X


def exact_ou(X0, kappa, theta, sigma, T, N, num_paths=1, seed=None):
    """Exact simulation of the Ornstein-Uhlenbeck process on a discrete grid."""
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    exp_kappa_dt = np.exp(-kappa * dt)
    mean_coef = 1.0 - exp_kappa_dt
    var = (sigma**2 / (2.0 * kappa)) * (1.0 - np.exp(-2.0 * kappa * dt))
    std = np.sqrt(var)

    for path in range(num_paths):
        for n in range(N):
            mean = X[path, n] * exp_kappa_dt + theta * mean_coef
            X[path, n + 1] = mean + std * rng.normal()

    return t, X


# === OU parameters ===
kappa = 2.0
theta = 1.0
sig = 0.30
X0 = 0.50
T = 2.0
N = 50
num_paths = 5000


def b(t, x):
    return kappa * (theta - x)


def sigma_fn(t, x):
    return sig


t_exact, X_exact = exact_ou(X0, kappa, theta, sig, T, N, num_paths=num_paths, seed=123)
t_em, X_em = euler_maruyama(b, sigma_fn, X0, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
ax.hist(X_exact[:, -1], bins=50, density=True, alpha=0.5, label="Exact")
ax.hist(X_em[:, -1], bins=50, density=True, alpha=0.5, label="Euler-Maruyama")
ax.set_xlabel(r"$X_T$")
ax.set_ylabel("Density")
ax.set_title(f"OU Terminal Distribution Comparison (N = {N})")
ax.grid(True, alpha=0.3)
ax.legend()
plt.tight_layout()
plt.show()

print(f"Exact: mean = {np.mean(X_exact[:, -1]):.4f}, std = {np.std(X_exact[:, -1]):.4f}")
print(f"Euler-Maruyama: mean = {np.mean(X_em[:, -1]):.4f}, std = {np.std(X_em[:, -1]):.4f}")
```

Figure: OU Terminal Distribution Comparison, Exact vs Euler-Maruyama

6. Advanced Schemes¶

Predictor-Corrector Methods¶

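A Heun-style predictor-corrector uses Euler to predict, then corrects using the average of drift and diffusion at both endpoints. Note that for state-dependent diffusion, averaging $\sigma$ at both endpoints adds an effective drift of $\frac{1}{2}\sigma\sigma'$, so the scheme is consistent with the Stratonovich rather than the Itô interpretation. The example below uses additive noise, where the two interpretations coincide.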

```python
import numpy as np
import matplotlib.pyplot as plt


def predictor_corrector(b, sigma, X0, T, N, num_paths=1, seed=None):
    """
    Heun-style predictor-corrector method for a scalar SDE

        dX_t = b(t, X_t) dt + sigma(t, X_t) dW_t

    Parameters
    ----------
    b : callable
        Drift function b(t, x).
    sigma : callable
        Diffusion function sigma(t, x).
    X0 : float
        Initial value.
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of simulated paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid of shape (N + 1,).
    X : np.ndarray
        Simulated paths of shape (num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.zeros((num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for path in range(num_paths):
        for n in range(N):
            x_n = X[path, n]
            dW = np.sqrt(dt) * rng.normal()

            # Predictor step
            x_pred = x_n + b(t[n], x_n) * dt + sigma(t[n], x_n) * dW

            # Corrector step
            X[path, n + 1] = (
                x_n
                + 0.5 * (b(t[n], x_n) + b(t[n + 1], x_pred)) * dt
                + 0.5 * (sigma(t[n], x_n) + sigma(t[n + 1], x_pred)) * dW
            )

    return t, X


# === Example: Ornstein-Uhlenbeck process ===
kappa = 2.0
theta = 1.0
sig = 0.30
X0 = 0.50
T = 2.0
N = 1000
num_paths = 20


def b(t, x):
    return kappa * (theta - x)


def sigma(t, x):
    return sig


t, X = predictor_corrector(b, sigma, X0, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(num_paths):
    ax.plot(t, X[i], alpha=0.7)

ax.axhline(theta, linestyle="--", linewidth=2, label=rf"Long-run mean $\theta={theta}$")
ax.set_xlabel("Time $t$")
ax.set_ylabel(r"$X_t$")
ax.set_title("Ornstein-Uhlenbeck Process via Predictor-Corrector")
ax.grid(True, alpha=0.3)
ax.legend()
plt.tight_layout()
plt.show()
```

Figure: Ornstein-Uhlenbeck Process via Predictor-Corrector

7. Multidimensional SDEs¶

System of SDEs¶

For $X_t = (X_t^1, \ldots, X_t^d)$ driven by $W_t = (W_t^1, \ldots, W_t^m)$:

\[ dX_t^i = b^i(t, X_t)\,dt + \sum_{j=1}^m \sigma^{ij}(t, X_t)\,dW_t^j \]

Correlated Brownian Motions¶

Correlation structure: $d\langle W^i, W^j \rangle_t = \rho_{ij}\,dt$

Cholesky decomposition: Write $W_t = L Z_t$ where $Z_t$ has independent components and $L L^T = \rho$.

```python
import numpy as np
import matplotlib.pyplot as plt


def correlated_BM(rho, T, N, num_paths=1, seed=None):
    """
    Generate correlated Brownian motions on a time grid.

    Parameters
    ----------
    rho : np.ndarray
        Correlation matrix of shape (d, d).
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid of shape (N + 1,).
    W : np.ndarray
        Brownian paths of shape (num_paths, d, N + 1).
    """
    rng = np.random.default_rng(seed)
    rho = np.asarray(rho, dtype=float)

    if rho.ndim != 2 or rho.shape[0] != rho.shape[1]:
        raise ValueError("rho must be a square correlation matrix.")

    d = rho.shape[0]
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    L = np.linalg.cholesky(rho)
    W = np.zeros((num_paths, d, N + 1), dtype=float)

    for path in range(num_paths):
        Z = rng.normal(size=(d, N))
        dW = np.sqrt(dt) * (L @ Z)
        W[path, :, 1:] = np.cumsum(dW, axis=1)

    return t, W


# === Example: 2D correlated Brownian motion ===
rho = np.array([
    [1.0, 0.7],
    [0.7, 1.0],
])

T = 1.0
N = 1000
num_paths = 2000  # a single path cannot yield a terminal correlation estimate
num_plot = 1      # plot only the first path

t, W = correlated_BM(rho, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for path in range(num_plot):
    ax.plot(t, W[path, 0], alpha=0.7, label="W1" if path == 0 else None)
    ax.plot(t, W[path, 1], alpha=0.7, linestyle="--", label="W2" if path == 0 else None)

ax.set_xlabel("Time $t$")
ax.set_ylabel("Brownian value")
ax.set_title("Sample Paths of Two Correlated Brownian Motions")
ax.grid(True, alpha=0.3)
ax.legend()
plt.tight_layout()
plt.show()

# Check empirical terminal correlation across paths
terminal_corr = np.corrcoef(W[:, 0, -1], W[:, 1, -1])[0, 1]
print(f"Target correlation: {rho[0, 1]:.3f}")
print(f"Empirical correlation: {terminal_corr:.3f}")
```

Figure: Sample Paths of Two Correlated Brownian Motions

Example: Heston Model¶

\[ \begin{cases} dS_t = \mu S_t\,dt + \sqrt{V_t}\,S_t\,dW_t^1 \\ dV_t = \kappa(\theta - V_t)\,dt + \xi\sqrt{V_t}\,dW_t^2 \end{cases} \]

with $d\langle W^1, W^2 \rangle_t = \rho\,dt$.

```python
import numpy as np
import matplotlib.pyplot as plt


def heston_euler(S0, V0, mu, kappa, theta, xi, rho, T, N, num_paths=1, seed=None):
    """
    Euler-Maruyama simulation for the Heston model with full truncation:

        dS_t = mu S_t dt + sqrt(V_t) S_t dW_t^1
        dV_t = kappa(theta - V_t) dt + xi sqrt(V_t) dW_t^2

    with corr(dW^1, dW^2) = rho.

    Parameters
    ----------
    S0, V0 : float
        Initial stock price and variance.
    mu, kappa, theta, xi, rho : float
        Heston parameters.
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of simulated paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid.
    S : np.ndarray
        Simulated stock price paths, shape (num_paths, N + 1).
    V : np.ndarray
        Simulated variance paths, shape (num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    S = np.zeros((num_paths, N + 1), dtype=float)
    V = np.zeros((num_paths, N + 1), dtype=float)
    S[:, 0] = S0
    V[:, 0] = V0

    corr = np.array([[1.0, rho], [rho, 1.0]], dtype=float)
    L = np.linalg.cholesky(corr)

    for path in range(num_paths):
        for n in range(N):
            Z = rng.normal(size=2)
            dW = np.sqrt(dt) * (L @ Z)

            s_n = S[path, n]
            v_plus = max(V[path, n], 0.0)  # full truncation

            S[path, n + 1] = s_n + mu * s_n * dt + np.sqrt(v_plus) * s_n * dW[0]
            V[path, n + 1] = (
                V[path, n]
                + kappa * (theta - v_plus) * dt
                + xi * np.sqrt(v_plus) * dW[1]
            )

    return t, S, V


# === Example parameters ===
S0 = 100.0
V0 = 0.04
mu = 0.05
kappa = 2.0
theta = 0.04
xi = 0.30
rho = -0.7
T = 1.0
N = 1000
num_paths = 10

t, S, V = heston_euler(
    S0=S0, V0=V0, mu=mu, kappa=kappa, theta=theta, xi=xi, rho=rho,
    T=T, N=N, num_paths=num_paths, seed=123,
)

fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(14, 5))

for i in range(num_paths):
    ax1.plot(t, S[i], alpha=0.7)
ax1.set_title("Heston Model: Stock Price Paths")
ax1.set_xlabel("Time $t$")
ax1.set_ylabel(r"$S_t$")
ax1.grid(True, alpha=0.3)

for i in range(num_paths):
    ax2.plot(t, V[i], alpha=0.7)
ax2.set_title("Heston Model: Variance Paths")
ax2.set_xlabel("Time $t$")
ax2.set_ylabel(r"$V_t$")
ax2.grid(True, alpha=0.3)

plt.tight_layout()
plt.show()

print(f"Final stock mean: {np.mean(S[:, -1]):.4f}")
print(f"Final variance mean: {np.mean(V[:, -1]):.4f}")
print(f"Minimum variance seen: {np.min(V):.6f}")
```

Figure: Heston Model, Stock Price Paths and Variance Paths

Sample output:

Final stock mean: 111.8044
Final variance mean: 0.0315
Minimum variance seen: 0.000447

8. Error Sources and Variance Reduction¶

Two Types of Error¶

Total error = Discretization error + Monte Carlo error

\[ \text{Error} \approx C_1 \Delta t^\gamma + \frac{C_2}{\sqrt{M}} \]

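where $\gamma$ is the convergence order of the discretization (the strong order for pathwise targets; for expectations the weak order $\beta$ takes its place) and $M$ is the number of Monte Carlo paths.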

Optimal Allocation¶

To minimize computational cost for fixed total error $\varepsilon$, balance the two error sources:

\[ \Delta t^\gamma \approx \frac{1}{\sqrt{M}} \]

For Euler-Maruyama ($\gamma = 0.5$, targeting strong error $\varepsilon$): set $\Delta t \sim \varepsilon^2$ so that $N \sim \varepsilon^{-2}$, and $M \sim \varepsilon^{-2}$, giving total cost $\sim \varepsilon^{-4}$. If the goal is only weak error $\varepsilon$ (e.g., option pricing), the weaker convergence rate $\beta = 1$ allows $N \sim \varepsilon^{-1}$, reducing total cost to $\sim \varepsilon^{-3}$.

For Milstein ($\gamma = 1.0$, strong-error targets): the improved strong order reduces the required $N$, giving total cost $\sim \varepsilon^{-3}$.
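These scalings can be turned into a rough budget calculator. The sketch below is illustrative only: the helper `mc_budget` is hypothetical, and it sets the unknown constants $C_1$ and $C_2$ to 1, so the absolute numbers matter less than how the cost grows as $\varepsilon$ shrinks.

```python
import math


def mc_budget(eps, order):
    """Return (N, M, cost) for target error eps and discretization order.

    Assumes unit constants: dt**order ~ eps and 1/sqrt(M) ~ eps, with T = 1.
    """
    # round() guards against floating-point noise such as 10000.000000000002
    N = math.ceil(round(eps ** (-1.0 / order), 8))  # number of time steps
    M = math.ceil(round(eps ** -2.0, 8))            # number of paths
    return N, M, N * M


cases = [
    ("Euler-Maruyama, strong (order 0.5)", 0.5),
    ("Euler-Maruyama, weak (order 1.0)", 1.0),
    ("Milstein, strong (order 1.0)", 1.0),
]

for eps in (0.1, 0.01):
    print(f"Target error eps = {eps}")
    for label, order in cases:
        N, M, cost = mc_budget(eps, order)
        print(f"  {label}: N = {N}, M = {M}, cost = {cost}")
```

Reducing $\varepsilon$ by a factor of 10 multiplies the cost by $10^4$ at order 0.5 and by $10^3$ at order 1, matching the $\varepsilon^{-4}$ and $\varepsilon^{-3}$ rates above.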

Antithetic Variates¶

For each path with increments $\Delta W_n$, simulate another with $-\Delta W_n$. The pair average has lower variance than independent paths.

```python
import numpy as np
import matplotlib.pyplot as plt


def euler_maruyama_antithetic(b, sigma, X0, T, N, num_paths=2, seed=None):
    """
    Euler-Maruyama with antithetic variates for a scalar SDE

        dX_t = b(t, X_t) dt + sigma(t, X_t) dW_t

    The function generates paths in pairs using increments dW and -dW.

    Parameters
    ----------
    b : callable
        Drift function b(t, x).
    sigma : callable
        Diffusion function sigma(t, x).
    X0 : float
        Initial value.
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Requested number of paths. Rounded down to the nearest even number.
    seed : int or None, optional
        Random seed.

    Returns
    -------
    t : np.ndarray
        Time grid.
    X : np.ndarray
        Simulated paths of shape (effective_num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    effective_num_paths = 2 * (num_paths // 2)
    if effective_num_paths == 0:
        raise ValueError("num_paths must be at least 2.")

    X = np.zeros((effective_num_paths, N + 1), dtype=float)
    X[:, 0] = X0

    for pair in range(effective_num_paths // 2):
        dW = np.sqrt(dt) * rng.normal(size=N)

        # Positive path
        for n in range(N):
            x_n = X[2 * pair, n]
            X[2 * pair, n + 1] = x_n + b(t[n], x_n) * dt + sigma(t[n], x_n) * dW[n]

        # Antithetic path
        for n in range(N):
            x_n = X[2 * pair + 1, n]
            X[2 * pair + 1, n + 1] = x_n + b(t[n], x_n) * dt + sigma(t[n], x_n) * (-dW[n])

    return t, X


# === Example: GBM with antithetic paths ===
mu = 0.10
sig = 0.20
S0 = 100.0
T = 1.0
N = 500
num_paths = 20


def b(t, S):
    return mu * S


def sigma(t, S):
    return sig * S


t, X = euler_maruyama_antithetic(b, sigma, S0, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(X.shape[0]):
    ax.plot(t, X[i], alpha=0.7)

ax.set_xlabel("Time $t$")
ax.set_ylabel(r"$X_t$")
ax.set_title("GBM via Euler-Maruyama with Antithetic Variates")
ax.grid(True, alpha=0.3)
plt.tight_layout()
plt.show()

# Variance reduction diagnostic
pair_means = 0.5 * (X[0::2, -1] + X[1::2, -1])
plain_terminals = X[:, -1]

print(f"Number of simulated paths: {X.shape[0]}")
print(f"Mean terminal value (all paths): {np.mean(plain_terminals):.4f}")
print(f"Mean terminal value (pair averages): {np.mean(pair_means):.4f}")
print(f"Std of terminal values: {np.std(plain_terminals):.4f}")
print(f"Std of pair averages: {np.std(pair_means):.4f}")
```

Figure: GBM via Euler-Maruyama with Antithetic Variates

Sample output:

Number of simulated paths: 20
Mean terminal value (all paths): 108.7344
Mean terminal value (pair averages): 108.7344
Std of terminal values: 9.4768
Std of pair averages: 0.4801

Other variance reduction techniques include control variates (using a known expectation to reduce variance) and importance sampling (changing measure to concentrate samples in the relevant region).
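As a minimal sketch of a control variate, the block below estimates $\mathbb{E}[\max(S_T - K, 0)]$ for exactly sampled GBM and uses $S_T$ itself as the control, since $\mathbb{E}[S_T] = S_0 e^{\mu T}$ is known. The strike `K`, the seed, and the use of the real-world drift without discounting are assumptions made for illustration; the coefficient `beta` is estimated from the same sample, which introduces a small bias that vanishes as the sample grows.

```python
import numpy as np

rng = np.random.default_rng(11)
mu, sig, S0, T, K = 0.10, 0.20, 100.0, 1.0, 100.0
num_paths = 50_000

# Exact GBM terminal values
W_T = np.sqrt(T) * rng.normal(size=num_paths)
S_T = S0 * np.exp((mu - 0.5 * sig**2) * T + sig * W_T)

payoff = np.maximum(S_T - K, 0.0)   # quantity whose mean we want
control = S_T                       # control variate
control_mean = S0 * np.exp(mu * T)  # known: E[S_T] = S0 * exp(mu * T)

# Variance-minimizing coefficient: beta = Cov(payoff, control) / Var(control)
cov = np.cov(payoff, control)
beta = cov[0, 1] / cov[1, 1]

# Subtract the part of the payoff that is explained by the control's sampling error
adjusted = payoff - beta * (control - control_mean)

print(f"Plain estimate:           {payoff.mean():.4f}  (sample var {payoff.var():.2f})")
print(f"Control-variate estimate: {adjusted.mean():.4f}  (sample var {adjusted.var():.2f})")
print(f"Variance ratio:           {adjusted.var() / payoff.var():.3f}")
```

The more strongly the control correlates with the payoff, the smaller the variance ratio.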

9. Practical Considerations¶

Choosing the Time Step¶

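Accuracy requirement: $\Delta t \lesssim \varepsilon^{1/\gamma}$ for target strong error $\varepsilon$ (for Euler-Maruyama, $\gamma = 0.5$ gives $\Delta t \lesssim \varepsilon^2$)
Stability: For mean-reverting SDEs with reversion speed $\kappa$, keep $\kappa \Delta t < 1$; the Euler drift update multiplies the deviation from the long-run mean by $1 - \kappa\Delta t$, which oscillates for $\kappa\Delta t > 1$ and diverges for $\kappa\Delta t > 2$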
Non-negativity: Euler-Maruyama may produce negative values for square-root processes such as CIR. Specialized schemes (full truncation, exact simulation) are preferred
Computational budget: Balance the number of time steps $N$ and paths $M$
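The stability constraint can be seen without any randomness. Writing $\kappa$ for the mean-reversion speed and $\theta$ for the long-run mean (the OU notation used earlier on this page), the mean of the Euler-Maruyama iterates satisfies $m_{n+1} = m_n + \kappa(\theta - m_n)\Delta t$, because the noise term has zero expectation. The sketch below iterates this recursion for three step sizes; the helper name and the step sizes are illustrative choices.

```python
kappa, theta, X0 = 2.0, 1.0, 0.5
num_steps = 20


def euler_mean_path(dt):
    """Mean of the Euler-Maruyama iterates for the OU process."""
    m = [X0]
    for _ in range(num_steps):
        # Each step multiplies the deviation (m - theta) by (1 - kappa * dt)
        m.append(m[-1] + kappa * (theta - m[-1]) * dt)
    return m


paths = {}
final_deviation = {}
for dt in (0.25, 0.75, 1.25):  # kappa * dt = 0.5, 1.5, 2.5
    paths[dt] = euler_mean_path(dt)
    final_deviation[dt] = abs(paths[dt][-1] - theta)
    first_values = ", ".join(f"{v:.3f}" for v in paths[dt][:4])
    print(f"kappa*dt = {kappa * dt:.1f}: first values [{first_values}], "
          f"|m_N - theta| = {final_deviation[dt]:.3e}")
```

With $\kappa\Delta t = 0.5$ the mean approaches $\theta$ monotonically, with 1.5 it overshoots and oscillates while still converging, and with 2.5 it diverges.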

Scheme Comparison¶

```python
import numpy as np


def euler_maruyama_terminal_gbm(S0, mu, sig, T, N, num_paths, seed=None):
    """Terminal GBM values via Euler-Maruyama, vectorized over paths."""
    rng = np.random.default_rng(seed)
    dt = T / N
    S = np.full(num_paths, S0, dtype=float)

    for _ in range(N):
        dW = np.sqrt(dt) * rng.normal(size=num_paths)
        S = S + mu * S * dt + sig * S * dW

    return S


def milstein_terminal_gbm(S0, mu, sig, T, N, num_paths, seed=None):
    """Terminal GBM values via Milstein, vectorized over paths."""
    rng = np.random.default_rng(seed)
    dt = T / N
    S = np.full(num_paths, S0, dtype=float)

    for _ in range(N):
        dW = np.sqrt(dt) * rng.normal(size=num_paths)
        correction = 0.5 * (sig * S) * sig * (dW**2 - dt)
        S = S + mu * S * dt + sig * S * dW + correction

    return S


def exact_terminal_gbm(S0, mu, sig, T, num_paths, seed=None):
    """Terminal GBM values sampled from the exact solution."""
    rng = np.random.default_rng(seed)
    W_T = np.sqrt(T) * rng.normal(size=num_paths)
    return S0 * np.exp((mu - 0.5 * sig**2) * T + sig * W_T)


mu = 0.10
sig = 0.20
S0 = 100.0
T = 1.0
N = 100
num_paths = 10000

S_exact = exact_terminal_gbm(S0, mu, sig, T, num_paths, seed=123)
S_em = euler_maruyama_terminal_gbm(S0, mu, sig, T, N, num_paths, seed=123)
S_mil = milstein_terminal_gbm(S0, mu, sig, T, N, num_paths, seed=123)

print(f"Exact: mean = {np.mean(S_exact):.4f}, std = {np.std(S_exact):.4f}")
print(f"Euler: mean = {np.mean(S_em):.4f}, std = {np.std(S_em):.4f}")
print(f"Milstein: mean = {np.mean(S_mil):.4f}, std = {np.std(S_mil):.4f}")

print("\nMean absolute error versus exact terminal distribution sample:")
print(f"Euler: {np.mean(np.abs(S_em - S_exact)):.4f}")
print(f"Milstein: {np.mean(np.abs(S_mil - S_exact)):.4f}")
```

```text
Euler: mean = 110.6295, std = 22.2744
Milstein: mean = 110.6284, std = 22.2810

Mean absolute error versus exact terminal distribution sample:
Euler: 23.7281
Milstein: 23.7279
```
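The two mean absolute errors above are both close to 23.7 because the exact sample and the stepped schemes are driven by different random numbers, so the figure mostly reflects the spread of $S_T$ itself. A pathwise comparison needs all three values built from the same Brownian increments. The following sketch does that; the function name `coupled_terminal_errors` is an assumption for this example, and the exact value uses $W_T = \sum_n \Delta W_n$.

```python
import numpy as np


def coupled_terminal_errors(S0, mu, sig, T, N, num_paths, seed=None):
    """Mean |S_scheme - S_exact| at time T with shared Brownian increments."""
    rng = np.random.default_rng(seed)
    dt = T / N

    S_em = np.full(num_paths, S0, dtype=float)
    S_mil = np.full(num_paths, S0, dtype=float)
    W_T = np.zeros(num_paths)

    for _ in range(N):
        dW = np.sqrt(dt) * rng.normal(size=num_paths)
        S_em = S_em + mu * S_em * dt + sig * S_em * dW
        S_mil = (S_mil + mu * S_mil * dt + sig * S_mil * dW
                 + 0.5 * sig**2 * S_mil * (dW**2 - dt))
        W_T += dW  # accumulate the same increments for the exact solution

    S_exact = S0 * np.exp((mu - 0.5 * sig**2) * T + sig * W_T)
    return np.mean(np.abs(S_em - S_exact)), np.mean(np.abs(S_mil - S_exact))


err_em, err_mil = coupled_terminal_errors(100.0, 0.10, 0.20, 1.0, 100, 10000, seed=123)
print(f"Pathwise error, Euler:    {err_em:.4f}")
print(f"Pathwise error, Milstein: {err_mil:.4f}")
```

With shared increments the Milstein error should come out clearly smaller than the Euler-Maruyama error, in line with strong orders 1 and 0.5.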

10. Log-Euler Scheme for Geometric Brownian Motion

Motivation

The Euler-Maruyama scheme applied directly to GBM can produce negative prices if the time step is large. A simple fix is to simulate the log price $X_t = \log S_t$.

Using Itô's lemma:

\[ dX_t = \left(\mu - \frac{\sigma^2}{2}\right)dt + \sigma\,dW_t \]

Both the drift and the diffusion coefficient of $X_t$ are now constant, so the Euler discretization of $X_t$ is exact at the grid points.
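For reference, the Itô step can be spelled out as follows, assuming $S_t > 0$ so that $f(s) = \log s$ is smooth along the path and using $(dS_t)^2 = \sigma^2 S_t^2\,dt$:

\[ dX_t = \frac{1}{S_t}\,dS_t - \frac{1}{2 S_t^2}\,(dS_t)^2 = \left(\mu\,dt + \sigma\,dW_t\right) - \frac{1}{2}\sigma^2\,dt \]

Integrating over one time step then gives, with no approximation,

\[ X_{t_{n+1}} - X_{t_n} = \left(\mu - \frac{\sigma^2}{2}\right)\Delta t + \sigma\,\Delta W_n \]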

Log-Euler Scheme

\[ S_{n+1} = S_n \exp\!\left[\left(\mu - \frac{\sigma^2}{2}\right)\Delta t + \sigma \Delta W_n\right] \]

This guarantees $S_n > 0$ for all steps.

```python
import numpy as np
import matplotlib.pyplot as plt


def log_euler_gbm(S0, mu, sigma, T, N, num_paths=1, seed=None):
    """
    Log-Euler simulation for geometric Brownian motion.

        dS_t = mu S_t dt + sigma S_t dW_t

    Parameters
    ----------
    S0 : float
        Initial stock price.
    mu : float
        Drift parameter.
    sigma : float
        Volatility parameter.
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of simulated paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid of shape (N + 1,).
    S : np.ndarray
        Simulated paths of shape (num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)

    dt = T / N
    t = np.linspace(0, T, N + 1)

    S = np.zeros((num_paths, N + 1))
    S[:, 0] = S0

    for path in range(num_paths):
        for n in range(N):
            dW = np.sqrt(dt) * rng.normal()

            S[path, n + 1] = S[path, n] * np.exp(
                (mu - 0.5 * sigma**2) * dt + sigma * dW
            )

    return t, S


# === Parameters ===

S0 = 100
mu = 0.1
sigma = 0.2
T = 1
N = 500
num_paths = 20

t, S = log_euler_gbm(S0, mu, sigma, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(num_paths):
    ax.plot(t, S[i], alpha=0.7)

ax.set_title("GBM via Log-Euler Scheme")
ax.set_xlabel("Time $t$")
ax.set_ylabel("Stock price $S_t$")
ax.grid(True, alpha=0.3)
plt.tight_layout()
plt.show()
```

Figure: GBM via Log-Euler Scheme (stock price $S_t$ against time $t$).

Advantages

Log-Euler:

preserves positivity ($S_n > 0$ always)
matches the exact GBM distribution (since the log is an additive SDE)
is pathwise exact at the grid points for constant $\mu$ and $\sigma$ (zero strong error, so strong order 1 holds trivially)

This is the preferred scheme for GBM in financial simulations.

11. Exact Simulation of the CIR Process

CIR Model

The Cox-Ingersoll-Ross process

\[ dV_t = \kappa(\theta - V_t)\,dt + \sigma\sqrt{V_t}\,dW_t \]

is widely used for short-rate models and Heston volatility dynamics. The process is non-negative whenever $\kappa\theta \geq 0$, and it stays strictly positive (never reaches zero) when the Feller condition $2\kappa\theta \geq \sigma^2$ holds.

Euler simulation can violate positivity, so exact sampling is preferred.

Exact Transition Distribution

The CIR transition law is

\[ V_{t+\Delta t} \sim c \cdot \chi'^2_d(\lambda) \]

where

\[ c = \frac{\sigma^2(1 - e^{-\kappa\Delta t})}{4\kappa}, \qquad d = \frac{4\kappa\theta}{\sigma^2}, \qquad \lambda = \frac{4\kappa\,e^{-\kappa\Delta t}}{\sigma^2(1 - e^{-\kappa\Delta t})}\,V_t \]

and $\chi'^2_d(\lambda)$ is the noncentral chi-square distribution with $d$ degrees of freedom and noncentrality parameter $\lambda$ (standard convention).
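A quick consistency check on these constants: a noncentral chi-square with $d$ degrees of freedom and noncentrality $\lambda$ has mean $d + \lambda$, so the transition law implies $\mathbb{E}[V_{t+\Delta t} \mid V_t] = c\,(d + \lambda) = \theta + (V_t - \theta)e^{-\kappa\Delta t}$. The sketch below verifies this numerically. The parameter values and the current level `V_t` are assumptions picked for illustration.

```python
import numpy as np

# Illustrative values (assumed for this check only)
kappa, theta, sigma = 2.0, 0.04, 0.3
V_t, dt = 0.09, 0.1

decay = np.exp(-kappa * dt)
c = sigma**2 * (1 - decay) / (4 * kappa)
d = 4 * kappa * theta / sigma**2
lam = 4 * kappa * decay * V_t / (sigma**2 * (1 - decay))

# Mean implied by the transition law versus the mean-reversion formula
mean_from_law = c * (d + lam)
mean_closed_form = theta + (V_t - theta) * decay

# Monte Carlo check using NumPy's noncentral chi-square sampler
rng = np.random.default_rng(0)
samples = c * rng.noncentral_chisquare(d, lam, size=200_000)

print(f"c (d + lambda):          {mean_from_law:.6f}")
print(f"theta + (V_t - theta) e: {mean_closed_form:.6f}")
print(f"Sample mean:             {samples.mean():.6f}")
```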

```python
import numpy as np
import matplotlib.pyplot as plt


def exact_cir(V0, kappa, theta, sigma, T, N, num_paths=1, seed=None):
    """
    Exact simulation of the CIR process using the noncentral chi-square
    transition distribution.

    Parameters
    ----------
    V0 : float
        Initial variance.
    kappa : float
        Mean reversion speed.
    theta : float
        Long-run variance.
    sigma : float
        Volatility of variance (vol of vol).
    T : float
        Terminal time.
    N : int
        Number of time steps.
    num_paths : int, optional
        Number of simulated paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    t : np.ndarray
        Time grid of shape (N + 1,).
    V : np.ndarray
        Simulated paths of shape (num_paths, N + 1).
    """
    rng = np.random.default_rng(seed)

    dt = T / N
    t = np.linspace(0, T, N + 1)

    V = np.zeros((num_paths, N + 1))
    V[:, 0] = V0

    # Scale factor and degrees of freedom do not depend on the current state
    c = sigma**2 * (1 - np.exp(-kappa * dt)) / (4 * kappa)
    d = 4 * kappa * theta / sigma**2

    for path in range(num_paths):
        for n in range(N):
            # Noncentrality parameter depends on the current value V[path, n]
            lam = (
                4 * kappa * np.exp(-kappa * dt)
                * V[path, n]
                / (sigma**2 * (1 - np.exp(-kappa * dt)))
            )
            V[path, n + 1] = c * rng.noncentral_chisquare(d, lam)

    return t, V


# === Parameters ===

V0 = 0.04
kappa = 2.0
theta = 0.04
sigma = 0.3
T = 1
N = 500
num_paths = 20

t, V = exact_cir(V0, kappa, theta, sigma, T, N, num_paths=num_paths, seed=123)

fig, ax = plt.subplots(figsize=(10, 6))
for i in range(num_paths):
    ax.plot(t, V[i], alpha=0.7)

ax.set_title("Exact CIR Simulation")
ax.set_xlabel("Time $t$")
ax.set_ylabel("Variance $V_t$")
ax.grid(True, alpha=0.3)
plt.tight_layout()
plt.show()
```

Figure: Exact CIR Simulation (variance $V_t$ against time $t$).

12. Multilevel Monte Carlo

Motivation

Standard Monte Carlo needs on the order of $\varepsilon^{-2}$ paths to reach statistical error $\varepsilon$ (holding the discretization fixed, e.g., using exact simulation or a sufficiently fine grid). Multilevel Monte Carlo (MLMC) can reduce the total cost substantially by distributing work across multiple discretization levels.

Key Idea

Compute expectations using a telescoping sum:

\[ \mathbb{E}[X_L] = \mathbb{E}[X_0] + \sum_{l=1}^{L} \mathbb{E}[X_l - X_{l-1}] \]

where $X_l$ is the approximation using timestep $\Delta t_l = T \cdot 2^{-l}$. Each difference uses coupled Brownian paths. Because $X_l - X_{l-1}$ has small variance, far fewer samples are needed at fine levels.

MLMC Algorithm

Simulate many paths on coarse grids (cheap, high variance)
Simulate fewer paths on fine grids (expensive, low variance of the correction)
Combine estimates across levels

The optimal cost for Euler-Maruyama as the base scheme is

\[ \text{cost} \sim \varepsilon^{-2}(\log \varepsilon)^2 \]

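which is substantially cheaper than the $O(\varepsilon^{-3})$ cost of standard Monte Carlo with Euler-Maruyama for a root-mean-square error target $\varepsilon$ (about $\varepsilon^{-2}$ paths, each needing $O(\varepsilon^{-1})$ time steps to control the weak bias). The exact exponent depends on the convergence rates of the base scheme.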

```python
import numpy as np


def gbm_em_step(S, mu, sigma, dt, dW):
    """Single Euler-Maruyama step for GBM."""
    return S + mu * S * dt + sigma * S * dW


def mlmc_gbm(S0, mu, sigma, T, L=4, M=10000, seed=123):
    """
    Multilevel Monte Carlo for GBM terminal value E[S_T].

    Parameters
    ----------
    S0 : float
        Initial stock price.
    mu : float
        Drift parameter.
    sigma : float
        Volatility parameter.
    T : float
        Terminal time.
    L : int, optional
        Index of the finest level (levels 0, ..., L are used).
    M : int, optional
        Number of Monte Carlo paths on the coarsest level;
        level l uses M // 2**l paths.
    seed : int or None, optional
        Random seed for reproducibility.

    Returns
    -------
    float
        MLMC estimate of E[S_T].
    """
    rng = np.random.default_rng(seed)

    estimate = 0.0

    for level in range(L + 1):

        N = 2**level
        dt = T / N
        num_samples = max(M // (2**level), 1)

        sumY = 0.0

        for _ in range(num_samples):

            # Fine path
            S_f = S0
            dW_fine = np.sqrt(dt) * rng.normal(size=N)

            for n in range(N):
                S_f = gbm_em_step(S_f, mu, sigma, dt, dW_fine[n])

            # Coarse path (aggregate pairs of fine increments)
            if level > 0:
                S_c = S0
                dt_c = 2 * dt

                for n in range(0, N, 2):
                    dW_c = dW_fine[n] + dW_fine[n + 1]
                    S_c = gbm_em_step(S_c, mu, sigma, dt_c, dW_c)

                Y = S_f - S_c
            else:
                Y = S_f

            sumY += Y

        # Add this level's term of the telescoping sum
        estimate += sumY / num_samples

    return estimate


# === Parameters ===

S0 = 100.0
mu = 0.1
sigma = 0.2
T = 1.0

est = mlmc_gbm(S0, mu, sigma, T)
exact_mean = S0 * np.exp(mu * T)

print(f"MLMC estimate of E[S_T] = {est:.4f}")
print(f"Exact E[S_T] = {exact_mean:.4f}")
```

```text
MLMC estimate of E[S_T] = 110.6207
Exact E[S_T] = 110.5171
```
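The statement that the level corrections $X_l - X_{l-1}$ have small variance can be checked directly. The sketch below, with the assumed helper name `level_variances`, estimates $\operatorname{Var}(X_l - X_{l-1})$ for Euler-Maruyama on GBM using the same coupling as above (each coarse increment is the sum of two fine increments). For this scheme the variance is expected to shrink roughly in proportion to $\Delta t_l$, that is, to about halve per level.

```python
import numpy as np


def level_variances(S0, mu, sigma, T, max_level=5, num_samples=20_000, seed=0):
    """Sample variance of the coupled correction X_l - X_{l-1} for l = 1..max_level."""
    rng = np.random.default_rng(seed)
    variances = []

    for level in range(1, max_level + 1):
        N = 2**level
        dt = T / N
        dW = np.sqrt(dt) * rng.normal(size=(num_samples, N))

        # Fine paths: N steps of size dt
        S_f = np.full(num_samples, S0, dtype=float)
        for n in range(N):
            S_f = S_f + mu * S_f * dt + sigma * S_f * dW[:, n]

        # Coarse paths: N/2 steps of size 2*dt, driven by summed fine increments
        dW_c = dW[:, 0::2] + dW[:, 1::2]
        S_c = np.full(num_samples, S0, dtype=float)
        for n in range(N // 2):
            S_c = S_c + mu * S_c * (2 * dt) + sigma * S_c * dW_c[:, n]

        variances.append(float(np.var(S_f - S_c)))

    return variances


for level, v in enumerate(level_variances(100.0, 0.1, 0.2, 1.0), start=1):
    print(f"level {level}: Var(X_l - X_(l-1)) = {v:.4f}")
```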

13. Summary

Scheme Selection Guide

| Scheme | Strong Order | Weak Order | When to Use |
|---|---|---|---|
| Euler-Maruyama | 0.5 | 1.0 | general purpose, simple |
| Milstein | 1.0 | 1.0 | when $\sigma'$ is available |
| Predictor-Corrector | 0.5 | 1.0 | better stability |
| Log-Euler | 1.0 | 1.0 | GBM (preserves positivity) |
| Exact | N/A | N/A | GBM, OU, CIR (when possible) |
| MLMC | N/A | N/A | cost reduction for expectations |
SDE Simulation Hierarchy

flowchart TD
    A["Discretization-based"] --> B["Euler-Maruyama"]
    A --> C["Milstein"]
    A --> D["Predictor-Corrector"]
    A --> E["Log-Euler"]

    F["Exact simulation"] --> G["GBM"]
    F --> H["OU"]
    F --> I["CIR"]

    J["Variance reduction"] --> K["Antithetic variates"]
    J --> L["Control variates"]
    J --> M["MLMC"]

Key Takeaway

Euler-Maruyama is the workhorse of SDE simulation: simple, robust, and applicable to any SDE. Milstein improves pathwise accuracy when the diffusion derivative is tractable. For GBM, the Log-Euler scheme preserves positivity and achieves strong order 1. Exact simulation eliminates time discretization error when closed-form transition distributions are available (GBM, OU, CIR). Multilevel Monte Carlo reduces the cost of computing expectations substantially compared to standard Monte Carlo. The total simulation error combines discretization error (controlled by step size) and Monte Carlo error (controlled by path count), and balancing these two sources is essential for efficient computation.

Exercises

Exercise 1. Implement the Euler–Maruyama scheme for the SDE $dX_t = \sin(X_t)\,dt + 0.5\,dW_t$ with $X_0 = 0$, $T = 5$, and $N = 1000$. Plot 20 sample paths. Does the process appear to have a stationary distribution?

Solution to Exercise 1

This is a conceptual and implementation exercise. The SDE $dX_t = \sin(X_t)\,dt + 0.5\,dW_t$ has a bounded drift ($|\sin(x)| \leq 1$) and constant diffusion. The Euler-Maruyama update is:

\[ X_{n+1} = X_n + \sin(X_n)\,\Delta t + 0.5\,\Delta W_n \]

with $\Delta t = 5/1000 = 0.005$ and $\Delta W_n \sim \mathcal{N}(0, \Delta t)$.

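The drift $\sin(X_t)$ vanishes at every multiple of $\pi$, and linearizing shows which equilibria are stable. Near $X = 2k\pi$ the drift behaves like $+(X - 2k\pi)$, so even multiples of $\pi$ are unstable; near $X = (2k+1)\pi$ it behaves like $-(X - (2k+1)\pi)$, so odd multiples are stable. Starting from the unstable point $X_0 = 0$, the noise pushes each path toward $+\pi$ or $-\pi$, where it then fluctuates; hops to the neighboring wells at $\pm 3\pi$ are possible but rare at this noise level. Strictly speaking there is no stationary distribution on $\mathbb{R}$: the invariant density is proportional to $\exp(-8\cos x)$, which is $2\pi$-periodic and therefore not normalizable. The wrapped process $X_t \bmod 2\pi$ does have a stationary distribution with this density, peaked at $x = \pi$, and a histogram of $X_T$ across many paths should show peaks near odd multiples of $\pi$.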
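A minimal sketch of the simulation itself is given below, without the plotting step so that it runs anywhere. The function name `euler_maruyama_sin` is an assumption for this example. Printing the terminal values in units of $\pi$ is one simple way to see where the paths end up.

```python
import numpy as np


def euler_maruyama_sin(X0=0.0, sigma=0.5, T=5.0, N=1000, num_paths=20, seed=None):
    """Euler-Maruyama paths for dX = sin(X) dt + sigma dW."""
    rng = np.random.default_rng(seed)
    dt = T / N
    t = np.linspace(0.0, T, N + 1)

    X = np.empty((num_paths, N + 1))
    X[:, 0] = X0

    for n in range(N):
        dW = np.sqrt(dt) * rng.normal(size=num_paths)
        X[:, n + 1] = X[:, n] + np.sin(X[:, n]) * dt + sigma * dW

    return t, X


t, X = euler_maruyama_sin(num_paths=20, seed=0)
print("Terminal values in units of pi:")
print(np.round(np.sort(X[:, -1]) / np.pi, 2))
```

To reproduce the requested figure, each row `X[i]` can be plotted against `t` with any plotting library.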

Exercise 2. For GBM with $\mu = 0.05$, $\sigma = 0.3$, $S_0 = 100$, and $T = 1$:

(a) Simulate $M = 10{,}000$ terminal values using Euler–Maruyama with $N = 10$, $N = 100$, and $N = 1000$ steps.

(b) Simulate $M = 10{,}000$ terminal values using exact simulation.

(c) Compare the sample mean and standard deviation across all four cases. How does the Euler–Maruyama bias change as $N$ increases?

Solution to Exercise 2

(a)-(c) For GBM with $\mu = 0.05$, $\sigma = 0.3$, $S_0 = 100$, $T = 1$, the exact terminal distribution is log-normal with:

\[ \mathbb{E}[S_1] = 100\,e^{0.05} \approx 105.127 \]

\[ \operatorname{Std}[S_1] = 100\,e^{0.05}\sqrt{e^{0.09} - 1} \approx 105.127 \times 0.30688 \approx 32.26 \]

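Euler-Maruyama introduces a weak-order bias of $O(\Delta t)$. For the mean this bias is available in closed form: since $\mathbb{E}[S_{n+1} \mid S_n] = S_n(1 + \mu\,\Delta t)$, the scheme has $\mathbb{E}[S_N] = S_0(1 + \mu\,\Delta t)^N$. With $M = 10{,}000$ paths the Monte Carlo standard error of the sample mean is about $32.26/\sqrt{M} \approx 0.32$, which sets the scale of what can be detected. As $N$ increases:

$N = 10$ ($\Delta t = 0.1$): the mean bias is about $-0.013$, far below the sampling error; the distribution is slightly distorted (the standard deviation is roughly $0.25$ too low).
$N = 100$ ($\Delta t = 0.01$): the mean bias is about $-0.0013$; sample statistics are statistically indistinguishable from the exact values.
$N = 1000$ ($\Delta t = 0.001$): bias is negligible for practical purposes.
Exact simulation: no discretization error at all; only Monte Carlo sampling error remains.

The Euler-Maruyama bias decreases linearly with $\Delta t$ (weak order 1), but at this path count differences in the sample mean are dominated by sampling noise. For GBM specifically, the Log-Euler scheme eliminates the bias entirely regardless of step size.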
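The bias in the mean can be computed without any simulation, because each Euler-Maruyama step multiplies the conditional mean by $(1 + \mu\,\Delta t)$, giving $\mathbb{E}[S_N] = S_0(1 + \mu\,\Delta t)^N$. The short sketch below evaluates this for the three step counts in the exercise; the helper name `em_mean` is an assumption for this example.

```python
import numpy as np

S0, mu, T = 100.0, 0.05, 1.0
exact_mean = S0 * np.exp(mu * T)


def em_mean(N):
    """Exact expectation of the Euler-Maruyama terminal value with N steps."""
    dt = T / N
    return S0 * (1.0 + mu * dt) ** N


for N in (10, 100, 1000):
    bias = em_mean(N) - exact_mean
    print(f"N = {N:5d}: E[S_N] = {em_mean(N):.4f}, bias = {bias:+.5f}")
```

Comparing these numbers with the Monte Carlo standard error for a given path count indicates whether the discretization bias is even visible in a simulation.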

Exercise 3. Derive the Milstein correction term for the CIR process $dr_t = a(\theta - r_t)\,dt + \sigma\sqrt{r_t}\,dW_t$. That is, compute $\sigma(r)\,\sigma'(r)$ where $\sigma(r) = \sigma\sqrt{r}$, and write down the full Milstein update step.

Solution to Exercise 3

For the CIR process $dr_t = a(\theta - r_t)\,dt + \sigma\sqrt{r_t}\,dW_t$, the diffusion function is $\sigma(r) = \sigma\sqrt{r}$.

The derivative is:

\[ \sigma'(r) = \frac{\sigma}{2\sqrt{r}} \]

The Milstein correction term is:

\[ \frac{1}{2}\sigma(r)\sigma'(r)(\Delta W^2 - \Delta t) = \frac{1}{2} \cdot \sigma\sqrt{r} \cdot \frac{\sigma}{2\sqrt{r}} \cdot (\Delta W^2 - \Delta t) = \frac{\sigma^2}{4}(\Delta W^2 - \Delta t) \]

The full Milstein update step is:

\[ r_{n+1} = r_n + a(\theta - r_n)\Delta t + \sigma\sqrt{r_n}\,\Delta W_n + \frac{\sigma^2}{4}(\Delta W_n^2 - \Delta t) \]

Note that the correction term $\frac{\sigma^2}{4}(\Delta W_n^2 - \Delta t)$ does not depend on $r_n$, which is a special feature of the square-root diffusion. However, this scheme can still produce negative values; boundary modifications (e.g., full truncation or reflection) are needed in practice.

Exercise 4. Implement exact simulation for the OU process with parameters $\kappa = 1$, $\theta = 0$, $\sigma = 1$, and $X_0 = 5$.

(a) Simulate $10{,}000$ paths to time $T = 10$ and plot a histogram of $X_T$.

(b) Overlay the theoretical stationary density $\mathcal{N}(\theta, \sigma^2/(2\kappa))$ on the histogram.

(c) Compute the sample mean and variance and compare with the theoretical values.

Solution to Exercise 4

OU process with $\kappa = 1$, $\theta = 0$, $\sigma = 1$, $X_0 = 5$.

(a)-(b) The exact conditional distribution is:

\[ X_{t+\Delta t} \mid X_t \sim \mathcal{N}\!\left(X_t\,e^{-\Delta t},\; \frac{1}{2}(1 - e^{-2\Delta t})\right) \]

At $T = 10$, $e^{-10} \approx 4.5 \times 10^{-5}$, so the initial condition has essentially decayed to zero.

The stationary distribution is $\mathcal{N}(\theta, \sigma^2/(2\kappa)) = \mathcal{N}(0, 0.5)$. The theoretical stationary density is:

\[ p(x) = \frac{1}{\sqrt{2\pi \times 0.5}}\exp\!\left(-\frac{x^2}{2 \times 0.5}\right) = \frac{1}{\sqrt{\pi}}\exp(-x^2) \]

(c) Theoretical values at $T = 10$:

\[ \mathbb{E}[X_{10}] = 5\,e^{-10} \approx 2.3 \times 10^{-4} \]

\[ \operatorname{Var}[X_{10}] = \frac{1}{2}(1 - e^{-20}) \approx 0.5000 \]

The sample mean and variance from $10{,}000$ paths should be very close to $0$ and $0.5$, respectively.
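A compact sketch of parts (a) and (c), leaving out the histogram, could look as follows. The function name `exact_ou_terminal` and the choice of 100 intermediate steps are assumptions; because the transition is exact, a single step of size $T$ would give the same terminal distribution.

```python
import numpy as np


def exact_ou_terminal(X0, kappa, theta, sigma, T, N, num_paths, seed=None):
    """Terminal OU values using the exact Gaussian transition on N steps."""
    rng = np.random.default_rng(seed)
    dt = T / N
    decay = np.exp(-kappa * dt)
    step_std = np.sqrt(sigma**2 / (2 * kappa) * (1 - decay**2))

    X = np.full(num_paths, X0, dtype=float)
    for _ in range(N):
        X = theta + (X - theta) * decay + step_std * rng.normal(size=num_paths)
    return X


X_T = exact_ou_terminal(5.0, 1.0, 0.0, 1.0, T=10.0, N=100, num_paths=10_000, seed=0)
print(f"Sample mean:     {X_T.mean():.4f} (theory: about 0.0002)")
print(f"Sample variance: {X_T.var():.4f} (theory: about 0.5)")
```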

Exercise 5. Implement antithetic variates for estimating $\mathbb{E}[e^{-rT}\max(S_T - K, 0)]$ (a European call option price) under GBM with $r = 0.05$, $\sigma = 0.2$, $S_0 = 100$, $K = 100$, and $T = 1$.

(a) Estimate the price using $M = 10{,}000$ standard Monte Carlo paths.

(b) Estimate the price using $M/2 = 5{,}000$ antithetic pairs (same total number of paths).

(c) Compare the standard errors of the two estimators.

Solution to Exercise 5

This is an implementation exercise. Under risk-neutral GBM, $S_T = S_0\exp[(r - \sigma^2/2)T + \sigma\sqrt{T}\,Z]$ with $Z \sim \mathcal{N}(0,1)$.

The Black-Scholes price for the at-the-money call ($S_0 = K = 100$, $r = 0.05$, $\sigma = 0.2$, $T = 1$) serves as a benchmark: approximately $10.45$.

(a) Standard Monte Carlo with $M = 10{,}000$ paths: generate $Z_1, \ldots, Z_M$ independently, compute $\hat{C} = e^{-rT}\frac{1}{M}\sum_{i=1}^M \max(S_T^{(i)} - K, 0)$ with standard error $\text{SE} = \hat{\sigma}/\sqrt{M}$.

(b) Antithetic variates with $M/2 = 5{,}000$ pairs: for each $Z_i$, compute payoffs for both $Z_i$ and $-Z_i$, then average each pair before averaging across pairs.

(c) For this problem the antithetic estimator reduces the standard error by a factor of roughly $1.4$ at equal path count (about half the variance), because $\max(S_T(Z) - K, 0)$ and $\max(S_T(-Z) - K, 0)$ are negatively correlated (when one is large, the other tends to be small), with correlation close to $-0.5$ for these parameters. The variance of the pair average $\frac{1}{2}[f(Z) + f(-Z)]$ is $\frac{1}{4}[\operatorname{Var}(f(Z)) + \operatorname{Var}(f(-Z)) + 2\operatorname{Cov}(f(Z), f(-Z))]$, and the negative covariance is what lowers the total variance.
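One possible implementation of parts (a) to (c) is sketched below. The function name `call_price_mc` and its `antithetic` flag are assumptions for this example. In the antithetic branch the standard error is computed from the $M/2$ pair averages, which are independent of each other, rather than from the $M$ individual payoffs.

```python
import numpy as np


def call_price_mc(S0, K, r, sigma, T, M, antithetic=False, seed=None):
    """Monte Carlo price and standard error of a European call under GBM."""
    rng = np.random.default_rng(seed)
    disc = np.exp(-r * T)

    def payoff(Z):
        S_T = S0 * np.exp((r - 0.5 * sigma**2) * T + sigma * np.sqrt(T) * Z)
        return disc * np.maximum(S_T - K, 0.0)

    if antithetic:
        Z = rng.normal(size=M // 2)
        samples = 0.5 * (payoff(Z) + payoff(-Z))  # one value per antithetic pair
    else:
        Z = rng.normal(size=M)
        samples = payoff(Z)

    price = samples.mean()
    se = samples.std(ddof=1) / np.sqrt(len(samples))
    return price, se


price_plain, se_plain = call_price_mc(100.0, 100.0, 0.05, 0.2, 1.0, 10_000, seed=0)
price_anti, se_anti = call_price_mc(100.0, 100.0, 0.05, 0.2, 1.0, 10_000,
                                    antithetic=True, seed=0)
print(f"Plain MC:   price = {price_plain:.4f}, SE = {se_plain:.4f}")
print(f"Antithetic: price = {price_anti:.4f}, SE = {se_anti:.4f}")
```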

Exercise 6. Explain why the Euler–Maruyama scheme applied directly to GBM can produce negative stock prices. For what combination of $\mu$, $\sigma$, and $\Delta t$ is this most likely to occur? Show that the Log-Euler scheme avoids this problem by construction.

Solution to Exercise 6

The Euler-Maruyama update for GBM $dS_t = \mu S_t\,dt + \sigma S_t\,dW_t$ is:

\[ S_{n+1} = S_n(1 + \mu\,\Delta t + \sigma\,\Delta W_n) \]

This can produce $S_{n+1} < 0$ whenever $1 + \mu\,\Delta t + \sigma\,\Delta W_n < 0$, i.e., when:

\[ \Delta W_n < -\frac{1 + \mu\,\Delta t}{\sigma} \]

Since $\Delta W_n \sim \mathcal{N}(0, \Delta t)$, this event has probability:

\[ \Phi\!\left(-\frac{1 + \mu\,\Delta t}{\sigma\sqrt{\Delta t}}\right) \]

This probability is largest when $\sigma\sqrt{\Delta t}$ is comparable to $1$ (large volatility combined with a large time step) and when $\mu$ is small or negative. For example, with $\mu = 0.05$, $\sigma = 0.3$ and $\Delta t = 0.1$, the threshold is approximately $\Delta W < -10.6\sqrt{\Delta t}$, which is extremely unlikely. But with $\mu = 0.05$, $\sigma = 1.0$ and $\Delta t = 1.0$, the threshold becomes $\Delta W < -1.05$, which has probability $\Phi(-1.05) \approx 15\%$.

The Log-Euler scheme avoids this by construction. The update is:

\[ S_{n+1} = S_n \exp\!\left[(\mu - \sigma^2/2)\Delta t + \sigma\,\Delta W_n\right] \]

Since the exponential function is always positive, $S_{n+1} > 0$ for any value of $\Delta W_n$, regardless of step size or parameters. This is because the scheme operates on $\log S$, which is an additive SDE, and then exponentiates the result.
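The per-step probability of a negative Euler-Maruyama value can be evaluated directly from the formula above. The sketch below uses only the standard library; the helper names `normal_cdf` and `prob_negative_step` are assumptions, and the normal CDF is written in terms of `math.erfc` so that very small tail probabilities do not round to zero.

```python
from math import erfc, sqrt


def normal_cdf(x):
    """Standard normal CDF, written via erfc for accuracy in the lower tail."""
    return 0.5 * erfc(-x / sqrt(2.0))


def prob_negative_step(mu, sigma, dt):
    """P(1 + mu*dt + sigma*dW < 0) for dW ~ N(0, dt)."""
    return normal_cdf(-(1.0 + mu * dt) / (sigma * sqrt(dt)))


for mu, sigma, dt in [(0.05, 0.3, 0.1), (0.05, 1.0, 1.0), (-0.5, 1.0, 1.0)]:
    p = prob_negative_step(mu, sigma, dt)
    print(f"mu = {mu:+.2f}, sigma = {sigma:.1f}, dt = {dt:.1f}: P(negative) = {p:.3e}")
```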

Exercise 7. Consider the stability of the Euler–Maruyama scheme for the OU process $dX_t = a(\theta - X_t)\,dt + \sigma\,dW_t$.

(a) Simulate with $a = 50$, $\theta = 1$, $\sigma = 0.1$, $X_0 = 1$, $T = 2$, and $N = 100$. Does the scheme remain stable? What is $a\,\Delta t$?

(b) Increase $N$ until the scheme stabilizes. What is the critical value of $a\,\Delta t$ for stability?

(c) Compare with exact OU simulation using the same parameters. Why is exact simulation immune to this stability issue?

Solution to Exercise 7

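(a) With $a = 50$, $\Delta t = T/N = 2/100 = 0.02$, so $a\,\Delta t = 50 \times 0.02 = 1.0$.

The Euler-Maruyama update for the OU process is:

\[ X_{n+1} = X_n + a(\theta - X_n)\Delta t + \sigma\,\Delta W_n = X_n(1 - a\,\Delta t) + a\theta\,\Delta t + \sigma\,\Delta W_n \]

At $a\,\Delta t = 1$ the factor $1 - a\,\Delta t$ vanishes and the update reduces to $X_{n+1} = \theta + \sigma\,\Delta W_n$. The scheme therefore remains stable, but every step forgets the previous state entirely, which misrepresents the true one-step decay factor $e^{-a\Delta t} = e^{-1} \approx 0.37$.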

(b) The critical stability condition is $|1 - a\,\Delta t| < 1$, which gives $a\,\Delta t < 2$. Thus $N > aT/2 = 50$ suffices. More conservatively, $a\,\Delta t < 1$ avoids oscillatory behavior entirely, requiring $N > aT = 100$. With $N = 200$ ($a\,\Delta t = 0.5$), the scheme should be clearly stable.

(c) Exact OU simulation uses the conditional distribution:

\[ X_{n+1} \mid X_n \sim \mathcal{N}(X_n e^{-a\Delta t} + \theta(1 - e^{-a\Delta t}),\; \tfrac{\sigma^2}{2a}(1 - e^{-2a\Delta t})) \]

The factor $e^{-a\Delta t}$ lies in $(0, 1)$ for all $a > 0$ and $\Delta t > 0$, so the scheme is unconditionally stable regardless of step size. The exact transition distribution captures the exponential decay without any amplification from discretization. Even with $a\,\Delta t = 100$, the exact scheme simply gives $e^{-100} \approx 0$, meaning each step essentially resamples from near the stationary distribution. This is why exact simulation is immune to the stability issues that affect explicit Euler schemes for stiff equations.
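The stability threshold can be seen numerically with a small sketch such as the one below. The function name `euler_ou` is an assumption, and the three step counts are chosen to give $a\,\Delta t = 2.5$, $1$ and $0.5$, so the first case lies beyond the stability limit $a\,\Delta t < 2$.

```python
import numpy as np


def euler_ou(X0, a, theta, sigma, T, N, seed=None):
    """Euler-Maruyama path for dX = a (theta - X) dt + sigma dW."""
    rng = np.random.default_rng(seed)
    dt = T / N
    X = np.empty(N + 1)
    X[0] = X0
    for n in range(N):
        dW = np.sqrt(dt) * rng.normal()
        X[n + 1] = X[n] + a * (theta - X[n]) * dt + sigma * dW
    return X


a, theta, sigma, X0, T = 50.0, 1.0, 0.1, 1.0, 2.0
for N in (40, 100, 200):
    path = euler_ou(X0, a, theta, sigma, T, N, seed=0)
    max_dev = np.max(np.abs(path - theta))
    print(f"N = {N:3d}, a*dt = {a * T / N:.2f}, max |X - theta| = {max_dev:.3e}")
```

For $N = 40$ the deviations from $\theta$ are multiplied by $|1 - a\,\Delta t| = 1.5$ at every step and grow geometrically, while the two finer grids stay close to $\theta$.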