Strong Markov Property¶

Concept Definition¶

Throughout, \((\Omega, \mathcal{F}, (\mathcal{F}_t)_{t \ge 0}, \mathbb{P})\) is a filtered probability space satisfying the usual conditions. A stopping time is a random variable \(\tau : \Omega \to [0, \infty]\) such that \(\{\tau \le t\} \in \mathcal{F}_t\) for all \(t \ge 0\). The stopped \(\sigma\)-algebra is

\[ \mathcal{F}_\tau := \{ A \in \mathcal{F} : A \cap \{\tau \le t\} \in \mathcal{F}_t \text{ for all } t \ge 0 \}. \]

Definition: Markov Property vs Strong Markov Property

A time-homogeneous Markov process \((X_t)_{t \ge 0}\) satisfies the Markov property if for all bounded measurable \(\varphi\) and all \(0 \le s \le t\),

\[ \mathbb{E}[\varphi(X_t) \mid \mathcal{F}_s] = \mathbb{E}^{X_s}[\varphi(X_{t-s})]. \]

It satisfies the strong Markov property if the same identity holds when the deterministic time \(s\) is replaced by any stopping time \(\tau < \infty\) \(\mathbb{P}\)-a.s.: for all \(t \ge 0\),

\[ \mathbb{E}[\varphi(X_{\tau + t}) \mid \mathcal{F}_\tau] = \mathbb{E}^{X_\tau}[\varphi(X_t)]. \]

The key difference: the Markov property is memoryless at fixed times; the strong Markov property is memoryless at random times (first passage times, exit times, etc.).

Explanation¶

Why Stopping Times Require a Separate Condition¶

For a fixed time \(s\), the Markov property follows from the independence of Brownian increments and measurability. At a stopping time \(\tau\), the event \(\{\tau = t\}\) depends on the entire history up to \(t\), so the argument requires additional structure — specifically, right-continuity of the filtration \((\mathcal{F}_t)\) and regularity of the transition semigroup.

Strong Markov Property for Brownian Motion¶

Theorem

Let \((W_t)_{t \ge 0}\) be a standard Brownian motion and \(\tau\) a stopping time with \(\mathbb{P}(\tau < \infty) = 1\). Define

\[ B_t := W_{\tau + t} - W_\tau, \qquad t \ge 0. \]

Then \((B_t)_{t \ge 0}\) is a standard Brownian motion and is independent of \(\mathcal{F}_\tau\).

This is the prototype. The proof uses right-continuity of \((\mathcal{F}_t)\) to approximate \(\tau\) by discrete stopping times \(\tau_n = \lceil 2^n \tau \rceil / 2^n\), apply the ordinary Markov property at each level, and pass to the limit.

Strong Markov Property for Diffusions¶

Theorem

Let \(X_t\) solve the SDE

\[ \mathrm{d}X_t^{i} = b^{i}(X_t)\,\mathrm{d}t + \sigma^{i\alpha}(X_t)\,\mathrm{d}W_t^{\alpha} \]

with \(b, \sigma\) locally Lipschitz with linear growth, so that a unique strong solution exists. Then \(X\) is a Feller process, and every Feller process with continuous paths is strong Markov.

Precision note

The Lipschitz/growth conditions guarantee existence and uniqueness of the strong solution. The strong Markov property then follows from the Feller property of the semigroup \(P_t f(x) = \mathbb{E}^x[f(X_t)]\): namely, \(P_t\) maps \(C_0(\mathbb{R}^d)\) (continuous functions vanishing at infinity) into itself, and \(\|P_t f - f\|_\infty \to 0\) as \(t \to 0\). Combined with right-continuity of \((\mathcal{F}_t)\), this yields strong Markov. The Lipschitz condition alone does not directly imply strong Markov; it is the route to the Feller property that does.

Distinction from the Optional Sampling Theorem¶

The strong Markov property concerns the conditional law of the future path given \(\mathcal{F}_\tau\). The Optional Sampling Theorem concerns the expected value of a martingale at \(\tau\). These are logically independent results, though both involve stopping times.

Diagram / Example¶

Exit Times and the Restart Argument¶

Let \(D \subset \mathbb{R}^d\) be open and define the exit time

\[ \tau_D := \inf\{t \ge 0 : X_t \notin D\}. \]

\(\tau_D\) is a stopping time. The strong Markov property asserts: conditionally on \(\mathcal{F}_{\tau_D}\), the post-exit process \((X_{\tau_D + t})_{t \ge 0}\) behaves like a fresh copy of \(X\) started from \(X_{\tau_D} \in \partial D\).

This restart argument is used repeatedly in:

Dirichlet boundary value problems (harmonic functions and BM hitting distributions),
computing the distribution of \(\max_{0 \le s \le t} W_s\) via the reflection principle,
establishing the strong Markov property at iterated stopping times (the process restarts freshly each time it hits a boundary).

Reflection Principle (Consequence)¶

For standard Brownian motion \(W\) and \(a > 0\):

\[ \mathbb{P}\!\left(\max_{0 \le s \le t} W_s \ge a\right) = 2\,\mathbb{P}(W_t \ge a). \]

Proof sketch. Let \(\tau_a = \inf\{s : W_s = a\}\). By the strong Markov property, \(B_s := W_{\tau_a + s} - a\) is a fresh standard BM independent of \(\mathcal{F}_{\tau_a}\). On \(\{\tau_a \le t\}\), the event \(\{W_t \ge a\}\) equals \(\{B_{t-\tau_a} \ge 0\}\) and by symmetry of BM has probability \(1/2\) conditionally on \(\mathcal{F}_{\tau_a}\). Therefore \(\mathbb{P}(W_t \ge a,\, \tau_a \le t) = \frac{1}{2}\mathbb{P}(\tau_a \le t)\). Since \(\mathbb{P}(\max_{s \le t} W_s \ge a) = \mathbb{P}(\tau_a \le t)\) and \(\mathbb{P}(W_t \ge a) = \mathbb{P}(W_t \ge a,\, \tau_a \le t)\) (as \(a > 0\)), the result follows. \(\square\)

Proof / Derivation¶

We sketch the proof for Brownian motion via discrete approximation.

Step 1. Define \(\tau_n := \frac{\lceil 2^n \tau \rceil}{2^n}\), a sequence of stopping times taking values in \(\{k/2^n : k \ge 0\}\) with \(\tau_n \ge \tau\) and \(\tau_n \downarrow \tau\) (i.e. \(\tau_{n+1} \le \tau_n\) for all \(n\), decreasing to \(\tau\) from above).

Step 2. For each fixed \(n\), on the event \(\{\tau_n = k/2^n\}\), we have \(\mathcal{F}_{\tau_n} = \mathcal{F}_{k/2^n}\) (by definition of the stopped \(\sigma\)-algebra restricted to this event), so:

\[ \mathbb{E}[\varphi(W_{\tau_n + t}) \mid \mathcal{F}_{\tau_n}] = \mathbb{E}^{W_{k/2^n}}[\varphi(W_t)] \]

by the ordinary Markov property at the deterministic time \(k/2^n\).

Step 3. Since \(\varphi\) is bounded and \(W\) has continuous paths, \(\varphi(W_{\tau_n + t}) \to \varphi(W_{\tau + t})\) a.s. as \(n \to \infty\). Since \(\tau_n \downarrow \tau\) (each \(\tau_n \ge \tau\)), stopping later gives more information: \(\mathcal{F}_{\tau_n} \supseteq \mathcal{F}_\tau\) (finer \(\sigma\)-algebras for larger stopping times), and \(\bigcap_n \mathcal{F}_{\tau_n} = \mathcal{F}_\tau\) by right-continuity. Dominated convergence and backward martingale convergence complete the passage to the limit. \(\square\)

What to Remember¶

The Markov property holds at fixed times; the strong Markov property holds at stopping times.
For Brownian motion: \((W_{\tau+t} - W_\tau)_{t \ge 0}\) is a fresh BM independent of \(\mathcal{F}_\tau\).
For diffusions: strong Markov follows from the Feller property (not directly from Lipschitz conditions).
The canonical application is the restart argument at first exit times, connecting SDEs to boundary value PDEs.

Exercises¶

Exercise 1. Let \((W_t)_{t \ge 0}\) be a standard Brownian motion and define \(\tau_a = \inf\{t \ge 0 : W_t = a\}\) for \(a > 0\). Using the strong Markov property, show that \(B_t := W_{\tau_a + t} - a\) is a standard Brownian motion independent of \(\mathcal{F}_{\tau_a}\). Explain why the ordinary Markov property at a fixed time is insufficient for this conclusion.

Solution to Exercise 1

Let \(\tau_a = \inf\{t \ge 0 : W_t = a\}\) with \(a > 0\). Define \(B_t := W_{\tau_a + t} - W_\tau = W_{\tau_a + t} - a\).

By the strong Markov property of Brownian motion: conditionally on \(\mathcal{F}_{\tau_a}\), the process \((W_{\tau_a + t})_{t \ge 0}\) is independent of \(\mathcal{F}_{\tau_a}\) and has the same law as a Brownian motion started from \(W_{\tau_a} = a\). Therefore \(B_t = W_{\tau_a + t} - a\) is a standard Brownian motion (started from \(0\)) independent of \(\mathcal{F}_{\tau_a}\).

To verify the defining properties: (1) \(B_0 = 0\); (2) for \(0 \le s < t\), the increment \(B_t - B_s = W_{\tau_a + t} - W_{\tau_a + s}\) is independent of \(\mathcal{F}_{\tau_a + s}\) (and hence of \(\mathcal{F}_{\tau_a}\)) and is \(\mathcal{N}(0, t-s)\); (3) \(B\) has continuous paths since \(W\) does.

Why the ordinary Markov property is insufficient: The ordinary Markov property holds at fixed (deterministic) times \(s\). But \(\tau_a\) is a random time that depends on the entire trajectory of \(W\) up to the moment it hits \(a\). The event \(\{\tau_a = t\}\) belongs to \(\mathcal{F}_t\) and depends on the path history, so we cannot simply apply the Markov property at a fixed time. We need the extension to stopping times — the strong Markov property — which requires right-continuity of the filtration.

Exercise 2. Let \(D = (-1, 1) \subset \mathbb{R}\) and let \(W_t\) be a standard Brownian motion started at \(x \in D\). Define the exit time \(\tau_D = \inf\{t \ge 0 : W_t \notin D\}\). Using the strong Markov property and the fact that \(f(x) = x\) is harmonic, compute \(\mathbb{E}^x[W_{\tau_D}]\). Then use \(g(x) = x^2 - t\) (which satisfies \(\mathcal{L}g = 0\) for BM) to find \(\mathbb{E}^x[\tau_D]\).

Solution to Exercise 2

Computing \(\mathbb{E}^x[W_{\tau_D}]\): The function \(f(x) = x\) is harmonic for Brownian motion (\(\mathcal{L}f = \frac{1}{2}f'' = 0\)), so \(M_t = W_{t \wedge \tau_D}\) is a martingale (by the optional sampling theorem / strong Markov property). Since \(W\) exits \(D = (-1,1)\) at either \(-1\) or \(+1\):

\[ \mathbb{E}^x[W_{\tau_D}] = \mathbb{E}^x[M_0] = x \]

by the optional sampling theorem (with \(\tau_D\) bounded in \(L^1\) for BM on a bounded interval).

Computing \(\mathbb{E}^x[\tau_D]\): Consider \(g(x, t) = x^2 - t\). For BM, \(\mathcal{L}g = \frac{1}{2} \cdot 2 - 1 = 0\) (where the \(-1\) comes from \(\partial_t g\)), so \(g(W_t, t) = W_t^2 - t\) is a martingale. By optional sampling at \(\tau_D\):

\[ \mathbb{E}^x[W_{\tau_D}^2 - \tau_D] = x^2 - 0 \]

Since \(W_{\tau_D} \in \{-1, +1\}\), we have \(W_{\tau_D}^2 = 1\) a.s., so

\[ 1 - \mathbb{E}^x[\tau_D] = x^2 \]

Therefore

\[ \mathbb{E}^x[\tau_D] = 1 - x^2 \]

Exercise 3. Let \(\tau\) be a stopping time with \(\mathbb{P}(\tau < \infty) = 1\). Define the discrete approximation \(\tau_n = \lceil 2^n \tau \rceil / 2^n\). Show that \(\tau_n\) is a stopping time, \(\tau_n \ge \tau\), and \(\tau_n \downarrow \tau\) as \(n \to \infty\). Explain why right-continuity of the filtration \((\mathcal{F}_t)\) is needed in Step 3 of the proof to conclude \(\bigcap_n \mathcal{F}_{\tau_n} = \mathcal{F}_\tau\).

Solution to Exercise 3

\(\tau_n\) is a stopping time: For any \(t \ge 0\), the event \(\{\tau_n \le t\} = \{\lceil 2^n \tau \rceil / 2^n \le t\} = \{\tau \le \lfloor 2^n t \rfloor / 2^n\}\). Since \(\lfloor 2^n t \rfloor / 2^n\) is a deterministic value and \(\tau\) is a stopping time, \(\{\tau \le \lfloor 2^n t \rfloor / 2^n\} \in \mathcal{F}_{\lfloor 2^n t \rfloor / 2^n} \subseteq \mathcal{F}_t\). So \(\tau_n\) is a stopping time.

\(\tau_n \ge \tau\): By definition, \(\lceil 2^n \tau \rceil \ge 2^n \tau\), so \(\tau_n = \lceil 2^n \tau \rceil / 2^n \ge \tau\).

\(\tau_n \downarrow \tau\): We have \(\tau_n - \tau \le 1/2^n\) since \(\lceil k \rceil - k < 1\) for any real \(k\). Therefore \(\tau_n \to \tau\) as \(n \to \infty\).

Why right-continuity is needed: We have \(\mathcal{F}_{\tau_n} \supseteq \mathcal{F}_\tau\) for all \(n\) (since \(\tau_n \ge \tau\)), so \(\bigcap_n \mathcal{F}_{\tau_n} \supseteq \mathcal{F}_\tau\). For the reverse inclusion, we need \(\bigcap_n \mathcal{F}_{\tau_n} \subseteq \mathcal{F}_\tau\). Since \(\tau_n \downarrow \tau\), the stopped \(\sigma\)-algebras satisfy \(\bigcap_n \mathcal{F}_{\tau_n} = \mathcal{F}_{\tau+}\) (the right-limit \(\sigma\)-algebra). Right-continuity of the filtration means \(\mathcal{F}_{\tau+} = \mathcal{F}_\tau\), which gives \(\bigcap_n \mathcal{F}_{\tau_n} = \mathcal{F}_\tau\). Without right-continuity, \(\mathcal{F}_{\tau+}\) could be strictly larger than \(\mathcal{F}_\tau\), and the backward martingale convergence argument in Step 3 would not yield the strong Markov property at \(\tau\).

Exercise 4. Give an example of a Markov process that satisfies the ordinary Markov property at all fixed times but fails the strong Markov property at some stopping time. (Hint: consider a process whose transition mechanism depends on the time parameter in a discontinuous way.)

Solution to Exercise 4

Consider the following process on \(\mathbb{R}\). Let \(\xi\) be a uniform random variable on \([0,1]\), and define

\[ X_t = \mathbf{1}_{[0,\infty)}(t - \xi) \]

That is, \(X_t = 0\) for \(t < \xi\) and \(X_t = 1\) for \(t \ge \xi\). This process has the Markov property at fixed times: given \(X_s\), the conditional law of \(X_t\) for \(t > s\) depends only on \(X_s\) (if \(X_s = 1\), then \(X_t = 1\); if \(X_s = 0\), then \(X_t = 1\) with probability \((t-s)/(1-s)\) conditional on \(\xi > s\)).

Now consider the stopping time \(\tau = \inf\{t : X_t = 1\} = \xi\). The strong Markov property would require that, conditionally on \(\mathcal{F}_\tau\), the post-\(\tau\) process \((X_{\tau+t})_{t \ge 0}\) depends only on \(X_\tau = 1\). The future is indeed deterministic (\(X_{\tau+t} = 1\) for all \(t\)), so this particular stopping time does not reveal a failure.

A cleaner example: let \((Y_t)_{t \ge 0}\) be defined as follows. Let \(U \sim \text{Uniform}(0,1)\) and set \(Y_t = 0\) for \(t \le U\), and for \(t > U\) define \(Y_t\) to follow one of two deterministic trajectories depending on the fractional part of \(U\) in a fine way that is invisible at fixed times but revealed at the jump time \(\tau = U\).

More concretely, a classical counterexample uses a non-Feller Markov chain in continuous time. Let the state space be \(\{0,1,2\}\) with transition rates that depend on time in a discontinuous manner: from state \(0\), the process jumps to state \(1\) at rate \(q(t) = \mathbf{1}_{\mathbb{Q}}(t)\) and to state \(2\) at rate \(1 - q(t)\). At any fixed time \(t\), the transition probabilities are well-defined and the Markov property holds. But at the stopping time \(\tau = \inf\{t : X_t \ne 0\}\), the jump destination reveals whether \(\tau\) is rational or irrational — information not determined by \(X_\tau\) alone — so the strong Markov property fails.

Exercise 5. Using the reflection principle (which follows from the strong Markov property), compute

\[ \mathbb{P}\!\left(\max_{0 \le s \le t} W_s \ge a,\; W_t \le b\right) \]

for \(a > 0\) and \(b < a\), where \(W\) is a standard Brownian motion. Express your answer in terms of the standard normal CDF \(\Phi\).

Solution to Exercise 5

For \(a > 0\) and \(b < a\), define \(\tau_a = \inf\{s : W_s = a\}\). By the strong Markov property, \((W_{\tau_a + s} - a)_{s \ge 0}\) is a standard BM independent of \(\mathcal{F}_{\tau_a}\).

We compute \(\mathbb{P}(\max_{0 \le s \le t} W_s \ge a,\, W_t \le b)\). On the event \(\{\tau_a \le t\}\) (equivalently \(\{\max_{s \le t} W_s \ge a\}\)), we have

\[ W_t = a + (W_t - W_{\tau_a}) = a + B_{t - \tau_a} \]

where \(B_s = W_{\tau_a + s} - a\) is a standard BM independent of \(\tau_a\). So

\[ \mathbb{P}(\tau_a \le t,\, W_t \le b) = \mathbb{P}(\tau_a \le t,\, a + B_{t-\tau_a} \le b) \]

By the reflection principle: on \(\{\tau_a \le t\}\), \(W_t\) and \(2a - W_t\) have the same conditional distribution (since \(B_{t-\tau_a}\) and \(-B_{t-\tau_a}\) have the same law). Therefore

\[ \mathbb{P}(\tau_a \le t,\, W_t \le b) = \mathbb{P}(\tau_a \le t,\, 2a - W_t \le b) = \mathbb{P}(\tau_a \le t,\, W_t \ge 2a - b) \]

Since \(b < a\) implies \(2a - b > a\), the event \(\{W_t \ge 2a - b\}\) implies \(\{\tau_a \le t\}\), so

\[ \mathbb{P}(\tau_a \le t,\, W_t \ge 2a - b) = \mathbb{P}(W_t \ge 2a - b) \]

Therefore

\[ \mathbb{P}\!\left(\max_{0 \le s \le t} W_s \ge a,\; W_t \le b\right) = \mathbb{P}(W_t \ge 2a - b) = 1 - \Phi\!\left(\frac{2a - b}{\sqrt{t}}\right) \]

where \(\Phi\) is the standard normal CDF.

Exercise 6. Let \(X_t\) solve \(\mathrm{d}X_t = -\theta X_t\,\mathrm{d}t + \sigma\,\mathrm{d}W_t\) with \(\theta > 0\) (Ornstein–Uhlenbeck process). Let \(\tau = \inf\{t \ge 0 : X_t = c\}\) for some level \(c\). Explain why the strong Markov property guarantees that the post-\(\tau\) process \((X_{\tau + t})_{t \ge 0}\), conditioned on \(\mathcal{F}_\tau\), behaves like an OU process started from \(c\). What property of the OU semigroup (related to Feller continuity) is needed?

Solution to Exercise 6

The Ornstein–Uhlenbeck process \(\mathrm{d}X_t = -\theta X_t\,\mathrm{d}t + \sigma\,\mathrm{d}W_t\) with \(\theta > 0\) has coefficients \(b(x) = -\theta x\) and \(\sigma(x) = \sigma\) that are globally Lipschitz with linear growth. By the standard existence and uniqueness theorem, the SDE has a unique strong solution.

The OU semigroup \(P_t f(x) = \mathbb{E}^x[f(X_t)]\) satisfies the Feller property: \(P_t\) maps \(C_0(\mathbb{R})\) into \(C_0(\mathbb{R})\), and \(\|P_t f - f\|_\infty \to 0\) as \(t \to 0\). This follows because \(X_t = xe^{-\theta t} + \sigma\int_0^t e^{-\theta(t-s)}\,\mathrm{d}W_s\) is Gaussian with mean \(xe^{-\theta t}\) and variance \(\frac{\sigma^2}{2\theta}(1 - e^{-2\theta t})\), so \(P_t f(x) = \int f(xe^{-\theta t} + y)\,\nu_t(\mathrm{d}y)\) for a Gaussian measure \(\nu_t\), which preserves continuity and vanishing at infinity.

By the general theorem (Feller process with right-continuous filtration implies strong Markov), the OU process is strong Markov. Therefore, at the stopping time \(\tau = \inf\{t : X_t = c\}\), the strong Markov property guarantees:

\[ \mathbb{E}[\varphi(X_{\tau+t}) \mid \mathcal{F}_\tau] = \mathbb{E}^{X_\tau}[\varphi(X_t)] = \mathbb{E}^c[\varphi(X_t)] \]

The post-\(\tau\) process \((X_{\tau+t})_{t \ge 0}\), conditioned on \(\mathcal{F}_\tau\), behaves as an OU process started from \(c\), independent of the pre-\(\tau\) history. The key property is Feller continuity of the semigroup — the continuity \(x \mapsto P_t f(x)\) ensures the restart argument works at the random location \(X_\tau = c\).

Exercise 7. Explain the distinction between the strong Markov property and the optional sampling theorem. Construct a scenario involving a submartingale \(Y_t\) and a stopping time \(\tau\) where the optional sampling theorem applies but the strong Markov property is not relevant, and vice versa.

Solution to Exercise 7

The strong Markov property concerns the conditional law of the future process given \(\mathcal{F}_\tau\): it states \(\mathbb{E}[\varphi(X_{\tau+t}) \mid \mathcal{F}_\tau] = \mathbb{E}^{X_\tau}[\varphi(X_t)]\). It is a property of Markov processes and applies to the full path after \(\tau\).

The optional sampling theorem (OST) concerns the expected value of a (sub/super)martingale at a stopping time: for a martingale \(M\) and bounded stopping times \(\sigma \le \tau\), it states \(\mathbb{E}[M_\tau \mid \mathcal{F}_\sigma] = M_\sigma\). It applies to any martingale, regardless of whether it is Markov.

OST applies, strong Markov not relevant: Let \(Y_t = e^{W_t - t/2}\) (the exponential martingale). Let \(\tau = \min(1, \inf\{t : Y_t = 2\})\). The OST gives \(\mathbb{E}[Y_\tau] = Y_0 = 1\). Here \(Y_t\) is not a Markov process on its own (its conditional expectation depends on \(t\), not just \(Y_t\)), so the strong Markov property for \(Y\) is not meaningful.

Strong Markov applies, OST not relevant: Let \(X_t\) be an OU process and \(\tau = \inf\{t : X_t = c\}\). The strong Markov property tells us the full distributional behavior of \((X_{\tau+t})_{t \ge 0}\). Now consider the submartingale \(Y_t = |X_t|\). The OST for submartingales gives the inequality \(\mathbb{E}[Y_\tau] \ge Y_0\) (under appropriate integrability), but this says nothing about the post-\(\tau\) law. The strong Markov property provides the richer conclusion that the process restarts as an OU process from \(c\).