Classification of Second-Order PDEs¶
The behavior of a second-order PDE is determined by its type: elliptic, parabolic, or hyperbolic. This classification governs which boundary conditions are appropriate, whether solutions are smooth or can develop shocks, and which numerical methods converge. In finance, virtually every pricing PDE is parabolic -- a direct consequence of the diffusion nature of asset price models.
The General Second-Order Linear PDE¶
In two independent variables \((x, y)\), the most general second-order linear PDE takes the form:
where \(A\), \(B\), \(C\) may depend on \((x, y)\).
The classification depends on the discriminant:
| Discriminant | Type | Prototype | Physical Behavior |
|---|---|---|---|
| \(\Delta < 0\) | Elliptic | Laplace equation | Steady-state, equilibrium |
| \(\Delta = 0\) | Parabolic | Heat equation | Diffusion, smoothing |
| \(\Delta > 0\) | Hyperbolic | Wave equation | Propagation, oscillation |
Analogy with Conic Sections
The names come from the classification of the conic \(Ax^2 + 2Bxy + Cy^2 = 1\). The same discriminant \(B^2 - AC\) determines whether the conic is an ellipse (\(< 0\)), parabola (\(= 0\)), or hyperbola (\(> 0\)). This is not a coincidence: the characteristic curves of the PDE are governed by the same quadratic form.
The Three Types in Detail¶
Elliptic Equations¶
Prototype: Laplace's equation
Here \(A = C = 1\), \(B = 0\), so \(\Delta = -1 < 0\).
Key properties:
- Solutions are infinitely smooth (analytic, in fact)
- Maximum principle: the maximum of \(u\) occurs on the boundary
- Boundary data on the entire boundary determines the solution (Dirichlet problem)
- No preferred direction of propagation -- information spreads in all directions simultaneously
Financial example: The steady-state bond pricing equation when coefficients are time-independent:
where \(\mathcal{L}\) is the generator of the short-rate process. This is an elliptic ODE in one spatial dimension.
Parabolic Equations¶
Prototype: The heat equation
Rewriting as \(-\frac{\partial u}{\partial t} + \frac{1}{2}\frac{\partial^2 u}{\partial x^2} = 0\) and setting \((x,y) = (x,t)\): we have \(A = \frac{1}{2}\), \(B = 0\), \(C = 0\), giving \(\Delta = 0\).
Key properties:
- Smoothing: Solutions become instantly \(C^\infty\) for \(t > 0\), even for rough initial data
- Maximum principle: Solutions cannot exceed their initial/boundary maxima
- Causality: Information propagates forward in time -- the solution at time \(t\) depends only on data at earlier times
- Initial condition at \(t = 0\) plus boundary conditions determine the solution
Financial example: The Black-Scholes PDE is parabolic.
Hyperbolic Equations¶
Prototype: The wave equation
Here \(A = c^2\), \(B = 0\), \(C = -1\), so \(\Delta = c^2 > 0\).
Key properties:
- Finite propagation speed: Disturbances travel at speed \(c\)
- No smoothing: Discontinuities in initial data persist
- Solutions can develop shocks and discontinuities
- Two initial conditions needed: \(u(0,x)\) and \(u_t(0,x)\)
Financial relevance: Hyperbolic PDEs are essentially absent from standard financial models. Asset prices are modeled as diffusions (parabolic), not waves. However, certain mean-field game formulations in systemic risk can produce hyperbolic-like behavior.
Why Financial PDEs Are Parabolic¶
The Black-Scholes PDE is:
Identifying \(A = \frac{1}{2}\sigma^2 S^2\), \(B = 0\), and \(C = 0\) (there is no \(\partial^2 V / \partial t^2\) term):
The equation is parabolic (provided \(\sigma > 0\) and \(S > 0\)).
The Deep Reason
Financial PDEs are parabolic because they arise from diffusion processes via the Feynman-Kac theorem. Any Ito diffusion
has a generator \(\mathcal{L} = \mu\partial_x + \frac{1}{2}\sigma^2\partial_{xx}\) that is a second-order operator with no mixed time-space second derivatives. The resulting PDE \(\partial_t u + \mathcal{L}u = 0\) is always parabolic. Parabolicity is the PDE signature of Brownian diffusion.
Classification for Variable Coefficients¶
When coefficients depend on position, the type can vary from point to point. For a general operator:
the classification depends on the eigenvalues of the coefficient matrix \(A = (a_{ij})\):
| Condition on Eigenvalues | Type |
|---|---|
| All nonzero, same sign | Elliptic |
| One zero eigenvalue, rest same sign | Parabolic |
| Nonzero eigenvalues of mixed sign | Hyperbolic |
Degenerate Parabolicity¶
When \(\sigma(x) = 0\) at some point, the PDE degenerates -- it loses its second-order character there. This matters in finance:
- At \(S = 0\) in Black-Scholes: \(\frac{1}{2}\sigma^2 S^2 \to 0\), so the equation degenerates. The boundary \(S = 0\) is an absorbing state where no boundary condition is needed
- At the Feller boundary in the CIR model: \(\frac{1}{2}\xi^2 v \to 0\) as \(v \to 0\), leading to subtle boundary behavior
Degeneracy and Regularity
At points of degeneracy, the PDE may not have classical (smooth) solutions. The theory of viscosity solutions provides the correct framework for well-posedness in degenerate cases.
The Multidimensional Case¶
For a system with \(d\) spatial dimensions and one time dimension, the pricing PDE has the form:
where \(\Sigma\) is the volatility matrix from the SDE \(dX_t = \mu\,dt + \Sigma\,dW_t\).
Classification: The equation is parabolic if the diffusion matrix \(a = \Sigma\Sigma^\top\) is positive semi-definite. It is uniformly parabolic (non-degenerate) if \(a\) is positive definite, i.e., \(\Sigma\) has full rank.
Example: In the Heston model with state \((S, v)\), the diffusion matrix is:
The determinant is \(\xi^2 v^2 S^2(1-\rho^2) > 0\) when \(v > 0\), \(S > 0\), and \(|\rho| < 1\), so the system is non-degenerate parabolic in the interior.
Characteristic Curves and Information Flow¶
Each PDE type has a distinct pattern of characteristic curves along which information propagates:
- Elliptic: No real characteristics -- information spreads in all directions instantly
- Parabolic: One family of characteristics (the time slices \(t = \text{const}\)) -- information flows forward in time
- Hyperbolic: Two families of characteristics -- information propagates at finite speed along specific directions
For financial PDEs, the parabolic structure means:
- Terminal conditions at \(t = T\) propagate backward to earlier times
- The solution at \((t, S)\) depends on the terminal condition everywhere (infinite propagation speed in space)
- Solutions are smooth for \(t < T\) even if the terminal payoff is non-smooth
Smoothing in Action
The call payoff \(g(S) = (S - K)^+\) has a kink at \(S = K\). Yet for any \(t < T\), the Black-Scholes price \(V(t, S)\) is a smooth (\(C^\infty\)) function of \(S\). This is the parabolic smoothing property -- the diffusion in the PDE instantly regularizes discontinuities.
Summary of the Classification¶
| Aspect | Elliptic | Parabolic | Hyperbolic |
|---|---|---|---|
| Prototype | \(\Delta u = 0\) | \(u_t = \frac{1}{2}u_{xx}\) | \(u_{tt} = c^2 u_{xx}\) |
| Smoothing | Yes | Yes | No |
| Max principle | Yes | Yes | No |
| Propagation | All directions | Forward in time | Along characteristics |
| Conditions | Boundary only | Initial + boundary | Two initial conditions |
| Finance | Steady-state problems | Pricing PDEs | Not standard |
In finance, parabolic PDEs are the rule because asset prices are modeled as diffusion processes. The classification tells us that pricing problems are well-posed with a terminal condition and appropriate boundary conditions, and that solutions enjoy the smoothing and maximum principle properties inherited from diffusion.
See Also¶
- Why PDEs in Finance -- motivation for the PDE approach
- Boundary Value Problems -- appropriate conditions for each PDE type
- The SDE-PDE Bridge -- how diffusion generators produce parabolic PDEs
- Maximum Principle -- the key property of parabolic equations
Exercises¶
Exercise 1. Classify each of the following PDEs as elliptic, parabolic, or hyperbolic by computing the discriminant \(\Delta = B^2 - AC\):
(a) \(u_{xx} + 2u_{xy} + u_{yy} = 0\)
(b) \(u_{xx} - 4u_{yy} = 0\)
(c) \(3u_{xx} + 2u_{xy} + u_{yy} = 0\)
Solution to Exercise 1
(a) \(u_{xx} + 2u_{xy} + u_{yy} = 0\): Here \(A = 1\), \(B = 1\) (the coefficient of \(u_{xy}\) is \(2B\), so \(B = 1\)), \(C = 1\).
The equation is parabolic.
(b) \(u_{xx} - 4u_{yy} = 0\): Here \(A = 1\), \(B = 0\), \(C = -4\).
The equation is hyperbolic. (This is a wave equation with speed \(c = 2\).)
(c) \(3u_{xx} + 2u_{xy} + u_{yy} = 0\): Here \(A = 3\), \(B = 1\) (since the coefficient of \(u_{xy}\) is \(2B = 2\)), \(C = 1\).
The equation is elliptic.
Exercise 2. The Black-Scholes PDE is \(\partial_t V + rS\partial_S V + \frac{1}{2}\sigma^2 S^2 \partial_{SS} V - rV = 0\). Identify the highest-order coefficients \(A\), \(B\), \(C\) (treating \((t, S)\) as the two independent variables) and verify that the discriminant gives \(\Delta = 0\), confirming that the equation is parabolic.
Solution to Exercise 2
The Black-Scholes PDE is:
With independent variables \((x, y) = (S, t)\), the highest-order terms are:
- Coefficient of \(\partial^2 V / \partial S^2\): this gives \(A = \frac{1}{2}\sigma^2 S^2\)
- Coefficient of \(\partial^2 V / \partial S\,\partial t\): there is no mixed second-order term, so \(2B = 0\), hence \(B = 0\)
- Coefficient of \(\partial^2 V / \partial t^2\): there is no such term, so \(C = 0\)
The discriminant is:
Since \(\Delta = 0\), the equation is parabolic, as expected for a pricing PDE arising from a diffusion process. Note that this classification holds for all \(S > 0\) and \(\sigma > 0\); at \(S = 0\) the equation degenerates (the leading coefficient \(A\) vanishes).
Exercise 3. The heat equation has the smoothing property: solutions become \(C^\infty\) for \(t > 0\) even if the initial data is discontinuous. The wave equation does not have this property. Explain this difference in terms of the PDE classification and the nature of characteristic curves for parabolic versus hyperbolic equations.
Solution to Exercise 3
Parabolic equations (heat equation): The discriminant \(\Delta = 0\) means there is exactly one family of real characteristics -- the time slices \(t = \text{const}\). Information from the initial data propagates instantaneously in all spatial directions. In Fourier space, an initial mode \(e^{ikx}\) evolves as \(e^{ikx - \frac{1}{2}k^2 t}\), where the factor \(e^{-\frac{1}{2}k^2 t}\) damps high-frequency components exponentially fast. This damping is stronger for larger \(|k|\), which explains why the heat equation smooths rough initial data instantly: all discontinuities and kinks are immediately suppressed.
Hyperbolic equations (wave equation): The discriminant \(\Delta > 0\) means there are two families of real characteristics \(x \pm ct = \text{const}\). Information propagates at finite speed \(c\) along these characteristics. The general solution \(u(t,x) = f(x - ct) + g(x + ct)\) shows that the initial profile is simply translated without distortion. In Fourier space, modes evolve as \(e^{i(kx \pm kct)}\) -- pure oscillation with no damping. Discontinuities in the initial data travel along the characteristics indefinitely without being smoothed.
Summary: The parabolic structure implies infinite propagation speed with exponential frequency damping (smoothing), while the hyperbolic structure implies finite propagation speed with no damping (preservation of singularities). This is why the heat equation instantaneously regularizes any initial data, whereas the wave equation preserves discontinuities forever.
Exercise 4. For the Heston model, the diffusion matrix is
Compute the determinant and eigenvalues of \(a\). Under what conditions on \(v\), \(S\), \(\rho\), and \(\xi\) is the pricing PDE non-degenerate parabolic? What happens at \(v = 0\) or \(S = 0\)?
Solution to Exercise 4
The diffusion matrix of the Heston model is:
Determinant:
Eigenvalues: The trace is \(\text{tr}(a) = vS^2 + \xi^2 v = v(S^2 + \xi^2)\), and \(\det(a) = \xi^2 v^2 S^2(1-\rho^2)\). The eigenvalues \(\lambda_{1,2}\) satisfy:
The discriminant under the square root simplifies to:
so both eigenvalues are real and non-negative.
Non-degenerate parabolic condition: The PDE is non-degenerate parabolic when both eigenvalues are strictly positive, which requires \(\det(a) > 0\). This holds when:
- \(v > 0\) (variance is positive)
- \(S > 0\) (stock price is positive)
- \(|\rho| < 1\) (correlation is not \(\pm 1\))
- \(\xi > 0\) (vol-of-vol is positive)
At \(v = 0\): Both entries involving \(v\) vanish, so \(a = 0\) (the zero matrix). The PDE completely degenerates -- there is no diffusion. This corresponds to the variance process hitting zero, where the Feller boundary condition determines the behavior.
At \(S = 0\): The matrix becomes \(a = \begin{pmatrix} 0 & 0 \\ 0 & \xi^2 v \end{pmatrix}\), which has rank 1 (if \(v > 0\)). The PDE degenerates in the \(S\)-direction but remains diffusive in the \(v\)-direction. As with Black-Scholes, \(S = 0\) is an absorbing boundary for geometric Brownian motion.
Exercise 5. Explain why the Black-Scholes PDE degenerates at \(S = 0\) (the coefficient of \(\partial_{SS}\) vanishes). What does this degeneracy imply about the nature of the boundary at \(S = 0\) for geometric Brownian motion? Why is no boundary condition required there, unlike at a finite barrier?
Solution to Exercise 5
The Black-Scholes PDE has the second-order term \(\frac{1}{2}\sigma^2 S^2 \frac{\partial^2 V}{\partial S^2}\). At \(S = 0\), the coefficient \(\frac{1}{2}\sigma^2 S^2\) vanishes, so the PDE loses its second-order (diffusive) character. Similarly, the drift term \(rS\frac{\partial V}{\partial S}\) also vanishes. The equation reduces to:
which is a first-order ODE in \(t\) that determines \(V(t, 0)\) uniquely from the terminal condition \(V(T, 0) = g(0)\).
Why no boundary condition is needed: For a non-degenerate parabolic PDE on a domain with boundary, a boundary condition is required because the diffusion term allows the process to reach the boundary, and we must specify what happens there. At \(S = 0\), the diffusion coefficient vanishes, so the PDE cannot "transport" information through the boundary \(S = 0\). The equation is self-contained at that point.
Probabilistic connection: Geometric Brownian motion has the explicit solution:
The exponential function is strictly positive for all real arguments, so \(S_t > 0\) for all \(t \geq 0\) whenever \(S_0 > 0\). Therefore \(\mathbb{P}(S_t = 0 \text{ for some } t > 0) = 0\). Since the diffusion never reaches \(S = 0\), there are no sample paths that require a boundary rule at \(S = 0\). The conditional expectation \(V(t, S) = e^{-r(T-t)}\mathbb{E}^{\mathbb{Q}}[g(S_T) \mid S_t = S]\) is fully determined without specifying boundary behavior.
In contrast, at a finite barrier \(B > 0\), the coefficient \(\frac{1}{2}\sigma^2 B^2 > 0\) and the process reaches \(B\) with positive probability, so a boundary condition is necessary to specify the option's behavior upon barrier contact.
Exercise 6. For an Ito diffusion \(dX_t = \mu(X_t)\,dt + \sigma(X_t)\,dW_t\), the generator \(\mathcal{L} = \mu\partial_x + \frac{1}{2}\sigma^2\partial_{xx}\) is a second-order operator with no mixed \(\partial_{tx}\) term. Explain why this structure always produces a parabolic PDE \(\partial_t u + \mathcal{L}u = 0\), regardless of the choice of \(\mu\) and \(\sigma\) (provided \(\sigma \neq 0\)).
Solution to Exercise 6
For a general Ito diffusion \(dX_t = \mu(X_t)\,dt + \sigma(X_t)\,dW_t\), the infinitesimal generator is:
The Kolmogorov backward equation for \(u(t, x) = \mathbb{E}[g(X_T) \mid X_t = x]\) is:
Written explicitly:
Identifying the second-order coefficients with \((x, y) = (x, t)\):
- \(A = \frac{1}{2}\sigma^2(x)\) (coefficient of \(\partial_{xx}\))
- \(B = 0\) (no mixed \(\partial_{xt}\) term)
- \(C = 0\) (no \(\partial_{tt}\) term)
The discriminant is \(\Delta = B^2 - AC = 0 - \frac{1}{2}\sigma^2(x) \cdot 0 = 0\).
This gives \(\Delta = 0\) regardless of the specific forms of \(\mu\) and \(\sigma\). The reason is structural: the Ito SDE is first-order in time (\(dX_t = \ldots\,dt + \ldots\,dW_t\)), so the generator \(\mathcal{L}\) contains no time derivatives. The resulting PDE \(\partial_t u + \mathcal{L}u = 0\) is therefore first-order in \(t\) and (at most) second-order in \(x\), with no \(\partial_{tt}\) or \(\partial_{tx}\) terms. This structure forces \(B = C = 0\) and hence \(\Delta = 0\), making the PDE parabolic.
The condition \(\sigma \neq 0\) ensures that \(A = \frac{1}{2}\sigma^2 > 0\), so the equation is genuinely second-order (non-degenerate parabolic), with the associated smoothing and maximum principle properties. If \(\sigma = 0\) everywhere, the equation reduces to a first-order PDE (transport equation) rather than a parabolic one.
Exercise 7. The call payoff \(g(S) = (S - K)^+\) has a kink (discontinuous first derivative) at \(S = K\). Yet for any \(t < T\), the Black-Scholes price \(V(t, S)\) is a smooth function of \(S\). This is the parabolic smoothing property. Give an intuitive explanation using the probabilistic interpretation: \(V(t, S) = e^{-r(T-t)}\mathbb{E}^{\mathbb{Q}}[(S_T - K)^+ \mid S_t = S]\) is an average over many possible terminal values, which smooths out the kink.
Solution to Exercise 7
The Black-Scholes price of a European call is:
Under the risk-neutral measure, \(S_T = S\exp\!\left((r - \frac{1}{2}\sigma^2)(T-t) + \sigma\sqrt{T-t}\,Z\right)\) where \(Z \sim N(0,1)\). Thus:
Why smoothing occurs: The payoff \((S_T - K)^+\) has a kink at \(S_T = K\). However, the price \(V(t, S)\) is an integral of this kinked function against the Gaussian density \(\phi(z)\). This integral is a convolution-like operation that averages the payoff over all possible outcomes of \(S_T\).
For any \(t < T\), the distribution of \(S_T \mid S_t = S\) is a log-normal distribution with positive variance \(\sigma^2(T-t) > 0\). The key mechanism is:
- When \(S\) varies smoothly, the entire log-normal distribution of \(S_T\) shifts smoothly
- The kink at \(S_T = K\) is a fixed feature of the integrand, but the weight placed on values near \(K\) changes smoothly as \(S\) varies
- Integration against a smooth (Gaussian) kernel produces a smooth function
Technically, differentiation under the integral sign is justified because the Gaussian density \(\phi(z)\) is \(C^\infty\) and decays rapidly. Each derivative \(\frac{\partial^k V}{\partial S^k}\) passes through the integral, producing a finite result for all orders \(k\).
At \(t = T\), however, the variance \(\sigma^2(T - t) = 0\), so \(S_T = S\) deterministically. The "averaging" collapses to a point evaluation, and the kink reappears: \(V(T, S) = (S - K)^+\). The smoothing is only effective when there is genuine uncertainty (\(T - t > 0\)) to "smear out" the non-smoothness.