Setup
Model calibration is the process of choosing a parameter vector so that a model's prices (or implied volatilities) best reproduce a set of observed market quotes. It is the bridge between a theoretical model and a live trading system: a model with uncalibrated parameters is mathematically interesting but commercially useless.
Financial Insight. Every derivatives desk calibrates models to market prices every morning — and often intraday after large moves. A vol desk calibrating Heston to the S&P 500 surface runs this optimisation on 30–60 strike-expiry pairs in under a second. A rates desk calibrating an LMM to the swaption cube may solve a sequence of smaller optimisations (cascade calibration) over 50–100 swaptions. The mathematical framework is the same in each case: a non-linear least-squares (NLS) problem.
Assumptions for this module:
- The model is a smooth function of on an open domain (smoothness ensures the Jacobian exists).
- Market quotes are point observations: we observe prices or implied vols at specified strikes and expiries, with no bid-ask structure modelled.
- The pricing function is deterministic: given , is evaluated exactly (no Monte Carlo noise; this assumption is relaxed when we treat calibration under MC pricing).
- We work with implied volatility space by default. Fitting in price space is discussed under Limitations.
- Constraints on parameters (positivity, correlation bounds, Feller condition) are handled via bound constraints or interior-point methods. This module focuses on the unconstrained problem; constraints enter naturally in the Jacobian rank and KKT conditions.
Theory
The Calibration Objective Function
Definition 1.1 (Calibration Residuals and Objective). Let be a parameter vector. For each instrument , define the residual where is the market quote (e.g.\ implied vol) and is the model's corresponding output. The weighted NLS objective is where is a diagonal weight matrix and .
The factor of is conventional — it cancels the factor of 2 in gradient expressions. The objective is everywhere non-negative, with global minimum at if and only if a perfect fit exists (an exact solution ).
Weighting choices in practice:
| Scheme | Rationale | |
|---|---|---|
| Uniform | Equal treatment of all instruments | |
| Bid-ask inverse | Down-weights illiquid wings | |
| Vega-weighted | Equivalent to fitting price errors scaled by vega | |
| Relative | Penalises relative (not absolute) misfit |
On a live vol desk, bid-ask weights are standard: wide spreads signal illiquid options whose market quotes are less reliable. Uniform weighting in vol space is the starting point for calibration diagnostics.
The Jacobian Matrix
Definition 1.2 (Jacobian). The Jacobian of the residual vector is with entries .
(The sign follows from ; market quotes are independent of .)
The Jacobian encodes the sensitivity of every model output to every parameter. For a 5-parameter Heston model fitted to 40 market vols, is — a tall, overdetermined system.
Gradient and Hessian
Differentiating the objective:
At a local minimum , the first-order condition is:
This says the weighted residuals are orthogonal to the column space of — a geometric statement that the residual vector is in the orthogonal complement of the model's tangent plane.
The exact Hessian involves second derivatives of the model:
Definition 1.3 (Gauss-Newton Hessian Approximation). The Gauss-Newton approximation discards the second-order term:
The Gauss-Newton approximation is valid when: (i) residuals are small at the solution (model fits well), or (ii) the model is nearly linear in (second derivatives are negligible). In practice, both conditions are approximately satisfied at convergence, making Gauss-Newton the foundation of all efficient NLS solvers.
Remark. is always positive semi-definite (PSD). The exact Hessian may not be PSD away from the solution. Using guarantees descent directions even far from the optimum.
The Normal Equations
Setting the gradient to zero and using the Gauss-Newton approximation:
Theorem 1.1 (Gauss-Newton Normal Equations). The Gauss-Newton update step satisfies the normal equations: The Gauss-Newton update is then .
This is a linear system. For small (Heston has 5 parameters), it is cheap to solve via Cholesky factorisation. The system is singular when has rank less than — signalling parameter identifiability problems.
Example 1.1. For a 2-parameter model () fitted to vol quotes, , and . The normal equations reduce to a linear system solvable in closed form. The solution is the least-squares estimate of the parameter update.
Condition Number and Well-Posedness
The condition number determines how sensitive the solution is to perturbations in market data:
Theorem 1.2 (Sensitivity of NLS Solution). If the calibration objective has a unique minimum and the Jacobian has full column rank , then the sensitivity of to a perturbation in market quotes satisfies: A large condition number indicates that small changes in market quotes produce large changes in calibrated parameters — a signature of ill-posedness.
In Heston calibration, the condition number of is often –. This is why regularisation (Module: Regularisation & Stability) is necessary before reporting model parameters as "stable".
Price Space vs. Volatility Space
A critical practical choice is whether to calibrate in price space or implied vol space.
Warning — Price Space Calibration. If represents model prices (in dollars), deeply in-the-money options have prices close to their intrinsic value and carry negligible model sensitivity. Deep OTM options have small prices but carry most of the smile information. Minimising in price space implicitly up-weights expensive (ITM) options and down-weights cheap (OTM) ones — the opposite of what the vol desk cares about. Calibrate in implied vol space or use vega weighting to transform back to approximately uniform vol sensitivity.
Bound Constraints and KKT Conditions
Most calibration problems impose bounds: for all . The first-order optimality condition at a constrained solution is given by the Karush-Kuhn-Tucker (KKT) conditions:
Practically: parameters at their bounds have non-zero multipliers (they are constrained by the bound, not by the objective). Parameters strictly interior to their bounds satisfy the unconstrained first-order condition .
Financial Insight. In Heston calibration, the Feller condition ensures the variance process stays positive. This is an inequality constraint on parameters. In practice, many calibrated Heston surfaces violate Feller — the optimiser is not enforcing it unless explicitly coded. A quant who does not check this is reporting mathematically invalid parameters.
Validation
The companion notebook constructs a 2-parameter toy model (parameterised normal smile), generates synthetic market quotes, then verifies:
- The residual vector is zero at the true parameters.
- The Jacobian computed by central finite differences matches the analytic sensitivities to within accuracy.
- The Gauss-Newton step from a perturbed initial point moves toward the true solution.
- The condition number of changes with the choice of instruments (uniform strikes vs. clustered at the money).
Before opening the notebook: For the function evaluated at with observations : (a) Write down the Jacobian at . (b) Compute the Gauss-Newton update . (c) Is the solution unique? What would make the system degenerate?
Limitations
Warning — Non-Convexity. The NLS objective for any nontrivial model (Heston, SABR, LMM) is non-convex in . The Gauss-Newton algorithm finds a local minimum, not the global one. Multiple local minima can yield very different parameter sets that fit the market equally well numerically but extrapolate very differently. Global search (grid initialisation, differential evolution, or basin hopping) is required before trusting a calibrated parameter set.
Warning — Overfitting the Wing. With many parameters (e.g. LMM with 50+ rates), the model has enough degrees of freedom to fit every market quote exactly, including noise. Perfect fit () is not the goal; a smooth fit that generalises to unobserved strikes is. Regularisation terms penalising parameter roughness are necessary — see the Regularisation & Stability module.
Other limitations:
- Sparse market data: Far-dated expiries often have 3 quoted strikes. with rank deficiency produces unstable parameter estimates. Fix: regularisation or parameter fixing.
- MC pricing noise: If model prices are computed by Monte Carlo, the objective has stochastic noise. Standard NLS solvers assume deterministic objectives; use of noisy objectives requires specialised methods (SPSA, stochastic approximation).
- Non-smooth models: Jump models (Merton, Variance Gamma) have less smooth parameter dependence, degrading finite-difference Jacobian accuracy.
- Numeraire and convention sensitivity: A mismatch between market vol convention (Bachelier vs. Black, forward vs. spot, ACT/360 vs. ACT/365) in and produces a spurious calibration error that the optimiser will try — and partially succeed — to absorb via parameter distortion.
Interview Angle
L1 (Junior) — Typical questions:
-
What does it mean to calibrate a model? Expected: fit model parameters to market prices/vols. Minimise sum of squared differences (or similar). Know that this is done fresh each day.
-
Why calibrate in vol space rather than price space? Expected: price space up-weights expensive (ITM) options. Vol space treats all quoted strikes comparably. Vega weighting is a weighted compromise.
-
What is the Jacobian in the context of calibration? Expected: matrix of sensitivities of model outputs to parameters, . Used to compute the Gauss-Newton update step.
L2 (Senior) — Typical questions:
-
Derive the gradient of the NLS objective. When is the Gauss-Newton Hessian approximation valid? Expected: gradient . GN approximation valid when residuals small (good fit) or model nearly linear. Full derivation expected.
-
What does a high condition number of imply for calibration? Expected: small perturbations in market data produce large parameter changes — ill-conditioned problem. Parameters are not uniquely identified. Regularisation needed.
-
Why do calibrated parameters often violate the Feller condition? Expected: because LM finds a local minimum of the vol-space objective, ignoring model validity constraints unless they are explicitly imposed. Parameters at the boundary of the feasible set require active bound handling.
L3 (Researcher) — Typical questions:
-
Under what conditions is the NLS calibration problem locally well-posed, and what can you say about the uniqueness of the global solution? Expected: locally well-posed if has full column rank (implicit function theorem). Global uniqueness requires convexity of — not available for stochastic vol models. Comment on the relationship to identifiability (whether the model is distinguishable from data).
-
Compare L1, L2 (Huber), and L∞ loss functions for calibration. When would you prefer a robust loss? Expected: L2 sensitive to outliers (e.g. stale wing quotes). Huber loss = L2 near origin, L1 in tails, reduces outlier influence. L∞ (minimax) minimises worst-case fit. Discussion of robustness vs. differentiability trade-off.
-
How does calibrating a model with unobservable state variables (like instantaneous variance in Heston) change the problem structure? Expected: is a latent variable. Can treat as a parameter (joint calibration of and ), or estimate via a separate filter (particle filter, Kalman). Joint calibration is standard but is poorly identified from short-dated options alone.