SABR CalibrationLMMSwaption CubeRank ReductionCascade Calibration

SABR and LMM Calibration — Multi-Instrument Fitting

Module 5 of 530 min readLevel: Hard

Setup

The SABR and Libor Market Model (LMM) are the workhorses of rates vol calibration. SABR is calibrated per expiry-tenor pair to a few strikes; LMM is calibrated to the full swaption cube (a 3D array of expiries, tenors, and strikes). Both present calibration challenges qualitatively different from the Heston setting:

  • SABR: very few parameters (3 per slice after fixing β\beta) but a highly non-linear dependence on the ATM vol. The calibration is fast but requires special treatment of the near-ATM region.
  • LMM: many parameters (a full correlation matrix and one vol per rate), but the problem decomposes into tractable sub-problems via rank reduction and cascade calibration.
INSIGHT

Financial Insight. On a rates desk, SABR is run every morning on the swaption cube: for each expiry-tenor pair, a SABR model is calibrated to 5–7 strike quotes. The calibrated (α,ρ,ν)(\alpha, \rho, \nu) per slice defines the "SABR surface". LMM is calibrated less frequently (typically weekly or when the model is used for exotic pricing), as it requires the full swaption cube and is more computationally intensive. The practical skill is knowing which model to use for which task and how to calibrate each efficiently.

Notation and conventions:

  • SABR: forward rate F>0F > 0 under the forward measure for a given expiry-tenor. β[0,1]\beta \in [0,1] fixed. Calibrate (α,ρ,ν)(\alpha, \rho, \nu) where α>0\alpha > 0 (initial vol), ρ(1,1)\rho \in (-1, 1) (skew parameter), ν>0\nu > 0 (vol of vol).
  • LMM (BGM model): NN forward rates F1(t),,FN(t)F_1(t), \ldots, F_N(t) for consecutive tenors. Each FnF_n has its own piecewise-constant vol σn\sigma_n and a common correlation matrix ρRN×N\rho \in \mathbb{R}^{N \times N}.
  • All vols are Black-Scholes normal or log-normal, annualised, in decimal.
  • Swaption strikes quoted as spreads to the ATM forward swap rate.

Theory

SABR Per-Expiry Calibration

Fixing β\beta: The β\beta parameter controls whether the ATM vol is stable (normal dynamics, β=0\beta = 0) or inversely related to the forward rate (log-normal dynamics, β=1\beta = 1). In the EUR post-2016 negative rates environment, β=0\beta = 0 (normal SABR) is standard for caps/floors. For GBP and USD swaptions, β=0.5\beta = 0.5 or β=1\beta = 1 is common. β\beta is not calibrated — it is set by market convention and the risk quant.

REMARK

Remark. β\beta and ρ\rho have degenerate effects on the smile skew. Fitting both simultaneously produces an ill-conditioned Jacobian. Fix β\beta by market convention; calibrate only (α,ρ,ν)(\alpha, \rho, \nu).

The Hagan SABR implied vol formula (for β\beta fixed, KFK \neq F):

σNSABR(K,F;α,ρ,ν)α(FK)(β1)/21+(1β)224ln2 ⁣FK+(1β)41920ln4 ⁣FKzχ(z)[1+((1β)2α224(FK)1β+ρβνα4(FK)(1β)/2+(23ρ2)ν224)T]\sigma_N^{\text{SABR}}(K, F; \alpha, \rho, \nu) \approx \frac{\alpha\,(FK)^{(\beta-1)/2}}{1 + \frac{(1-\beta)^2}{24}\ln^2\!\tfrac{F}{K} + \frac{(1-\beta)^4}{1920}\ln^4\!\tfrac{F}{K}} \cdot \frac{z}{\chi(z)} \cdot \left[1 + \left(\frac{(1-\beta)^2 \alpha^2}{24(FK)^{1-\beta}} + \frac{\rho\beta\nu\alpha}{4(FK)^{(1-\beta)/2}} + \frac{(2 - 3\rho^2)\nu^2}{24}\right)T\right]

where z=να(FK)(1β)/2lnFKz = \frac{\nu}{\alpha}(FK)^{(1-\beta)/2}\ln\tfrac{F}{K} and χ(z)=ln ⁣12ρz+z2+zρ1ρ\chi(z) = \ln\!\frac{\sqrt{1-2\rho z + z^2} + z - \rho}{1 - \rho}.

At the money (K=FK = F), the formula simplifies to:

THEOREM

Theorem 5.1 (SABR ATM Implied Vol). At K=FK = F (ATM), the Hagan formula reduces to: σATMSABR  =  αFβ1[1+((1β)2α224F2(1β)+ρβνα4F1β+(23ρ2)ν224)T]\sigma_{\text{ATM}}^{\text{SABR}} \;=\; \alpha\,F^{\beta-1} \left[1 + \left(\frac{(1-\beta)^2\alpha^2}{24 F^{2(1-\beta)}} + \frac{\rho\beta\nu\alpha}{4 F^{1-\beta}} + \frac{(2-3\rho^2)\nu^2}{24}\right)T\right]

For β=1\beta = 1 (log-normal): σATM=α[1+(ρνα4+(23ρ2)ν224)T]\sigma_{\text{ATM}} = \alpha\left[1 + \left(\frac{\rho\nu\alpha}{4} + \frac{(2-3\rho^2)\nu^2}{24}\right)T\right]

This ATM formula provides a near-explicit equation for α\alpha given σATM\sigma_{\text{ATM}} (it is quadratic in α\alpha for β=1\beta = 1): solve the quadratic to initialise α\alpha, then refine via NLS on all quotes.

SABR calibration procedure (per expiry-tenor slice):

  1. Fix β\beta (market convention).
  2. Estimate α\alpha from the ATM implied vol by solving the ATM formula above.
  3. Set initial ρ\rho \approx ATM skew slope estimate; ν\nu \approx smile curvature estimate.
  4. Run LM with 3 parameters (α,ρ,ν)(\alpha, \rho, \nu) and NK=5N_K = 577 quoted strikes.
  5. Repeat for each expiry-tenor pair in the swaption cube.

Since each per-slice calibration involves only 3 parameters and 5–7 instruments, it converges in 5–15 LM iterations.

WARNING

Warning — Hagan Formula Validity. The Hagan formula is an asymptotic expansion in ν2T\nu^2 T (vol of vol times time). For long-dated expiries (T>10T > 10) or high vol of vol (ν>1\nu > 1), the approximation breaks down and the formula can produce negative vols or arbitrage violations. Use the Antonov-Konikov-Spector exact formula or the effective approximation of Obloj (2008) for long dates. Verify calibrated smiles are positive and monotone for all strikes before using for pricing.

LMM: The Model and Its Parameters

DEFINITION

Definition 5.1 (LIBOR Market Model — Brace-Gatarek-Musiela 1997). Under the spot LIBOR measure, the nn-th forward rate Fn(t)F_n(t) for the period [Tn1,Tn][T_{n-1}, T_n] evolves as: dFn(t)=μn(t)dt+σn(t)Fn(t)dWn(t),tTn1dF_n(t) = \mu_n(t)\,dt + \sigma_n(t)\,F_n(t)\,dW_n(t), \quad t \le T_{n-1} where WnW_n are correlated Brownian motions with dWm,Wnt=ρmndtd\langle W_m, W_n\rangle_t = \rho_{mn}\,dt, σn(t)\sigma_n(t) is the deterministic vol function for FnF_n, and μn\mu_n is the drift (uniquely determined by no-arbitrage under the spot measure).

For calibration to caplets, the key formula is:

Cn=τnBlack(Fn(0),K,vnTn1)C_n = \tau_n \cdot \text{Black}(F_n(0), K, v_n \sqrt{T_{n-1}})

where vn2=1Tn10Tn1σn2(t)dtv_n^2 = \frac{1}{T_{n-1}} \int_0^{T_{n-1}} \sigma_n^2(t)\,dt is the root mean square vol of FnF_n.

For calibration to swaptions, the Rebonato (1999) approximation gives:

THEOREM

Theorem 5.2 (Rebonato Swaption Approximation). The implied vol Σαβ\Sigma_{\alpha\beta} of a payer swaption on the swap [Tα,Tβ][T_\alpha, T_\beta] with notional payment at each TnT_n, n=α+1,,βn=\alpha+1,\ldots,\beta, is approximately:

Σαβ2Tα    1Sαβ2n,m=α+1βwnwmρnmFn(0)Fm(0)0Tασn(t)σm(t)dt\Sigma_{\alpha\beta}^2 T_\alpha \;\approx\; \frac{1}{S_{\alpha\beta}^2} \sum_{n,m=\alpha+1}^{\beta} w_n\,w_m\,\rho_{nm}\,F_n(0)\,F_m(0) \int_0^{T_\alpha} \sigma_n(t)\,\sigma_m(t)\,dt

where Sαβ=n=α+1βτnDnFn(0)S_{\alpha\beta} = \sum_{n=\alpha+1}^{\beta} \tau_n D_n F_n(0) is the forward swap rate (with DnD_n discount factors) and wn=τnDnFn(0)/Sαβw_n = \tau_n D_n F_n(0) / S_{\alpha\beta} are the swaption weights.

This approximation converts the LMM swaption vol into a quadratic function of the vol parameters — enabling fast gradient-based calibration.

Rank Reduction for the Correlation Matrix

The LMM correlation matrix ρRN×N\rho \in \mathbb{R}^{N\times N} has N(N1)/2N(N-1)/2 free parameters — far too many to calibrate from a finite swaption cube. Rank reduction restricts the correlation matrix to have a specific parametric form.

DEFINITION

Definition 5.2 (Rank-kk Approximation). Write ρ=BB\rho = B B^\top where BRN×kB \in \mathbb{R}^{N \times k} has normalised rows (bn=1\|b_n\| = 1). This ensures ρ\rho is PSD with unit diagonal. For kk factors, there are kNk(k+1)/2kN - k(k+1)/2 free parameters (after removing rotational freedom).

Rebonato parametrisation (1-factor): ρmn=exp(λnm)\rho_{mn} = \exp(-\lambda|n - m|) for a single parameter λ>0\lambda > 0. This is a separable exponential form — one parameter controls all pairwise correlations.

For calibration with limited swaption data, k=2k = 2 or k=3k = 3 factors is standard:

  • k=2k = 2: captures the overall level and the short-rate/long-rate spread direction.
  • k=3k = 3: adds curvature of the correlation structure.

Cascade (Sequential) Calibration

Rather than calibrating all LMM parameters simultaneously (which is a large, ill-conditioned NLS problem), cascade calibration proceeds sequentially:

DEFINITION

Definition 5.3 (Cascade Calibration). For n=1,2,,N1n = 1, 2, \ldots, N-1 (in order of increasing expiry):

  1. Calibrate σn(t)\sigma_n(t) on [Tn1,Tn)[T_{n-1}, T_n) to match the co-terminal swaption [Tn,Tβ][T_n, T_\beta] given the already-calibrated σ1,,σn1\sigma_1, \ldots, \sigma_{n-1}.
  2. Each calibration step involves a single 1D root-finding problem (or a small NLS with few parameters).

This "peeling" approach converts a large joint optimisation into N1N-1 small problems.

Limitation of cascade calibration: the sequential approach may not produce the globally optimal joint solution. Specifically, the Rebonato swaption approximation introduces approximation errors that accumulate through the cascade. In practice, cascade calibration is followed by a global LM polish to reduce the joint residuals.

EXAMPLE

Example 5.1 (3-Rate LMM Calibration). With N=3N = 3 forward rates (F1,F2,F3F_1, F_2, F_3) at quarterly intervals, the correlation matrix is: ρ=(1ρ12ρ13ρ121ρ23ρ13ρ231)\rho = \begin{pmatrix} 1 & \rho_{12} & \rho_{13} \\ \rho_{12} & 1 & \rho_{23} \\ \rho_{13} & \rho_{23} & 1 \end{pmatrix} Using the Rebonato parametrisation ρmn=eλmn\rho_{mn} = e^{-\lambda|m-n|} reduces this to one parameter λ\lambda. With flat vols σn=σ\sigma_n = \sigma for all nn, calibration to 3 caplet vols gives σ\sigma from caplets and λ\lambda from the ratio of the 2-year swaption vol to the 1-year caplet vol.

SABR-LMM: Stochastic Vol for Each Rate

The combined SABR-LMM model assigns SABR dynamics to each forward rate FnF_n, giving each a time-varying vol αn(t)\alpha_n(t) and introducing additional correlations between vol processes. This enables fitting to the swaption smile (not just ATM) but at the cost of many more parameters. Full SABR-LMM calibration is a research-grade task; the standard simplification is to calibrate SABR per-expiry-tenor (for smile) and LMM for the correlation structure (for exotics sensitivity) separately.


Validation

The companion notebook implements SABR and a simplified 3-rate LMM calibration:

  1. SABR Hagan formula for given (α,β,ρ,ν)(\alpha, \beta, \rho, \nu) evaluated across a strike grid.
  2. ATM formula to initialise α\alpha from market ATM vol — verified against the full formula.
  3. Per-expiry SABR calibration to synthetic market quotes using scipy.optimize.least_squares.
  4. Recovery of the true parameters to within 10310^{-3}.
  5. Rebonato swaption approximation for a 3-rate LMM — compared against the exact result.
  6. Rank-1 Rebonato correlation parametrisation — fitting λ\lambda to a caplet/swaption ratio.
PRACTICE

Before opening the notebook: For SABR with β=0.5\beta = 0.5, F=0.04F = 0.04, T=1T = 1, and quotes σATM=0.25\sigma_{\text{ATM}} = 0.25, σ(K=0.02)=0.30\sigma(K = 0.02) = 0.30, σ(K=0.06)=0.22\sigma(K = 0.06) = 0.22: (a) What sign of ρ\rho do you expect (the smile is negatively skewed)? (b) Estimate α\alpha using the ATM formula (ignore the correction term for a first estimate). (c) Which of ρ\rho or ν\nu primarily controls the skew vs. the curvature?


Limitations

WARNING

Warning — SABR Arbitrage at Extreme Strikes. The Hagan approximation produces implied vols that correspond to negative risk-neutral densities at very low or very high strikes. This is a defect of the approximation, not the true SABR model. The issue manifests as negative butterfly spreads. In production, always verify that the calibrated smile is free of butterfly arbitrage (second derivative of call price with respect to strike 0\ge 0) before using it for pricing. Use the Antonov-Konikov-Spector interpolation or the free boundary SABR (for negative rates) as alternatives.

WARNING

Warning — LMM Dimensionality. A 20-rate LMM has a 20×2020\times 20 correlation matrix (190 free entries) plus 20 vol parameters. Without rank reduction, the calibration is grossly under-determined by the swaption cube (which has at most 20×20×5=200020 \times 20 \times 5 = 2000 quotes, but many of these are illiquid). Rank reduction to k=3k = 3 factors reduces the correlation parameters to 3×206=54\approx 3 \times 20 - 6 = 54 — manageable but still large. The choice of kk is a model risk decision.

Other limitations:

  • Cascade calibration ordering: the cascade proceeds from short to long maturities. Errors in early-calibrated vols propagate and amplify into later calibrations. The cascade is an approximation to the joint problem; global LM refinement is necessary.

  • SABR parameter migration: SABR parameters calibrated today may differ significantly from yesterday's if the vol surface has moved. The SABR model is not dynamically consistent (i.e., there is no SABR dynamics for α\alpha itself) — re-calibration is not a hedge. For dynamic hedging, need the SABR-LMM extension.

  • Rebonato approximation accuracy: the Rebonato approximation has errors of O(v2T)O(v^2 T) where vv is the vol level. For high-vol or long-dated swaptions (T>10T > 10 years), the approximation can be off by more than 1 vol point. Use exact Monte Carlo or PDE benchmarks to validate.

  • Correlation instability: the LMM correlation matrix must be PSD. After calibration, a matrix assembled from independently fitted entries may not be PSD — project onto the nearest PSD matrix (eigenvalue clipping) if needed.


Interview Angle

PRACTICE

L1 (Junior) — Typical questions:

  1. What is the SABR model and what are its parameters? Expected: dF=αFβdW1dF = \alpha F^\beta dW^1, dα=ναdW2d\alpha = \nu\alpha dW^2, dW1dW2=ρdtdW^1 dW^2 = \rho dt. Parameters: α\alpha (initial vol), β\beta (CEV exponent, usually fixed), ρ\rho (skew), ν\nu (vol of vol, curvature).

  2. Why is β\beta fixed rather than calibrated in SABR? Expected: β\beta and ρ\rho have degenerate effects on the skew — fitting both simultaneously is ill-conditioned. Market convention fixes β\beta by rate regime (normal for negative rates, log-normal for positive rates).

  3. What is the Libor Market Model (LMM)? Expected: directly models a set of forward LIBOR/SOFR rates as log-normal processes (under BGM / BG). Consistent with Black's formula for caplets. Used for pricing path-dependent interest rate products (Bermudans, CMS, etc.).

PRACTICE

L2 (Senior) — Typical questions:

  1. Describe the SABR per-expiry calibration procedure. What is the key simplification at ATM? Expected: fix β\beta; use ATM formula to analytically pin α\alpha; run LM on (ρ,ν)(\rho, \nu) (or (α,ρ,ν)(\alpha, \rho, \nu) with α\alpha from ATM). ATM formula is approximately linear in α\alpha → good initialisation. Per slice: 3 parameters, 5–7 instruments, fast convergence.

  2. Explain rank reduction of the LMM correlation matrix. Why is it necessary? Expected: N(N1)/2N(N-1)/2 free parameters in ρ\rho — too many for the data. Rank-kk factorisation ρ=BB\rho = BB^\top reduces to kNk(k+1)/2kN - k(k+1)/2 parameters. k=2k=2 or 3 is standard. State the Rebonato 1-parameter form ρmn=eλmn\rho_{mn} = e^{-\lambda|m-n|}.

  3. What is cascade calibration? What are its weaknesses? Expected: sequential calibration from short to long expiries; each step is a small root-finding problem. Weaknesses: errors propagate, approximation quality degrades for long dates, not globally optimal. Mitigated by global LM polish afterward.

PRACTICE

L3 (Researcher) — Typical questions:

  1. Under what conditions does the Hagan SABR approximation produce arbitrage-free smiles? How would you fix the problem in practice? Expected: approximation produces negative densities at very low strikes (near zero) and very high strikes. Condition: approximation is valid for ν2T1\nu^2 T \ll 1 and α1\alpha \ll 1. Fix: use the exact SABR density (numerically from the 2D Kolmogorov PDE), or use the free-boundary SABR, or the Antonov-Konikov-Spector closed form. For low strikes in negative rate regimes: shifted SABR or normal SABR.

  2. Derive the Rebonato swaption approximation formula. What assumptions does it make? Expected: swap rate is approximated as a weighted sum of forward rates; the weight wnτnDnFn(0)/Sαβw_n \approx \tau_n D_n F_n(0)/S_{\alpha\beta} is frozen at t=0t=0 (the "freezing" approximation). The log-normal dynamics of the frozen approximation then give a log-normal swaption vol via the co-variance integral. Accuracy: O(v2)O(v^2) relative error in vol.

  3. Compare calibrating the LMM to caplets only vs. to swaptions. What additional information do swaptions provide? Expected: caplets constrain each σn\sigma_n independently (diagonal information). Swaptions constrain the integrated covariance n,mρnmσnσm\sum_{n,m} \rho_{nm} \sigma_n \sigma_m (off-diagonal information). Calibrating only to caplets leaves ρ\rho unconstrained — any correlation matrix fits. Swaptions uniquely determine the correlation structure under the Rebonato approximation. Joint calibration to caplets + swaptions determines both vols and correlations.