Quiz: The Levenberg-Marquardt Algorithm — Theory and Implementation

Module 2 of 5 · Hard

Quick Quiz

1. The LM damped normal equations are (JJ+λD)δλ=Jr(J^\top J + \lambda D)\delta_\lambda = J^\top r. At λ=0\lambda = 0, this gives the Gauss-Newton step. As λ\lambda \to \infty, the step approaches:

2. The gain ratio ρ=(L(θ)L(θ+δ))/(m(0)m(δ))\rho = (\mathcal{L}(\theta) - \mathcal{L}(\theta+\delta)) / (m(0) - m(\delta)). A gain ratio of ρ=0.95\rho = 0.95 means:

3. Prove that the LM step δλ\delta_\lambda is always a descent direction. The key ingredient is:

4. LM converges quadratically when the residuals vanish at the solution (r(θ)=0r(\theta^*) = 0) but only linearly when r(θ)0r(\theta^*) \ne 0. What does slow convergence near the solution indicate about the model?

5. A Heston calibration starts from a cold start and converges in 45 iterations. The next day, starting from the previous day's calibrated parameters, it converges in 7 iterations. The reason is:

6. The LM algorithm with D=ID = I is applied to a problem where one parameter κ2.0\kappa \approx 2.0 and another ξ0.3\xi \approx 0.3. What problem arises, and how does Marquardt's scaling D=diag(JJ)D = \mathrm{diag}(J^\top J) fix it?

7. During a calibration run, λ\lambda grows from 10410^{-4} to 101010^{10} without the gradient norm decreasing below tolerance. The most likely root cause is:

8. A calibration algorithm reports "converged" after 3 iterations, but the final residuals are large (RMSE=5%\text{RMSE} = 5\% in vol space). What likely happened?