Setup
What Monte Carlo Computes
Monte Carlo (MC) methods estimate expectations of the form
where is a random variable (or vector) with a known distribution and is a payoff or functional. In derivatives pricing, is typically a path of the risk-neutral process and is the discounted payoff.
The standard MC estimator with i.i.d. samples is
By the law of large numbers, almost surely. By the central limit theorem,
The standard error is , converging at rate regardless of the dimension of . This is the central virtue of MC: convergence rate is dimension-independent.
Notation
- : variance of the estimator (per sample).
- : number of independent sample paths.
- : sample variance estimator, used to construct confidence intervals.
- The confidence interval for is .
A key principle: reducing by a factor is equivalent to increasing by . Variance reduction techniques achieve this without generating more samples.
Antithetic Variates
Principle
For each sample , generate a paired sample from the antithetic distribution — typically by negating the underlying normal draw. The antithetic estimator is:
This is unbiased: since has the same marginal distribution as .
Variance Analysis
Since and are identically distributed, . Therefore:
Variance is reduced whenever . Compared to the standard estimator variance using samples (matching the computational cost — two paths per antithetic pair), the antithetic estimator variance is:
The efficiency gain is:
where is the correlation between and . For this to be a gain, we need .
Application to GBM Call Option
Under GBM, one path realises for . The antithetic path uses :
The payoff is an increasing function of . When is large (call in the money), is large and negative (call out of the money). So for a convex, monotone payoff — antithetic variates are effective.
For path-dependent options, antithetic variates require generating the full antithetic path .
Control Variates
Principle
Let be a random variable correlated with such that is known analytically. The control variate estimator is:
This is unbiased for any constant : the correction term has mean zero.
Optimal Control Coefficient
The variance of is:
Minimising over :
The variance after control variate at :
where is the correlation between and . The variance reduction factor is ; a correlation of gives variance reduction.
In practice, is estimated from a pilot sample or computed jointly with the main estimator using the sample covariance. The bias from estimating is — negligible for large .
Common Control Variates in Option Pricing
Underlying as control. Under , . Set ; then . For a call option, is high (near 1 for at-the-money options), giving substantial reduction.
European as control for path-dependent. For an Asian option on an arithmetic average, the geometric-average Asian has a known closed form. The correlation between arithmetic and geometric payoffs is high ( in typical parameters), yielding near-perfect variance reduction.
Delta-hedged portfolio. For any smooth payoff, the discounted stock price increment has mean zero (it is a martingale under ). Using this as a control variate is known as martingale control variates and can achieve variance reduction close to for smooth payoffs.
Quasi-Monte Carlo
Motivation
Standard MC uses pseudo-random numbers, which are i.i.d. uniform on . The expected error is . Can we do better by replacing pseudo-random sequences with deterministic low-discrepancy sequences that fill the hypercube more uniformly?
Discrepancy
The star discrepancy of a sequence is:
It measures the worst-case deviation between the empirical distribution of the sequence and the uniform distribution. A sequence is low-discrepancy (LD) if — much smaller than the discrepancy of a random sequence.
Koksma-Hlawka Inequality
For a function of bounded Hardy-Krause variation :
For a low-discrepancy sequence: error , which beats for fixed and large . In practice, the improvement is dramatic for and smooth .
Sobol Sequences
Sobol sequences are the most widely used LD sequences in finance. They are digital nets constructed in base 2 using generating matrices that ensure equidistribution across dyadic subintervals. Key properties:
- Each dimension is independently constructed (with cross-dimension properties ensured via scrambling).
- The first points of a Sobol sequence cover in a maximally stratified manner.
- Scrambled Sobol (Owen 1995) randomises the sequence while preserving the LD property, enabling unbiased estimation and confidence intervals.
Effective Dimension
The Koksma-Hlawka bound grows with . For high-dimensional problems (e.g., simulating a 252-step path = ), the theoretical advantage of QMC may be lost. In practice, what matters is the effective dimension: for a GBM path, the variation in the payoff is dominated by a few directions (early time steps, large moves). The Brownian bridge construction assigns Sobol dimensions to the most influential time points first, concentrating the low-discrepancy property where it matters most.
Convergence Comparison
| Method | Expected error | Notes |
|---|---|---|
| Standard MC | Dimension-independent, easy to implement | |
| Antithetic variates | Reduces variance by , no extra paths | |
| Control variates | Reduces variance by , needs known | |
| Quasi-MC (Sobol) | Pre-asymptotic gains for , smooth integrands | |
| Randomised QMC | (theoretical) | Confidence intervals available; best for smooth functions |
All methods have the same asymptotic rate in the worst case for non-smooth payoffs. The gains are problem-dependent.
Limitations
Non-smooth payoffs. Digital options have a payoff with a jump discontinuity. The Koksma-Hlawka bound requires bounded variation — a digital payoff has in the Hardy-Krause sense (discontinuous). QMC loses its advantage; standard MC performs comparably. Smoothing the payoff (e.g., replace the digital with a call spread) restores QMC efficiency.
Curse of dimensionality. The factor in QMC becomes prohibitive for at typical sample sizes. The effective-dimension reduction via Brownian bridge or principal component analysis (PCA) construction is essential.
Antithetic for discontinuous payoffs. Antithetic variates can increase variance if the payoff is not monotone (e.g., a butterfly spread). Always verify the sign of before applying.
Control variate estimation. If is estimated from the same sample used for the main estimate, the combined estimator is biased (though at rate ). For small or high-stakes applications, use a separate pilot sample for estimation.
Interview Angle
L1. State the Monte Carlo convergence rate. How many additional paths are needed to halve the standard error? What is the standard error of a MC estimate of a Black-Scholes call price?
Rate: . To halve , multiply by 4. For a BS call: simulate , compute . Then and where .
L2. Derive the optimal control variate coefficient and the resulting variance reduction formula . Explain why makes the discounted stock a valid control variate.
Derivation: . Differentiate in and set to zero: . Substituting back: .
Why is valid: Under , the discounted stock is a martingale, so — this is known analytically. Setting with , the correction term has zero mean, making the control variate estimator unbiased.
L3. State the Koksma-Hlawka inequality. Under what conditions does quasi-MC outperform standard MC? Explain the role of the Brownian bridge construction in making Sobol sequences effective for path-dependent options.
Koksma-Hlawka: . For LD sequences , so the error is vs. for standard MC. QMC wins when: (1) has bounded variation (smooth or Lipschitz payoffs), (2) is moderate (the factor is manageable), and (3) is large enough for the pre-asymptotic advantage to materialise.
Brownian bridge: For a path , the standard construction maps Sobol dimension to time step in order. But the payoff variance is dominated by the terminal value and large-scale moves, not fine-scale increments. The Brownian bridge construction uses Sobol dimension 1 for , dimension 2 for (interpolated given and ), then fills in midpoints recursively. This assigns the lowest-discrepancy dimensions to the highest-variance components of the path, making the effective dimension of the integral much smaller than and restoring QMC efficiency.