ProbabilityMeasure TheoryLebesgue IntegrationExpectation

Lebesgue Integration and Expectation

Module 2 of 522 min readLevel: Medium

Setup

From Riemann to Lebesgue

The Riemann integral partitions the domain — the x-axis — into thin vertical slices and approximates ff by step functions on those slices. For continuous functions on bounded intervals this works well. It fails in three ways that matter for finance:

  1. Discontinuous functions. The indicator 1Q\mathbf{1}_{\mathbb{Q}} (1 on rationals, 0 elsewhere) is not Riemann integrable. Its Lebesgue integral over [0,1][0,1] is 00 — the rationals have measure zero — yet this cannot be recovered from Riemann's construction.

  2. Limit interchange. A uniformly bounded sequence of Riemann-integrable functions can converge pointwise to a function that is not Riemann integrable. The Lebesgue theory has clean theorems (MCT, DCT) governing when limits and integrals commute.

  3. Abstract sample spaces. Ω\Omega is not the real line. Asset price paths live in C([0,T])C([0,T]); there is no natural x-axis to partition. Expectation must be defined on an arbitrary measurable space.

The Lebesgue integral resolves all three by partitioning the range of ff — the y-axis — into thin horizontal strips, measuring the probability of the preimage of each strip, and summing.

INSIGHT

Why this matters on a derivatives desk. Every option price is an expectation: V0=erTEQ[Φ(ST)]V_0 = e^{-rT} \mathbb{E}^\mathbb{Q}[\Phi(S_T)]. Monte Carlo is a law-of-large-numbers approximation to this Lebesgue integral. Changing between the physical measure P\mathbb{P} and the risk-neutral measure Q\mathbb{Q} (Girsanov's theorem, Module 3 of this course) is a change in the integrating measure — it has precise meaning only in the Lebesgue framework.

Conventions

  • (Ω,F,P)(\Omega, \mathcal{F}, \mathbb{P}) is a probability space as defined in Module 1.
  • X,Y:ΩRX, Y : \Omega \to \mathbb{R} always denote F\mathcal{F}-measurable random variables.
  • E[X]\mathbb{E}[X] and ΩXdP\int_\Omega X \, d\mathbb{P} are used interchangeably — they are the same object.
  • "Integrable" without qualification means E[X]<\mathbb{E}[|X|] < \infty (i.e., XL1(Ω,F,P)X \in L^1(\Omega, \mathcal{F}, \mathbb{P})).
  • "a.s." means "P\mathbb{P}-almost surely" — holding except on a set of probability zero.

Theory

1. Simple Functions

The Lebesgue integral is built in three ascending stages: (1) simple functions, (2) non-negative measurables, (3) general integrable functions.

DEFINITION

Definition 1.1 (Simple Function). A function s:ΩRs : \Omega \to \mathbb{R} is simple if it takes only finitely many values. Every simple function has a unique standard form

s=i=1nai1Ai,s = \sum_{i=1}^n a_i \, \mathbf{1}_{A_i},

where a1,,ana_1, \ldots, a_n are the distinct values of ss and Ai=s1({ai})FA_i = s^{-1}(\{a_i\}) \in \mathcal{F} are the corresponding preimages. The AiA_i are disjoint and cover Ω\Omega.

The sets AiA_i belong to F\mathcal{F} because ss is measurable ({s=ai}\{s = a_i\} is the preimage of a Borel set). Their disjointness and covering property follow directly from the fact that the aia_i are distinct.

DEFINITION

Definition 1.2 (Integral of a Non-Negative Simple Function). If s=i=1nai1Ais = \sum_{i=1}^n a_i \mathbf{1}_{A_i} with ai0a_i \ge 0, the Lebesgue integral of ss is

ΩsdP:=i=1naiP(Ai)[0,],\int_\Omega s \, d\mathbb{P} := \sum_{i=1}^n a_i \, \mathbb{P}(A_i) \in [0, \infty],

with the convention 0=00 \cdot \infty = 0.

The integral is a weighted average: value aia_i times the probability of the region where ss takes that value. This is the natural generalisation of the discrete expected value formula E[X]=kxkP(X=xk)\mathbb{E}[X] = \sum_k x_k \mathbb{P}(X = x_k).

EXAMPLE

Example 1.3. Let Ω={H,T}\Omega = \{H, T\} (fair coin), P({H})=P({T})=1/2\mathbb{P}(\{H\}) = \mathbb{P}(\{T\}) = 1/2, and X(H)=3X(H) = 3, X(T)=1X(T) = -1. Then X=31{H}+(1)1{T}X = 3 \cdot \mathbf{1}_{\{H\}} + (-1) \cdot \mathbf{1}_{\{T\}} is simple (but not non-negative; we handle the sign in §3). If instead X(H)=3X(H) = 3, X(T)=1X(T) = 1, then XX is a non-negative simple function and

ΩXdP=312+112=2.\int_\Omega X \, d\mathbb{P} = 3 \cdot \tfrac{1}{2} + 1 \cdot \tfrac{1}{2} = 2.


2. Non-Negative Measurable Functions

Every non-negative measurable function is a pointwise limit of increasing simple functions. This is the foundational approximation lemma of Lebesgue integration.

THEOREM

Theorem 2.1 (Monotone Approximation). For every measurable f:Ω[0,]f : \Omega \to [0, \infty], there exists a sequence of non-negative simple functions 0s1s20 \le s_1 \le s_2 \le \cdots with snfs_n \nearrow f pointwise. One canonical construction is

sn(ω):=k=0n2n1k2n1{k2nf<k+12n}(ω)+n1{fn}(ω).s_n(\omega) := \sum_{k=0}^{n \cdot 2^n - 1} \frac{k}{2^n} \, \mathbf{1}_{\left\{\frac{k}{2^n} \le f < \frac{k+1}{2^n}\right\}}(\omega) + n \cdot \mathbf{1}_{\{f \ge n\}}(\omega).

This truncates ff at level nn and quantises the range into intervals of width 2n2^{-n}. Each preimage is in F\mathcal{F} by measurability.

DEFINITION

Definition 2.2 (Integral of a Non-Negative Measurable Function). For measurable f:Ω[0,]f : \Omega \to [0, \infty],

ΩfdP:=sup{ΩsdP:0sf, s simple}.\int_\Omega f \, d\mathbb{P} := \sup \left\{ \int_\Omega s \, d\mathbb{P} : 0 \le s \le f,\ s \text{ simple} \right\}.

The value may be ++\infty. Theorem 2.1 guarantees the supremum is achieved by the canonical approximating sequence.

THEOREM

Theorem 2.3 (Monotone Convergence Theorem — MCT). Let 0f1f20 \le f_1 \le f_2 \le \cdots be measurable functions with fnff_n \nearrow f pointwise (P\mathbb{P}-a.s.). Then

limnΩfndP=ΩfdP.\lim_{n \to \infty} \int_\Omega f_n \, d\mathbb{P} = \int_\Omega f \, d\mathbb{P}.

The MCT justifies swapping the limit and the integral whenever the sequence is increasing. It is used to compute the expectation of any non-negative random variable as the limit of expectations of bounded approximations.

REMARK

The increasing condition is essential. Let Ω=R\Omega = \mathbb{R} with Lebesgue measure λ\lambda and set fn=1[n,n+1]f_n = \mathbf{1}_{[n, n+1]}. Then fn0f_n \to 0 pointwise, yet fndλ=1\int f_n \, d\lambda = 1 for all nn. The limit and integral do not commute because the mass "escapes to infinity" — an MCT failure. This is precisely the phenomenon behind heavy-tailed return distributions in finance where variance contributions at large values do not vanish.


3. General Integrable Functions

Every measurable f:ΩRf : \Omega \to \mathbb{R} decomposes as f=f+ff = f^+ - f^-, where f+:=max(f,0)0f^+ := \max(f, 0) \ge 0 and f:=max(f,0)0f^- := \max(-f, 0) \ge 0 are both non-negative and measurable.

DEFINITION

Definition 3.1 (Lebesgue Integral). For a measurable f:ΩRf : \Omega \to \mathbb{R}:

ΩfdP:=Ωf+dPΩfdP,\int_\Omega f \, d\mathbb{P} := \int_\Omega f^+ \, d\mathbb{P} - \int_\Omega f^- \, d\mathbb{P},

provided at least one term is finite. If both terms are finite — equivalently, if ΩfdP<\int_\Omega |f| \, d\mathbb{P} < \infty — then ff is P\mathbb{P}-integrable, written fL1(Ω,F,P)f \in L^1(\Omega, \mathcal{F}, \mathbb{P}).

THEOREM

Theorem 3.2 (Standard Properties). For f,gL1f, g \in L^1 and constants a,bRa, b \in \mathbb{R}:

  1. Linearity: (af+bg)dP=a ⁣fdP+b ⁣gdP\displaystyle\int (af + bg) \, d\mathbb{P} = a\!\int f \, d\mathbb{P} + b\!\int g \, d\mathbb{P}.
  2. Monotonicity: fgf \le g a.s. \Rightarrow fdPgdP\displaystyle\int f \, d\mathbb{P} \le \int g \, d\mathbb{P}.
  3. Null sets: modifying ff on a P\mathbb{P}-null set does not change fdP\displaystyle\int f \, d\mathbb{P}.
  4. Triangle inequality: fdPfdP\displaystyle\left|\int f \, d\mathbb{P}\right| \le \int |f| \, d\mathbb{P}.
THEOREM

Theorem 3.3 (Dominated Convergence Theorem — DCT). Let (fn)(f_n) be measurable functions with fnff_n \to f P\mathbb{P}-a.s. If there exists gL1g \in L^1 with fng|f_n| \le g P\mathbb{P}-a.s. for all nn, then fL1f \in L^1 and

limnΩfndP=ΩfdP.\lim_{n \to \infty} \int_\Omega f_n \, d\mathbb{P} = \int_\Omega f \, d\mathbb{P}.

The dominating function gg prevents mass from escaping. In pricing, the DCT is the justification for differentiating expectations to compute Greeks — provided the derivative of the payoff with respect to the parameter (spot, vol, rate) is dominated by an integrable function.


4. Expectation

DEFINITION

Definition 4.1 (Expectation). The expectation (expected value) of an integrable random variable XX on (Ω,F,P)(\Omega, \mathcal{F}, \mathbb{P}) is

E[X]:=ΩXdP.\mathbb{E}[X] := \int_\Omega X \, d\mathbb{P}.

This is not a separate definition. The symbol E[X]\mathbb{E}[X] is shorthand for the Lebesgue integral of XX against P\mathbb{P}. All of Theorem 3.2 applies directly:

E[aX+bY]=aE[X]+bE[Y],XY a.s.E[X]E[Y],E[X]E[X].\mathbb{E}[aX + bY] = a\,\mathbb{E}[X] + b\,\mathbb{E}[Y], \qquad X \le Y \text{ a.s.} \Rightarrow \mathbb{E}[X] \le \mathbb{E}[Y], \qquad |\mathbb{E}[X]| \le \mathbb{E}[|X|].

When XX has a density pXp_X with respect to Lebesgue measure on R\mathbb{R}, the change-of-variables formula (image measure) recovers the classical expression:

E[X]=xpX(x)dx.\mathbb{E}[X] = \int_{-\infty}^\infty x \, p_X(x) \, dx.

This is not a separate theorem — it follows from the general machinery of pushforward measures.

INSIGHT

Risk-neutral pricing is an expectation. Under the risk-neutral measure Q\mathbb{Q}, a European derivative with payoff Φ(ST)\Phi(S_T) is priced as

V0=erTEQ[Φ(ST)]=erTΩΦ(ST(ω))dQ(ω).V_0 = e^{-rT} \mathbb{E}^\mathbb{Q}[\Phi(S_T)] = e^{-rT} \int_\Omega \Phi(S_T(\omega)) \, d\mathbb{Q}(\omega).

Monte Carlo samples NN paths and estimates this as erTN1k=1NΦ(ST(ωk))e^{-rT} \cdot N^{-1} \sum_{k=1}^N \Phi(S_T(\omega_k)). The strong law of large numbers — a theorem about Lebesgue integrals — guarantees convergence Q\mathbb{Q}-a.s. as NN \to \infty.


5. Key Inequalities

THEOREM

Theorem 5.1 (Jensen's Inequality). Let XL1X \in L^1 and φ:RR\varphi : \mathbb{R} \to \mathbb{R} be a convex function with φ(X)L1\varphi(X) \in L^1. Then

φ ⁣(E[X])E[φ(X)].\varphi\!\bigl(\mathbb{E}[X]\bigr) \le \mathbb{E}[\varphi(X)].

Proof. Set μ=E[X]\mu = \mathbb{E}[X]. Convexity of φ\varphi at μ\mu means there exists a supporting line: a constant cRc \in \mathbb{R} such that φ(x)φ(μ)+c(xμ)\varphi(x) \ge \varphi(\mu) + c(x - \mu) for all xx. Substituting x=X(ω)x = X(\omega) and taking expectations: E[φ(X)]φ(μ)+c(E[X]μ)=φ(μ)=φ(E[X]).\mathbb{E}[\varphi(X)] \ge \varphi(\mu) + c\,(\mathbb{E}[X] - \mu) = \varphi(\mu) = \varphi(\mathbb{E}[X]). \qquad \square

EXAMPLE

Example 5.2 (Call lower bound). The call payoff (STK)+(S_T - K)^+ is convex in STS_T. Jensen's inequality gives

(EQ[ST]K)+EQ[(STK)+].\bigl(\mathbb{E}^\mathbb{Q}[S_T] - K\bigr)^+ \le \mathbb{E}^\mathbb{Q}[(S_T - K)^+].

Under the risk-neutral measure, EQ[ST]=S0erT\mathbb{E}^\mathbb{Q}[S_T] = S_0 e^{rT} (forward price). Multiplying by erTe^{-rT}:

V0=erTEQ[(STK)+](S0KerT)+.V_0 = e^{-rT}\mathbb{E}^\mathbb{Q}[(S_T-K)^+] \ge (S_0 - Ke^{-rT})^+.

This is the intrinsic-value lower bound for a European call — derived purely from convexity, without Black-Scholes.

THEOREM

Theorem 5.3 (Markov's Inequality). For a non-negative XX and α>0\alpha > 0:

P(Xα)E[X]α.\mathbb{P}(X \ge \alpha) \le \frac{\mathbb{E}[X]}{\alpha}.

Proof. E[X]{Xα}XdPαP(Xα)\mathbb{E}[X] \ge \int_{\{X \ge \alpha\}} X \, d\mathbb{P} \ge \alpha \, \mathbb{P}(X \ge \alpha). \square

THEOREM

Theorem 5.4 (Chebyshev's Inequality). For XL2X \in L^2 with μ=E[X]\mu = \mathbb{E}[X], σ2=Var(X)\sigma^2 = \text{Var}(X), and any k>0k > 0:

P(Xμkσ)1k2.\mathbb{P}(|X - \mu| \ge k\sigma) \le \frac{1}{k^2}.

Proof. Apply Markov's inequality to (Xμ)2(X - \mu)^2 with α=k2σ2\alpha = k^2\sigma^2. \square

REMARK

Markov and model-free risk bounds. If a portfolio's daily P&L has known mean μ\mu and the loss is non-negative after shifting, Markov's inequality bounds the tail probability without any distributional assumption. The bound is often loose in practice — real loss distributions have far heavier tails than Markov implies — but it is the only bound achievable from the mean alone.


6. Geometric Intuition

Range-partitioning versus domain-partitioning. The Riemann integral approximates the area under a curve by summing thin vertical rectangles of width Δx\Delta x. The Lebesgue integral sums thin horizontal strips of height Δy\Delta y: for each level yy, the "width" is not Δx\Delta x but P({yf<y+Δy})\mathbb{P}(\{y \le f < y + \Delta y\}), the probability of the preimage. On abstract Ω\Omega there is no natural x-axis to partition — the range-based approach is the only one available.

Jensen geometrically. If φ\varphi is convex, its graph curves upward. The mean μ=E[X]\mu = \mathbb{E}[X] is the centre of mass of XX's distribution under P\mathbb{P}. The point (μ,φ(μ))(\mu, \varphi(\mu)) lies on the curve. The value E[φ(X)]\mathbb{E}[\varphi(X)] is the probability-weighted average of heights of the curve — always at least as high as the height at the centre of mass, because curvature pushes the average up. Equality holds if and only if XX is constant a.s. or φ\varphi is affine on the support of XX.

MCT geometrically. The integral fdP\int f \, d\mathbb{P} is the total probability-weighted "volume under ff." An increasing sequence fnff_n \nearrow f fills this volume from below: each additional slice of the range adds non-negative probability mass. The integral accumulates monotonically to the full volume.


Validation

The companion notebook verifies the following claims computationally on finite probability spaces using exact rational arithmetic:

  1. Simple function integralE[X]\mathbb{E}[X] computed as iaiP(Ai)\sum_i a_i \mathbb{P}(A_i) on a discrete Ω\Omega, confirmed against the naive weighted sum.
  2. MCT verification — a non-decreasing sequence fnff_n \nearrow f on a discrete space; confirm fnf\int f_n \nearrow \int f.
  3. DCT verification — a bounded convergent sequence on a discrete space; confirm limit and integral commute.
  4. Jensen's inequality — checked for φ(x)=x2\varphi(x) = x^2, φ(x)=ex\varphi(x) = e^x, and the call payoff (xK)+(x-K)^+.
  5. Markov and Chebyshev — verified that the bounds hold exactly for specific distributions.
PRACTICE

Before opening the notebook, try the following by hand:

Let Ω={1,2,3,4}\Omega = \{1, 2, 3, 4\} with P({k})=1/4\mathbb{P}(\{k\}) = 1/4 for each kk. Define X(ω)=ω2.5X(\omega) = \omega - 2.5, so X{1.5, 0.5, 0.5, 1.5}X \in \{-1.5,\ -0.5,\ 0.5,\ 1.5\}.

  1. Compute E[X]\mathbb{E}[X] directly from Definition 1.2.
  2. Compute E[X2]\mathbb{E}[X^2] and verify Jensen with φ=x2\varphi = x^2: check (E[X])2E[X2](\mathbb{E}[X])^2 \le \mathbb{E}[X^2].
  3. Compute P(X1)\mathbb{P}(|X| \ge 1) and verify Markov with α=1\alpha = 1: check P(X1)E[X]/1\mathbb{P}(|X| \ge 1) \le \mathbb{E}[|X|]/1.

Limitations

L1L^1 is the minimum; L2L^2 is what stochastic calculus needs. The Itô integral (Module 1 of Stochastic Calculus) is constructed for processes satisfying E[0Tft2dt]<\mathbb{E}[\int_0^T f_t^2 \, dt] < \infty. This is an L2L^2 condition — square-integrability, not just integrability. A random variable in L1L2L^1 \setminus L^2 cannot serve as an Itô integrand. Practically: power-law tailed payoffs (e.g., leveraged volatility products) may be in L1L^1 but fail L2L^2 integrability, invalidating standard Itô calculus arguments.

Non-integrable random variables are real. The Cauchy distribution has E[X]=\mathbb{E}[|X|] = \infty — the expectation does not exist. A model that assumes finite mean when the true distribution is heavy-tailed (Pareto with tail index α<1\alpha < 1) is not a conservative approximation; it is a misspecification. This arises in extreme credit events and in certain volatility-of-volatility models.

Fubini's theorem is not free. To interchange the order of a double integral fdPdQ\int \int f \, d\mathbb{P} \, d\mathbb{Q}, the Fubini–Tonelli theorem requires fL1f \in L^1 of the product measure (or f0f \ge 0 for Tonelli). This condition is non-trivial for joint distributions under stochastic vol models, pricing in multi-currency frameworks (FX triangle), or computing expected values of path-dependent integrals.

Differentiating under E\mathbb{E} requires DCT. The formula θE[f(X,θ)]=E[θf(X,θ)]\partial_\theta \mathbb{E}[f(X, \theta)] = \mathbb{E}[\partial_\theta f(X, \theta)] holds if θfg|\partial_\theta f| \le g with gL1g \in L^1. For discontinuous payoffs (digitals, barriers, first-touch), θf\partial_\theta f can be a Dirac delta — not integrable — and the formula fails. The resulting bias in finite-difference Greeks is not a numerical artefact; it is a mathematical failure of the interchange condition.

WARNING

"E[X]\mathbb{E}[X] exists" is not automatic. Always verify E[X]<\mathbb{E}[|X|] < \infty before applying linearity, Jensen, or the DCT. For a product XYXY, existence of E[X]\mathbb{E}[X] and E[Y]\mathbb{E}[Y] separately does not imply E[XY]\mathbb{E}[XY] is finite — for that you need XL2X \in L^2 and YL2Y \in L^2 (Cauchy-Schwarz: E[XY]X2Y2\mathbb{E}[|XY|] \le \|X\|_2 \|Y\|_2).


Interview Angle

L1 — Junior quant / quant developer

Expected depth: Compute E[X]\mathbb{E}[X] on discrete spaces from first principles; state linearity and monotonicity; apply Jensen to options; know the Markov bound.

PRACTICE

Q1. "What is E[X]\mathbb{E}[X]? Write the formula for a discrete random variable and connect it to the Lebesgue integral."

Expected: E[X]=ixiP(X=xi)\mathbb{E}[X] = \sum_i x_i \mathbb{P}(X = x_i) — this is ΩXdP\int_\Omega X \, d\mathbb{P} when XX is a simple (discrete) function. Linearity and monotonicity follow immediately from the integral definition. Weak answer: "the average" without probability weights or without connecting to the formal definition.

Q2. "State Jensen's inequality and use it to derive a lower bound for a European call."

Expected: φ(E[X])E[φ(X)]\varphi(\mathbb{E}[X]) \le \mathbb{E}[\varphi(X)] for convex φ\varphi. With φ(x)=(xK)+\varphi(x) = (x-K)^+ and X=STX = S_T: V0=erTEQ[(STK)+]erT(EQ[ST]K)+=(S0KerT)+V_0 = e^{-rT}\mathbb{E}^\mathbb{Q}[(S_T-K)^+] \ge e^{-rT}(\mathbb{E}^\mathbb{Q}[S_T]-K)^+ = (S_0 - Ke^{-rT})^+. Weak answer: states the inequality but cannot derive the finance application.

Q3. "Can E[X]\mathbb{E}[X] fail to exist? Give a concrete example."

Expected: Yes. The Cauchy distribution p(x)=1/(π(1+x2))p(x) = 1/(\pi(1+x^2)): the integral x/(π(1+x2))dx\int |x| / (\pi(1+x^2)) dx diverges. Common in finance: power-law tails with tail index α<1\alpha < 1. The sample mean of Cauchy samples does not converge as sample size increases — it fluctuates permanently.

L2 — Senior quant

Expected depth: Explain when DCT justifies differentiating under E\mathbb{E}, derive Jensen via supporting hyperplanes, distinguish L1L^1 from L2L^2.

PRACTICE

Q1. "When can you compute SV0\partial_S V_0 by differentiating under the expectation sign? When does it fail?"

Expected: DCT justifies it when SΦ(ST)|\partial_S \Phi(S_T)| is dominated by an L1L^1 function. For a call, S(STK)+=1ST>K\partial_S (S_T - K)^+ = \mathbf{1}_{S_T > K}, which is bounded by 1 — DCT applies, yielding Δ=erTPQ(ST>K)=erTN(d2)\Delta = e^{-rT}\mathbb{P}^\mathbb{Q}(S_T > K) = e^{-rT} N(d_2). For a digital 1ST>K\mathbf{1}_{S_T > K}, the derivative with respect to KK is δ(STK)-\delta(S_T - K), a Dirac delta — not integrable, DCT fails, naive differentiation gives a wrong value.

Q2. "What is the difference between L1L^1 and L2L^2? Why does the Itô integral require L2L^2?"

Expected: L1L^1: E[X]<\mathbb{E}[|X|] < \infty; L2L^2: E[X2]<\mathbb{E}[X^2] < \infty. L2L1L^2 \subset L^1 on probability spaces (Cauchy-Schwarz). The Itô isometry E[(0TftdWt)2]=E[0Tft2dt]\mathbb{E}[(\int_0^T f_t \, dW_t)^2] = \mathbb{E}[\int_0^T f_t^2 \, dt] is defined only when the right side is finite — i.e., when the integrand is L2([0,T]×Ω)L^2([0,T] \times \Omega). Without this, the stochastic integral is not a martingale and its variance is not controlled.

Q3. "State the Dominated Convergence Theorem precisely and give a pricing application."

Expected: fnff_n \to f a.s., fngL1|f_n| \le g \in L^1 for all nn \Rightarrow fndPfdP\int f_n \, d\mathbb{P} \to \int f \, d\mathbb{P}. Application: continuity of option prices in parameters. For a call with strike KnKK_n \to K, (STKn)+(STK)+(S_T - K_n)^+ \to (S_T - K)^+ a.s. and (STKn)+STL1|(S_T - K_n)^+| \le S_T \in L^1 (under log-normal, EQ[ST]=S0erT<\mathbb{E}^\mathbb{Q}[S_T] = S_0 e^{rT} < \infty) — so DCT gives continuity of the call price in the strike.

L3 — Quant researcher

Expected depth: Fubini–Tonelli and its conditions, uniform integrability, LpL^p interpolation, measurability on path space.

PRACTICE

Q1. "When can you swap E\mathbb{E} and an infinite sum? State the precise theorem."

Expected: If Xn0X_n \ge 0: MCT gives E[nXn]=nE[Xn]\mathbb{E}[\sum_n X_n] = \sum_n \mathbb{E}[X_n] always (no integrability required). For general (Xn)(X_n): if nE[Xn]<\sum_n \mathbb{E}[|X_n|] < \infty, set g=nXnL1g = \sum_n |X_n| \in L^1 (MCT applied to partial sums), then k=1nXkg|\sum_{k=1}^n X_k| \le g — DCT gives E[nXn]=nE[Xn]\mathbb{E}[\sum_n X_n] = \sum_n \mathbb{E}[X_n]. For double integrals: Tonelli (non-negative, no condition) or Fubini (fL1f \in L^1 of the product measure) allows interchange of order.

Q2. "What is uniform integrability? Why is it the correct condition for L1L^1 martingale convergence?"

Expected: (Xn)(X_n) is uniformly integrable (UI) if supnE[Xn1Xn>c]0\sup_n \mathbb{E}[|X_n| \mathbf{1}_{|X_n| > c}] \to 0 as cc \to \infty — the tails of Xn|X_n| are uniformly controlled. A martingale (Mt)t0(M_t)_{t \ge 0} converges in L1L^1 (not just a.s.) if and only if it is UI. Boundedness in L2L^2 (suptE[Mt2]<\sup_t \mathbb{E}[M_t^2] < \infty) implies UI by Cauchy-Schwarz. This underpins the Optional Stopping Theorem: E[Mτ]=E[M0]\mathbb{E}[M_\tau] = \mathbb{E}[M_0] holds for bounded stopping times, or for UI martingales.

Q3. "Brownian paths live in C([0,T])C([0,T]). What is the natural σ-algebra and measure on this space?"

Expected: The Borel σ-algebra on C([0,T])C([0,T]) with the sup-norm topology. This is generated by cylinder sets {Bt1A1,,BtnAn}\{B_{t_1} \in A_1, \ldots, B_{t_n} \in A_n\}. The Wiener measure WW is the unique probability measure on C([0,T])C([0,T]) under which B0=0B_0 = 0 a.s., BtBsN(0,ts)B_t - B_s \sim \mathcal{N}(0, t-s), and increments on disjoint intervals are independent. Its existence is guaranteed by the Kolmogorov extension theorem. Integration of functionals F:C([0,T])RF : C([0,T]) \to \mathbb{R} with respect to WW is what every path-dependent pricing formula computes.