Stochastic CalculusBrownian MotionQuadratic Variation

Brownian Motion and Quadratic Variation

18 min readLevel: Medium

Setup

Let (Ω,F,P)(\Omega, \mathcal{F}, \mathbb{P}) be a complete probability space equipped with a filtration F=(Ft)t0\mathbb{F} = (\mathcal{F}_t)_{t \geq 0} satisfying the usual conditions: right-continuity (Ft=Ft+\mathcal{F}_t = \mathcal{F}_{t+}) and completeness (null sets of P\mathbb{P} are in F0\mathcal{F}_0).

All processes are defined on this space. Time horizon is [0,T][0, T] for a fixed T>0T > 0. Expectations are under P\mathbb{P} unless stated otherwise.


Definition

A standard Brownian motion (or Wiener process) is a stochastic process (Wt)t0(W_t)_{t \geq 0} satisfying:

  1. Initial condition: W0=0W_0 = 0 almost surely.
  2. Independent increments: for 0s<t0 \leq s < t, the increment WtWsW_t - W_s is independent of Fs\mathcal{F}_s.
  3. Gaussian increments: WtWsN(0,ts)W_t - W_s \sim \mathcal{N}(0,\, t - s) for all 0st0 \leq s \leq t.
  4. Continuous paths: tWt(ω)t \mapsto W_t(\omega) is continuous P\mathbb{P}-almost surely.

These axioms are not redundant. Axiom 4 is a topological regularity condition, independent of the distributional structure in axioms 2–3. Without it, the process would be defined only up to modification, and pathwise integration would be meaningless.

The covariance structure follows immediately: for sts \leq t, Cov(Ws,Wt)=E[WsWt]=s.\mathrm{Cov}(W_s, W_t) = \mathbb{E}[W_s W_t] = s.


Sample Path Properties

Hölder Continuity

By the Kolmogorov–Chentsov continuity theorem, a process with E[WtWsp]Cts1+α\mathbb{E}[|W_t - W_s|^p] \leq C |t - s|^{1 + \alpha} for some p,α,C>0p, \alpha, C > 0 admits a version with Hölder-continuous paths of any exponent γ<α/p\gamma < \alpha/p.

For Brownian motion, E[WtWs2k]=(2k1)!!tsk\mathbb{E}[|W_t - W_s|^{2k}] = (2k-1)!! \cdot |t-s|^k. Taking k=2k = 2: E[WtWs4]=3(ts)2\mathbb{E}[|W_t - W_s|^4] = 3(t-s)^2, so we may take p=4p = 4, α=1\alpha = 1, giving Hölder exponent up to γ<1/4\gamma < 1/4. Sharper analysis yields Hölder exponent γ<1/2\gamma < 1/2.

Brownian motion paths are not Hölder-1/21/2: the modulus of continuity is limh0suptshWtWs2hlog(1/h)=1a.s.\lim_{h \to 0} \sup_{|t - s| \leq h} \frac{|W_t - W_s|}{\sqrt{2h \log(1/h)}} = 1 \quad \text{a.s.} (the Lévy modulus). The hlog(1/h)\sqrt{h \log(1/h)} factor, not just h\sqrt{h}, captures the exact regularity.

Nowhere Differentiability

With probability one, Brownian motion is nowhere differentiable. To see why: if the path were differentiable at some point tt, then (Wt+hWt)/hWt(W_{t+h} - W_t)/h \to W'_t as h0h \to 0. But this quotient has standard deviation h1/2h^{-1/2}, which diverges. A formal proof uses Paley–Wiener–Zygmund or a direct Borel–Cantelli argument.

This is the mathematical source of the informal statement "Brownian motion fluctuates on every scale."

Infinite Total Variation

Define the total variation of WW on [0,T][0, T] over a partition π={0=t0<t1<<tn=T}\pi = \{0 = t_0 < t_1 < \cdots < t_n = T\}: TV(W,π)=i=1nWtiWti1.TV(W, \pi) = \sum_{i=1}^{n} |W_{t_i} - W_{t_{i-1}}|.

The total variation of Brownian motion is infinite almost surely: supπTV(W,π)=+a.s.\sup_{\pi} TV(W, \pi) = +\infty \quad \text{a.s.}

This has a fundamental consequence: Lebesgue–Stieltjes integration of the form fdW\int f \, dW cannot be defined pathwise. The standard integration-by-parts formula requires finite variation. For Brownian motion, a new theory — the Itô integral — is required.


Quadratic Variation

The quadratic variation of a process XX on [0,t][0, t] is defined as the limit in probability: [X]t=limπ0i=1n(XtiXti1)2,[X]_t = \lim_{|\pi| \to 0} \sum_{i=1}^{n} (X_{t_i} - X_{t_{i-1}})^2, where the limit is taken as the mesh π=maxi(titi1)0|\pi| = \max_i(t_i - t_{i-1}) \to 0 over arbitrary partitions.

Theorem: [W]t=t[W]_t = t

For standard Brownian motion, [W]t=t[W]_t = t almost surely for all t0t \geq 0.

Proof. Fix an equal-mesh partition with nn intervals and mesh h=t/nh = t/n. Let Δi=WtiWti1\Delta_i = W_{t_i} - W_{t_{i-1}} and define: Qn=i=1nΔi2.Q_n = \sum_{i=1}^{n} \Delta_i^2.

Since ΔiN(0,h)\Delta_i \sim \mathcal{N}(0, h) and all Δi\Delta_i are independent: E[Δi2]=h,Var(Δi2)=E[Δi4]h2=3h2h2=2h2.\mathbb{E}[\Delta_i^2] = h, \qquad \mathrm{Var}(\Delta_i^2) = \mathbb{E}[\Delta_i^4] - h^2 = 3h^2 - h^2 = 2h^2.

Therefore: E[Qn]=nh=t,Var(Qn)=n2h2=2t2/nn0.\mathbb{E}[Q_n] = nh = t, \qquad \mathrm{Var}(Q_n) = n \cdot 2h^2 = 2t^2/n \xrightarrow[n \to \infty]{} 0.

By Chebyshev's inequality, QntQ_n \to t in L2L^2, and hence in probability. Convergence a.s. follows by a subsequence argument extended to general (non-equal) partitions via an L2L^2 monotonicity argument. \square

Differential Notation

The result is commonly written as: (dWt)2=dt.(dW_t)^2 = dt.

This is shorthand for the quadratic variation result, not a pathwise identity. More precisely, for any adapted process ff, 0Tftd[W]t=0Tftdt.\int_0^T f_t \, d[W]_t = \int_0^T f_t \, dt. This identity is central to the derivation of Itô's lemma.


Contrast: Total Variation vs Quadratic Variation

PropertySmooth C1C^1 pathBrownian motion
Total variationFiniteInfinite (a.s.)
Quadratic variationZerott (a.s.)

For a C1C^1 function ff on [0,T][0,T]: increments are O(h)O(h), so squared increments are O(h2)O(h^2), and the sum over n=T/hn = T/h partitions is O(h)0O(h) \to 0.

For Brownian motion: increments are O(h)O(\sqrt{h}), squared increments are O(h)O(h), and the sum over n=T/hn = T/h partitions converges to TT. The non-trivial quadratic variation is the source of the Itô correction in the stochastic chain rule.


Martingale Property

Brownian motion is a martingale with respect to its natural filtration FtW=σ(Ws:st)\mathcal{F}^W_t = \sigma(W_s : s \leq t): E[WtFs]=Ws,st.\mathbb{E}[W_t \mid \mathcal{F}_s] = W_s, \quad s \leq t.

This follows immediately from independent increments: E[WtWsFs]=E[WtWs]=0\mathbb{E}[W_t - W_s \mid \mathcal{F}_s] = \mathbb{E}[W_t - W_s] = 0.

Moreover, Wt2tW_t^2 - t is a martingale: E[Wt2tFs]=Ws2s.\mathbb{E}[W_t^2 - t \mid \mathcal{F}_s] = W_s^2 - s.

The subtracted compensator tt is precisely the quadratic variation [W]t[W]_t. In general, for any continuous local martingale MM, the process Mt2[M]tM_t^2 - [M]_t is a local martingale. This is the Doob–Meyer decomposition in the continuous case.


Lévy's Characterisation

Theorem (Lévy). Let M=(Mt)t0M = (M_t)_{t \geq 0} be a continuous local martingale with M0=0M_0 = 0 and [M]t=t[M]_t = t for all t0t \geq 0. Then MM is a standard Brownian motion.

This is used in proving Girsanov's theorem: after a change of measure, Lévy's characterisation identifies the new process as a Brownian motion by verifying continuity, the local martingale property, and the quadratic variation.


Limitations

Continuous-time idealization. Brownian motion is a mathematical model, not an empirical law. Real asset prices trade discretely, are bounded below by zero, and exhibit microstructure noise at fine timescales. The GBM assumption is a modelling choice.

No-jump model. Standard Brownian motion has continuous paths. Models incorporating jumps (Poisson processes, Lévy processes) require a separate theory; the Itô calculus developed from Brownian motion does not directly apply.

Fractional Brownian motion. For H1/2H \neq 1/2, fractional Brownian motion (fBM) has correlated increments and, for H1/2H \neq 1/2, is not a semimartingale. The standard Itô calculus is invalid for fBM, requiring rough path theory or Malliavin calculus. Rough volatility models (Gatheral–Jaisson–Rosenbaum, 2018) use fBM with H0.1H \approx 0.1 to model instantaneous variance.


Interview Angle

L1: State the four defining properties of Brownian motion. Compute Cov(Ws,Wt)\mathrm{Cov}(W_s, W_t) for sts \leq t.

A standard Brownian motion (Wt)t0(W_t)_{t \geq 0} satisfies: (i) W0=0W_0 = 0 a.s.; (ii) independent increments — WtWsFsW_t - W_s \perp \mathcal{F}_s for s<ts < t; (iii) Gaussian increments — WtWsN(0,ts)W_t - W_s \sim \mathcal{N}(0, t-s); (iv) continuous paths a.s.

For sts \leq t, write Wt=Ws+(WtWs)W_t = W_s + (W_t - W_s). Then: Cov(Ws,Wt)=E[WsWt]=E[Ws(Ws+(WtWs))]=E[Ws2]+E[Ws]E[WtWs]=s+0=s,\mathrm{Cov}(W_s, W_t) = \mathbb{E}[W_s W_t] = \mathbb{E}[W_s(W_s + (W_t - W_s))] = \mathbb{E}[W_s^2] + \mathbb{E}[W_s]\mathbb{E}[W_t - W_s] = s + 0 = s, using E[Ws2]=s\mathbb{E}[W_s^2] = s (from axiom iii) and independence of WsW_s and WtWsW_t - W_s (axiom ii).

L2: Prove that [W]t=t[W]_t = t in L2L^2. Why does infinite total variation not contradict finite quadratic variation? Why is pathwise Lebesgue–Stieltjes integration of dWdW impossible?

Proof. Fix an equal-mesh partition with nn intervals, h=t/nh = t/n. Let Δi=WtiWti1N(0,h)\Delta_i = W_{t_i} - W_{t_{i-1}} \sim \mathcal{N}(0,h), mutually independent. Set Qn=i=1nΔi2Q_n = \sum_{i=1}^n \Delta_i^2. Then E[Qn]=nh=t\mathbb{E}[Q_n] = nh = t and Var(Qn)=n2h2=2t2/n0\mathrm{Var}(Q_n) = n \cdot 2h^2 = 2t^2/n \to 0. By Chebyshev, QnL2tQ_n \xrightarrow{L^2} t, hence in probability. This extends to general (non-uniform) partitions by an L2L^2 monotonicity argument.

Total variation vs quadratic variation. The key is the scaling of increments. On a partition of mesh hh:

  • Increments Δi=O(h)|\Delta_i| = O(\sqrt{h}); summing n=t/hn = t/h of them gives TV=O(hn)=O(t/h)\mathrm{TV} = O(\sqrt{h} \cdot n) = O(t/\sqrt{h}) \to \infty.
  • Squared increments Δi2=O(h)|\Delta_i|^2 = O(h); summing nn of them gives [W]t=O(hn)=O(t)[W]_t = O(h \cdot n) = O(t), finite.

Infinite TV arises because the paths are rough at the h\sqrt{h} scale. Finite QV arises because squaring converts h\sqrt{h} scaling to hh scaling, exactly matching the partition size. The two facts are entirely consistent.

Why pathwise Lebesgue–Stieltjes fails. Lebesgue–Stieltjes integration fdg\int f \, dg requires gg to have bounded variation on [0,T][0,T]: the Stieltjes sum \sum f(t_i^*)(g(t_i) - g(t_{i-1}}) converges absolutely by bounding f|f| against TV(g)\mathrm{TV}(g). Since TV(W)=+\mathrm{TV}(W) = +\infty a.s., there is no uniform dominating bound; the Stieltjes sum can oscillate without limit, and the integral cannot be defined sample-path by sample-path. This is why the Itô integral requires an L2L^2 construction based on non-anticipating approximations rather than a pathwise Stieltjes limit.

L3: State and prove Lévy's characterisation theorem. How is it used in the proof of Girsanov's theorem? What goes wrong if you try to build an Itô calculus for fractional Brownian motion with H1/2H \neq 1/2?

Lévy's characterisation. Let M=(Mt)t0M = (M_t)_{t \geq 0} be a continuous local martingale with M0=0M_0 = 0 and [M]t=t[M]_t = t a.s. for all t0t \geq 0. Then MM is a standard Brownian motion.

Proof sketch. Fix uRu \in \mathbb{R} and define ϕt=eiuMt+u2t/2\phi_t = e^{iuM_t + u^2 t/2}. Apply Itô's lemma (using [M]t=t[M]_t = t in the dM2dM^2 term): dϕt=ϕt ⁣(iudMt+iu0dt+12(iu)2dt+u22dt)=iuϕtdMt.d\phi_t = \phi_t\!\left(iu \, dM_t + iu \cdot 0 \, dt + \frac{1}{2}(iu)^2 dt + \frac{u^2}{2} dt\right) = iu\phi_t \, dM_t. So ϕt\phi_t is a local martingale. Under standard integrability conditions it is a true martingale, so E[ϕtFs]=ϕs\mathbb{E}[\phi_t \mid \mathcal{F}_s] = \phi_s. Dividing: E ⁣[eiu(MtMs)Fs]=eu2(ts)/2.\mathbb{E}\!\left[e^{iu(M_t - M_s)} \mid \mathcal{F}_s\right] = e^{-u^2(t-s)/2}. The conditional characteristic function is that of N(0,ts)\mathcal{N}(0, t-s) and is non-random — meaning MtMsM_t - M_s is independent of Fs\mathcal{F}_s and Gaussian. Since this holds for all s<ts < t and all uu, Kolmogorov's theorem identifies MM as a Brownian motion. \square

Role in Girsanov. After defining W~t=Wt+0tθsds\widetilde{W}_t = W_t + \int_0^t \theta_s \, ds, Lévy is used to certify that W~\widetilde{W} is a Q\mathbb{Q}-Brownian motion. One verifies: (a) W~\widetilde{W} is a continuous Q\mathbb{Q}-local martingale (this uses the change-of-measure formula and the Doléans-Dade exponential); (b) [W~]t=[W]t=t[\widetilde{W}]_t = [W]_t = t because quadratic variation is a pathwise property — it depends on the sample paths, not the probability measure. Lévy then immediately identifies W~\widetilde{W} as a Q\mathbb{Q}-Brownian motion. This is the cleanest way to conclude the proof; without Lévy, one would have to verify all four Brownian axioms directly.

Fractional Brownian motion (H1/2H \neq 1/2). For H(0,1)H \in (0,1), H1/2H \neq 1/2, the fractional BM BHB^H has correlated increments: Cov(BtHBsH,BvHBuH)0in general.\mathrm{Cov}(B^H_t - B^H_s,\, B^H_v - B^H_u) \neq 0 \quad \text{in general.} The quadratic variation behaves as [BH]tt2H[B^H]_t \sim t^{2H} (non-linear in tt for H1/2H \neq 1/2). Two consequences:

  1. Not a semimartingale. For H1/2H \neq 1/2, BHB^H cannot be written as a local martingale plus a finite-variation process. The standard Itô calculus applies only to semimartingales. In particular, there is no Itô isometry, no Doob–Meyer decomposition, and no Girsanov theorem in the classical sense.

  2. Markov property fails. Since increments are correlated, the future distribution of Bt+sHBtHB^H_{t+s} - B^H_t depends on the history (BuH)ut(B^H_u)_{u \leq t}, not just on the current value. Feynman-Kac requires the underlying process to be Markovian (or augmented with a finite-dimensional Markovian state). In rough volatility models (H0.1H \approx 0.1), the instantaneous variance has memory, and the option price cannot be expressed as a function of a finite state vector. Pricing requires either Monte Carlo methods (e.g., the hybrid scheme of Bennedsen–Lunde–Pakkanen) or rough path / Malliavin calculus extensions that replace the Itô integral with the controlled rough path integral.

Verify your understanding before moving on.

Start Quiz →