Ito's Formula

The chain rule for a smooth function of a smooth path comes from a first-order Taylor expansion, because the second-order term is of order $(dt)^2$ and vanishes. For a function of Brownian motion the second-order term does not vanish, because the quadratic variation makes the squared increment of order $dt$ rather than $(dt)^2$ , and the surviving term is the signature of stochastic calculus. Ito's formula is the resulting chain rule. This post proves it from the second-order Taylor expansion, then derives the integration-by-parts rule and solves geometric Brownian motion [1], [2]. Here $W$ is a standard Brownian motion.

#Ito's formula for Brownian motion

Theorem1

Let $f$ be twice continuously differentiable with bounded first and second derivatives. Then for every $t$ , almost surely,

f(W_t)=f(W_0)+\int_0^t f'(W_s)\,dW_s+\frac12\int_0^t f''(W_s)\,ds. \tag{1}

Proof

Fix $t$ and a sequence of partitions $0=t_0<\cdots<t_m=t$ with mesh tending to $0$ , and write $\Delta_i=W_{t _i}-W_{t_{i-1}}$ . Telescoping and the second-order Taylor expansion with Lagrange remainder give, for each interval, a point $\xi_i$ between $W_{t_{i-1}}$ and $W_{t_i}$ with

f(W_t)-f(W_0)=\sum_i\Big(f'(W_{t_{i-1}})\Delta_i+\tfrac12 f''(W_{t_{i-1}})\Delta_i^2\Big)+\tfrac12\sum_i\big( f''(\xi_i)-f''(W_{t_{i-1}})\big)\Delta_i^2, \tag{2}

splitting the second-order term into its value at the left endpoint and a remainder.

The first-order sum. The integrand $f'(W_s)$ is adapted and continuous, so the left-endpoint sums $\sum_i f'(W_{t_{i-1}})\Delta_i$ converge in $L^2$ to the stochastic integral $\int_0^t f'(W_s)\,dW_s$ , the simple integrands $f'(W_{t_{i-1}})\mathbf 1_{(t_{i-1},t_i]}$ approximating $f'(W_\cdot)$ in the integral norm.

The second-order sum. Compare $\sum_i f''(W_{t_{i-1}})\Delta_i^2$ with the Riemann sum $\sum_i f''(W_{t_{i -1}})(t_i-t_{i-1})$ , which converges to $\int_0^t f''(W_s)\,ds$ because $f''(W_\cdot)$ is continuous. The difference is $\sum_i f''(W_{t_{i-1}})\big(\Delta_i^2-(t_i-t_{i-1})\big)$ , and its second moment is, since the increments $\Delta_i^2-(t_i-t_{i-1})$ are mean zero and independent of the past,

\E\Big[\Big(\sum_i f''(W_{t_{i-1}})\big(\Delta_i^2-(t_i-t_{i-1})\big)\Big)^2\Big]=\sum_i\E\big[f''(W_{t_{i-1 }})^2\big]\,2(t_i-t_{i-1})^2\le 2\norm{f''}_\infty^2\,t\cdot\text{mesh}, \tag{3}

the cross terms vanishing by the same mean-zero-and-independence property. Two distinct moves combine here. First, $\sum_i f''(W_{t_{i-1}})\big(\Delta_i^2-(t_i-t_{i-1})\big)\to0$ in $L^2$ by the bound above. Second, the Riemann sum $\sum_i f''(W_{t_{i-1}})(t_i-t_{i-1})\to\int_0^t f''(W_s)\,ds$ pathwise by continuity of $f''(W_\cdot)$ . Adding them, $\sum_i f''(W_{t_{i-1}})\Delta_i^2\to\int_0^t f''(W_s)\,ds$ in probability.

The remainder. On the path, $W$ is continuous on $[0,t]$ with compact range, so $f''$ is uniformly continuous there, and $\abs{\xi_i-W_{t_{i-1}}}\le\abs{\Delta_i}$ tends to $0$ uniformly in $i$ as the mesh shrinks. Hence $\max_i\abs{f''(\xi_i)-f''(W_{t_{i-1}})}\to 0$ pathwise, while along the subsequence on which $\sum_i\Delta_i^2\to t$ almost surely the sum $\sum_i\Delta_i^2$ stays bounded, so the remainder is at most $\max_i\abs{f''(\xi_i)-f''(W_{t_{i-1} })}\sum_i\Delta_i^2\to 0$ almost surely along that subsequence.

Each piece converges in probability, so along a subsequence of partitions all converge almost surely, and passing to the limit in Equation (2) yields Equation (1) almost surely at the fixed $t$ . Applying this countably often establishes the identity on a fixed countable dense set of $t$ on one null set. The indefinite Ito integral $M_t=\int_0^t f'(W_s)\,dW_s$ has a continuous modification, the standard property of the stochastic integral of an $L^2$ adapted integrand, and $\int_0^t f''(W_s)\,ds$ is continuous in $t$ pathwise, so both sides are continuous in $t$ and the identity extends from the dense set to all $t$ off that null set.

The differential shorthand for Equation (1) is $df(W_t)=f'(W_t)\,dW_t+\tfrac12 f''(W_t)\,dt$ , the extra half-times-second-derivative term being the entire content of the formula.

#The general formula

The same expansion applies to an Ito process, a process of the form

X_t=X_0+\int_0^t b_s\,ds+\int_0^t\sigma_s\,dW_s, \tag{4}

with progressively measurable integrands $b\in\mathcal L^1_{loc}$ and $\sigma\in\mathcal L^2_{loc}$ , that is $\int_0^t\abs{b_s}\,ds<\infty$ and $\int_0^t\sigma_s^2\,ds<\infty$ almost surely for every $t$ , so that both integrals exist, whose quadratic variation is $\qv X_t=\int_0^t\sigma_s^2\,ds$ since only the stochastic part contributes. Writing $dX_t=b_t\,dt+\sigma_t\,dW_t$ , the squared increment obeys the multiplication rule $dX_t^2=\sigma_t^2\,dt$ , which abbreviates $d\qv X_t=\sigma_t^2\,dt$ , the only second-order quantity that appears. This holds because the drift $\int b\,ds$ has finite variation, hence zero quadratic variation and zero covariation with the martingale part, while $\qv{\int\sigma\,dW}_t=\int_0^t\sigma_s^2\,ds$ ; the symbolic covariation identities $dW\,dW=dt$ , $dW\,dt=0$ , $dt\,dt=0$ are the mnemonic for this computation.

Theorem2

For an Ito process $X$ and a function $f(t,x)$ with $f$ , $\partial_t f$ , $\partial_x f$ , and $\partial_{xx} f$ continuous,

f(t,X_t)=f(0,X_0)+\int_0^t\Big(\partial_t f+b_s\,\partial_x f+\tfrac12\sigma_s^2\,\partial_{xx}f\Big)ds+ \int_0^t\sigma_s\,\partial_x f\,dW_s, \tag{5}

the partial derivatives evaluated at $(s,X_s)$ .

Proof

We first reduce to the case in which $b$ , $\sigma$ , and the derivatives of $f$ are all bounded. Define the stopping time $\tau_N=\inf\{s:\abs{X_s}\ge N\text{ or }\int_0^s(\abs{b_r}+\sigma_r^2)\,dr\ge N\}$ and run the process up to $\tau_N$ . Confining $X$ to $[-N,N]$ bounds the continuous $f,\partial_t f,\partial_x f,\partial_{xx}f$ over the compact $[0,t]\times[-N,N]$ , and capping the accumulated $\int(\abs{b}+\sigma^2)$ bounds the coefficients in the relevant norms; the two controls together supply every hypothesis the estimates below consume. Since $\int_0^t(\abs{b_s}+\sigma_s^2)\,ds<\infty$ almost surely and paths are continuous, $\tau_N\to\infty$ almost surely, so it suffices to prove the formula on $[0,t\wedge\tau_N]$ and let $N\to\infty$ . Assume henceforth that $b$ , $\sigma$ , and the derivatives of $f$ are bounded.

Split each step as $f(t_i,X_{t_i})-f(t_{i-1},X_{t_{i-1}})=[f(t_i,X_{t_i})-f(t_{i-1},X_{t_i})]+[f(t_{i-1},X_{t_i})-f(t_{i-1},X_{t_{i-1}})]$ . A first-order expansion in time of the first bracket contributes $\partial_t f\,\Delta t_i$ by the mean value theorem and continuity of $\partial_t f$ . A second-order expansion in space of the second bracket contributes $\partial_x f(X_{t_{i-1}})\,\Delta X_i+\tfrac12\partial_{xx}f(X_{t_{i-1}})\,\Delta X_i^2$ plus a Lagrange remainder $\tfrac12\big(\partial_{xx}f(\eta_i)-\partial_{xx}f(X_{t_{i-1}})\big)\Delta X_i^2$ at an intermediate point $\eta_i$ . The remainder vanishes exactly as in the remainder step of Theorem 1, since $\partial_{xx}f$ is uniformly continuous on the compact range of $X$ , so $\max_i\abs{\partial_{xx}f(\eta_i)-\partial_{xx}f(X_{t_{i-1}})}\to0$ pathwise, while $\sum_i\Delta X_i^2$ stays bounded along the subsequence on which it converges to $\int_0^t\sigma_s^2\,ds$ . The increment $\Delta X_i$ splits into its drift part, contributing $\int b\,\partial_x f\,ds$ , and its diffusion part, contributing $\int\sigma\,\partial_x f\,dW$ .

For the second-order space term we replace the true increment $\Delta X_i=\int_{t_{i-1}}^{t_i}b_s\,ds+\int_{t_{i-1}}^{t_i}\sigma_s\,dW_s$ by its frozen-coefficient surrogate $b_{t_{i-1}}\Delta t_i+\sigma_{t_{i-1}}\Delta W_i$ . The error in each integrand is controlled by the simple-process approximation that defines the stochastic integral. The step process $\sigma_{t_{i-1}}\mathbf 1_{(t_{i-1},t_i]}$ converges to $\sigma$ in $\mathcal L^2$ , so the squared diffusion error $\sum_i\big(\int_{t_{i-1}}^{t_i}\sigma_s\,dW_s-\sigma_{t_{i-1}}\Delta W_i\big)^2\to0$ in $L^1$ , and the drift error $\int_{t_{i-1}}^{t_i}b_s\,ds-b_{t_{i-1}}\Delta t_i$ is a finite-variation term of total order $o(1)$ ; after multiplication by the bounded $\partial_x f$ and $\partial_{xx}f$ these errors vanish in probability in both the first- and second-order sums. With the surrogate,

\Delta X_i^2=\sigma_{t_{i-1}}^2\Delta W_i^2+2b_{t_{i-1}}\sigma_{t_{i-1}}\Delta t_i\,\Delta W_i+b_{t_{i-1}}^2(\Delta t_i)^2. \tag{6}

The first piece converges against $\partial_{xx}f$ to $\int\sigma_s^2\,\partial_{xx}f\,ds$ by replacing $\Delta W_i^2$ with $\Delta t_i$ at $L^2$ cost $2\sum(\sigma^2\partial_{xx}f)^2(\Delta t_i)^2\to0$ as in Equation (3) (finite since $\sigma$ and $\partial_{xx}f$ are bounded) followed by Riemann convergence; the cross piece is bounded by $C(\max_i\abs{\Delta W_i})\sum_i\Delta t_i\to0$ by continuity of $W$ , and the last by $C\,\text{mesh}\sum_i\Delta t_i\to0$ , the constants $C=\sup\abs{b\sigma\partial_{xx}f}$ and $\sup\abs{b^2\partial_{xx}f}$ finite by the boundedness secured above. The time term sums to the Riemann integral $\int\partial_t f\,ds$ . Collecting the $ds$ and $dW$ terms gives Equation (5) on $[0,t\wedge\tau_N]$ ; letting $N\to\infty$ removes the boundedness assumption.

#Integration by parts and geometric Brownian motion

Applying the formula to a product gives the stochastic analogue of the Leibniz rule, with the extra covariation term.

Corollary3

For Ito processes $X$ and $Y$ , $d(X_tY_t)=X_t\,dY_t+Y_t\,dX_t+d\qv{X,Y}_t$ , where $d\qv{X,Y}_t$ is the covariation differential.

Proof

Take $X$ and $Y$ driven by the common $W$ , so that $X+Y$ is again an Ito process of the form Equation (4) (its coefficients the sums of those of $X$ and $Y$ ) and the single-driver Theorem 2 applies to each of $X+Y$ , $X$ , $Y$ . Polarise, using only that theorem applied to $u\mapsto u^2$ (with the localisation above) along the single Ito processes $X+Y$ , $X$ , and $Y$ . This gives $d((X+Y)^2)=2(X+Y)\,d(X+Y)+d\qv{X+Y}$ , $d(X^2)=2X\,dX+d\qv X$ , and $d(Y^2)=2Y\,dY+d\qv Y$ , and $\qv{X+Y}=\qv X+2\qv{X,Y}+\qv Y$ . Writing $XY=\tfrac12\big((X+Y)^2-X^2-Y^2\big)$ and subtracting,

d(XY)=\tfrac12\big[2(X+Y)\,d(X+Y)-2X\,dX-2Y\,dY\big]+\tfrac12\big[d\qv{X+Y}-d\qv X-d\qv Y\big]=X\,dY+Y\,dX+d\qv{X,Y}_t, \tag{7}

which is the stated rule.

Corollary4

The geometric Brownian motion $X_t=X_0\exp\big((\mu-\tfrac12\sigma^2)t+\sigma W_t\big)$ solves $dX_t=\mu X_t \,dt+\sigma X_t\,dW_t$ .

Proof

Apply Theorem 2 with $X=W$ ( $b_s=0$ , $\sigma_s=1$ , $\qv W_t=t$ ) to $f(t,x)=X_0\exp((\mu-\tfrac12\sigma^2)t+\sigma x)$ evaluated along $W$ , so that $X_t=f(t,W_t)$ , noting $\partial_t f=(\mu-\tfrac12\sigma^2)f$ , $\partial_x f=\sigma f$ , and $\partial_{xx}f=\sigma^2 f$ . Substituting, the $ds$ coefficient is $(\mu-\tfrac12\sigma^2)f+\tfrac12\sigma^2 f=\mu f$ and the $dW$ coefficient is $\sigma f$ , so $dX_t=\mu X_t\,dt+\sigma X_t\,dW_t$ . The drift correction $-\tfrac12\sigma^2$ in the exponent is exactly the Ito term, the reason the expected growth rate $\mu$ exceeds the median growth rate.

Ito's formula is the computational heart of stochastic calculus, turning every smooth transformation of a process into another Ito process with an explicit drift and diffusion. The half-times-second-derivative correction is where the quadratic variation enters every calculation, and it is the reason an option's value satisfies a second-order partial differential equation and the reason a log-price drifts below its arithmetic mean. It is the tool through which the stochastic differential equations of the next post are solved and verified.

[1]

B. Øksendal, Stochastic Differential Equations: An Introduction with Applications, 6th ed. Springer, 2003.

[2]

I. Karatzas and S. E. Shreve, Brownian Motion and Stochastic Calculus, 2nd ed. Springer, 1991.

Explore connections

see in the atlas

referenced by (3)

cite

@misc{itos-formula,
  author = {Zac Kienzle},
  title  = {Ito's Formula},
  year   = {2026},
  month  = {06},
  url    = {https://zackienzle.com/blog/itos-formula}
}

#Ito's formula for Brownian motion

Theorem1

Let $f$ be twice continuously differentiable with bounded first and second derivatives. Then for every $t$ , almost surely,

f(W_t)=f(W_0)+\int_0^t f'(W_s)\,dW_s+\frac12\int_0^t f''(W_s)\,ds. \tag{1}

Proof

f(W_t)-f(W_0)=\sum_i\Big(f'(W_{t_{i-1}})\Delta_i+\tfrac12 f''(W_{t_{i-1}})\Delta_i^2\Big)+\tfrac12\sum_i\big( f''(\xi_i)-f''(W_{t_{i-1}})\big)\Delta_i^2, \tag{2}

splitting the second-order term into its value at the left endpoint and a remainder.

\E\Big[\Big(\sum_i f''(W_{t_{i-1}})\big(\Delta_i^2-(t_i-t_{i-1})\big)\Big)^2\Big]=\sum_i\E\big[f''(W_{t_{i-1 }})^2\big]\,2(t_i-t_{i-1})^2\le 2\norm{f''}_\infty^2\,t\cdot\text{mesh}, \tag{3}

The differential shorthand for Equation (1) is $df(W_t)=f'(W_t)\,dW_t+\tfrac12 f''(W_t)\,dt$ , the extra half-times-second-derivative term being the entire content of the formula.

#The general formula

The same expansion applies to an Ito process, a process of the form

X_t=X_0+\int_0^t b_s\,ds+\int_0^t\sigma_s\,dW_s, \tag{4}

Theorem2

For an Ito process $X$ and a function $f(t,x)$ with $f$ , $\partial_t f$ , $\partial_x f$ , and $\partial_{xx} f$ continuous,

f(t,X_t)=f(0,X_0)+\int_0^t\Big(\partial_t f+b_s\,\partial_x f+\tfrac12\sigma_s^2\,\partial_{xx}f\Big)ds+ \int_0^t\sigma_s\,\partial_x f\,dW_s, \tag{5}

the partial derivatives evaluated at $(s,X_s)$ .

Proof

\Delta X_i^2=\sigma_{t_{i-1}}^2\Delta W_i^2+2b_{t_{i-1}}\sigma_{t_{i-1}}\Delta t_i\,\Delta W_i+b_{t_{i-1}}^2(\Delta t_i)^2. \tag{6}

#Integration by parts and geometric Brownian motion

Applying the formula to a product gives the stochastic analogue of the Leibniz rule, with the extra covariation term.

Corollary3

For Ito processes $X$ and $Y$ , $d(X_tY_t)=X_t\,dY_t+Y_t\,dX_t+d\qv{X,Y}_t$ , where $d\qv{X,Y}_t$ is the covariation differential.

Proof

d(XY)=\tfrac12\big[2(X+Y)\,d(X+Y)-2X\,dX-2Y\,dY\big]+\tfrac12\big[d\qv{X+Y}-d\qv X-d\qv Y\big]=X\,dY+Y\,dX+d\qv{X,Y}_t, \tag{7}

which is the stated rule.

Corollary4

The geometric Brownian motion $X_t=X_0\exp\big((\mu-\tfrac12\sigma^2)t+\sigma W_t\big)$ solves $dX_t=\mu X_t \,dt+\sigma X_t\,dW_t$ .

Proof

[1]

B. Øksendal, Stochastic Differential Equations: An Introduction with Applications, 6th ed. Springer, 2003.

[2]

I. Karatzas and S. E. Shreve, Brownian Motion and Stochastic Calculus, 2nd ed. Springer, 1991.

Explore connections

see in the atlas

referenced by (3)

cite

@misc{itos-formula,
  author = {Zac Kienzle},
  title  = {Ito's Formula},
  year   = {2026},
  month  = {06},
  url    = {https://zackienzle.com/blog/itos-formula}
}