Characteristic Functions

The characteristic function is the Fourier transform of a probability law, and it converts the two hardest operations on random variables into easy ones. Adding independent variables becomes multiplying their transforms, and convergence in distribution becomes pointwise convergence of the transforms, so the central limit theorem reduces to a Taylor expansion. This post develops the characteristic function from its definition through the inversion formula and the Levy continuity theorem, on the probability space and using the independence of the previous posts [1], [2].

#Definition and elementary properties

Definition1

The characteristic function of a random variable $X$ is $\varphi_X(t)=\E[e^{itX}]=\E[\cos tX]+i\E[\sin tX]$ , defined for every real $t$ .

The expectation exists because $e^{itX}$ is bounded, $\abs{e^{itX}}=1$ , so the characteristic function is defined for every law without integrability assumptions. This is its structural advantage over the moment generating function, which requires $\E[e^{tX}]$ to be finite near $0$ .

Proposition2

The characteristic function satisfies $\varphi_X(0)=1$ and $\abs{\varphi_X(t)}\le 1$ , is uniformly continuous on $\R$ , and for independent $X$ and $Y$ obeys $\varphi_{X+Y}=\varphi_X\varphi_Y$ .

Proof

At $t=0$ , $\varphi_X(0)=\E[1]=1$ , and $\abs{\varphi_X(t)}=\abs{\E[e^{itX}]}\le\E\abs{e^{itX}}=1$ . For uniform continuity, $\abs{\varphi_X(t+h)-\varphi_X(t)}=\abs{\E[e^{itX}(e^{ihX}-1)]}\le\E\abs{e^{ihX}-1}$ , a bound independent of $t$ , and $\abs{e^{ihX}-1}\le 2$ with $e^{ihX}-1\to 0$ pointwise as $h\to 0$ , so the dominated convergence theorem sends $\E\abs{e^{ihX}-1}\to 0$ , which is uniform continuity. For the product rule, independence makes $e^{itX}$ and $e^{itY}$ independent, so the factorisation of expectations applied to real and imaginary parts gives $\varphi_{X+Y}(t)=\E[e^{itX}e^{itY}]=\E[e^{itX}]\,\E[e^{itY}]=\varphi_X(t)\varphi_Y(t)$ .

The product rule is the reason characteristic functions suit sums. The distribution of a sum of independent variables, a convolution that is awkward to compute directly, becomes a pointwise product of transforms.

#Moments

Differentiating under the expectation reads the moments of $X$ off the derivatives of $\varphi_X$ at the origin.

Proposition3

If $\E\abs X^n<\infty$ , then $\varphi_X$ is $n$ times continuously differentiable with $\varphi_X^{(k)}(t) =\E[(iX)^k e^{itX}]$ , so $\varphi_X^{(k)}(0)=i^k\E[X^k]$ , and

\varphi_X(t)=\sum_{k=0}^n\frac{(it)^k}{k!}\E[X^k]+o(t^n)\quad\text{as }t\to 0. \tag{1}

Proof

The difference quotient $(e^{i(t+h)X}-e^{itX})/h$ converges to $iXe^{itX}$ as $h\to 0$ and is bounded in modulus by $\abs X$ , since $\abs{e^{ihX}-1}\le\abs{hX}$ . When $\E\abs X<\infty$ the dominated convergence theorem lets the derivative pass inside, $\varphi_X'(t)=\E[iXe^{itX}]$ . Iterating $k\le n$ times, with dominator $\abs X^k$ integrable by hypothesis, gives $\varphi_X^{(k)}(t)=\E[(iX)^k e^{itX}]$ , continuous in $t$ by dominated convergence, and at $t=0$ equal to $i^k\E[X^k]$ . The expansion Equation (1) is Taylor's theorem applied to the $n$ times differentiable $\varphi_X$ at the origin with these derivatives.

#The inversion formula and uniqueness

The characteristic function determines the law. The proof is an explicit formula recovering the probability of an interval from the transform.

Theorem4

For $a<b$ with $\P(X=a)=\P(X=b)=0$ ,

\P(a<X\le b)=\lim_{T\to\infty}\frac{1}{2\pi}\int_{-T}^{T}\frac{e^{-ita}-e^{-itb}}{it}\,\varphi_X(t)\,dt. \tag{2}

Proof

Write $\varphi_X(t)=\int_\R e^{itx}\,d\P_X(x)$ and insert it into the integral. The integrand is bounded by $b-a$ on $[-T,T]\times\R$ , since $\abs{(e^{-ita}-e^{-itb})/(it)}=\abs{\int_a^b e^{-its}\,ds}\le b-a$ , so the finite-measure Fubini theorem permits exchanging the integrals,

\frac{1}{2\pi}\int_{-T}^T\frac{e^{-ita}-e^{-itb}}{it}\varphi_X(t)\,dt=\int_\R\Big(\frac{1}{\pi}\int_0^T \frac{\sin t(x-a)-\sin t(x-b)}{t}\,dt\Big)d\P_X(x), \tag{3}

The inner expression collapses to a sine integral because, after division by $it$ , the cosine parts of $e^{it(x-a)}-e^{it(x-b)}$ carry a factor $1/t$ that is odd in $t$ and integrate to zero over the symmetric range, while the sine parts are even and survive. The Dirichlet integral $\int_0^T\frac{\sin ct}{t}\,dt\to\frac{\pi}{2}\operatorname{sgn}(c)$ as $T\to\infty$ , with the partial integrals bounded uniformly in $c$ and $T$ . So the inner expression is bounded and converges to $\tfrac12(\operatorname{sgn}(x-a)-\operatorname{sgn}(x-b))$ , which equals $1$ on $(a,b)$ , $\tfrac12$ at $x\in\{a,b\}$ , and $0$ outside $[a,b]$ . The bounded convergence theorem sends the right side of Equation (3) to $\P(a<X<b)+\tfrac12(\P(X=a)+\P(X=b))$ , which under the continuity hypothesis is $\P(a<X\le b)$ .

Corollary5

Two random variables with the same characteristic function have the same law.

Proof

The formula Equation (2) determines $\P(a<X\le b)$ from $\varphi_X$ for every $a,b$ that are not atoms. The non-atoms are all but countably many points, hence dense, so the distribution function is determined at a dense set of points and, being right-continuous, everywhere. The law is determined by its distribution function.

#Convergence in distribution and tightness

A sequence of laws converges in distribution, written $X_n\Rightarrow X$ , when $F_{X_n}(x)\to F_X(x)$ at every continuity point $x$ of $F_X$ . The characteristic functions control this convergence, but only once tightness prevents mass from escaping to infinity, which the next lemma controls by bounding the tail mass with the characteristic function near the origin.

Lemma6

For every $u>0$ , $\P\big(\abs X\ge 2/u\big)\le\dfrac{1}{u}\displaystyle\int_{-u}^{u}\big(1-\Re\varphi_X(t) \big)\,dt$ .

Proof

By change of variables and Fubini,

\frac1u\int_{-u}^u\big(1-\Re\varphi_X(t)\big)\,dt=\int_\R\frac1u\int_{-u}^u\big(1-\cos tx\big)\,dt\, d\P_X(x)=2\int_\R\Big(1-\frac{\sin ux}{ux}\Big)d\P_X(x), \tag{4}

the inner integral being $\frac1u(2u-2\sin(ux)/x)=2(1-\sin(ux)/(ux))$ . The integrand is nonnegative, and where $\abs{ux}\ge 2$ it satisfies $1-\frac{\sin ux}{ux}\ge 1-\frac1{\abs{ux}}\ge\tfrac12$ . Restricting the integral to $\{\abs x\ge 2/u\}$ and using this bound gives the right side at least $2\cdot\tfrac12\P(\abs X \ge 2/u)=\P(\abs X\ge 2/u)$ .

The estimate says a characteristic function close to $1$ near the origin forces the law to concentrate, because $1-\Re\varphi_X(t)$ small on $[-u,u]$ bounds the tail mass beyond $2/u$ .

#The Levy continuity theorem

Theorem7

Let $X_n$ have characteristic functions $\varphi_n$ . If $\varphi_n(t)\to\varphi(t)$ for every $t$ and $\varphi$ is continuous at $0$ , then $\varphi$ is the characteristic function of a law $\mu$ and $X_n\Rightarrow\mu$ .

Proof

First, tightness. Fix $\varepsilon>0$ . Since $\varphi(0)=\lim\varphi_n(0)=1$ and $\varphi$ is continuous at $0$ , choose $u>0$ with $\frac1u\int_{-u}^u(1-\Re\varphi(t))\,dt<\varepsilon/2$ . The integrands $1-\Re \varphi_n$ are bounded by $2$ and converge pointwise to $1-\Re\varphi$ , so by bounded convergence the integrals converge and $\frac1u\int_{-u}^u(1-\Re\varphi_n(t))\,dt<\varepsilon$ for all large $n$ . By Lemma 6, $\P(\abs{X_n}\ge 2/u)<\varepsilon$ for those $n$ , and enlarging $2/u$ to cover the finitely many remaining $n$ makes the family tight.

Now take any subsequence. By Helly's selection theorem, every sequence of distribution functions has a further subsequence converging at continuity points to a nondecreasing right-continuous limit $G$ , obtained by a diagonal extraction of convergent values on the rationals followed by right-continuous interpolation. Tightness forces $G$ to be a genuine distribution function, with no mass lost to $\pm\infty$ . Along that subsequence $X_{n_k}\Rightarrow\nu$ for the law $\nu$ of $G$ , and convergence in distribution with the uniformly bounded continuous integrands $e^{itx}$ gives $\varphi_{n_k}(t)\to\varphi_ \nu(t)$ . But $\varphi_{n_k}(t)\to\varphi(t)$ by hypothesis, so $\varphi_\nu=\varphi$ , and by Corollary 5 every subsequential limit is the same law $\mu$ with characteristic function $\varphi$ . A sequence all of whose subsequences have a further subsequence converging to the same limit converges to that limit, so $X_n\Rightarrow\mu$ .

The Levy continuity theorem is the analytic engine of the central limit theorem. To prove a sum of independent variables converges in distribution to a Gaussian, one shows its characteristic function, a product of the individual transforms by Proposition 2, converges pointwise to $e^{-t^2/2}$ , the transform of the standard normal, and the continuity theorem converts that pointwise convergence into the distributional limit. The expansion Equation (1) supplies the pointwise limit through a second-order Taylor approximation of each factor; that computation is carried out in the central limit theorem post. The characteristic function thereby reduces a statement about the shape of a distribution to a calculation with an ordinary function of a real variable.

[1]

R. Durrett, Probability: Theory and Examples, 5th ed. in Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 2019.

[2]

D. Williams, Probability with Martingales. Cambridge University Press, 1991.

Explore connections

see in the atlas

referenced by (8)

cite

@misc{characteristic-functions,
  author = {Zac Kienzle},
  title  = {Characteristic Functions},
  year   = {2026},
  month  = {05},
  url    = {https://zackienzle.com/blog/characteristic-functions}
}

#Definition and elementary properties

Definition1

The characteristic function of a random variable $X$ is $\varphi_X(t)=\E[e^{itX}]=\E[\cos tX]+i\E[\sin tX]$ , defined for every real $t$ .

Proposition2

The characteristic function satisfies $\varphi_X(0)=1$ and $\abs{\varphi_X(t)}\le 1$ , is uniformly continuous on $\R$ , and for independent $X$ and $Y$ obeys $\varphi_{X+Y}=\varphi_X\varphi_Y$ .

Proof

#Moments

Differentiating under the expectation reads the moments of $X$ off the derivatives of $\varphi_X$ at the origin.

Proposition3

If $\E\abs X^n<\infty$ , then $\varphi_X$ is $n$ times continuously differentiable with $\varphi_X^{(k)}(t) =\E[(iX)^k e^{itX}]$ , so $\varphi_X^{(k)}(0)=i^k\E[X^k]$ , and

\varphi_X(t)=\sum_{k=0}^n\frac{(it)^k}{k!}\E[X^k]+o(t^n)\quad\text{as }t\to 0. \tag{1}

Proof

#The inversion formula and uniqueness

The characteristic function determines the law. The proof is an explicit formula recovering the probability of an interval from the transform.

Theorem4

For $a<b$ with $\P(X=a)=\P(X=b)=0$ ,

\P(a<X\le b)=\lim_{T\to\infty}\frac{1}{2\pi}\int_{-T}^{T}\frac{e^{-ita}-e^{-itb}}{it}\,\varphi_X(t)\,dt. \tag{2}

Proof

\frac{1}{2\pi}\int_{-T}^T\frac{e^{-ita}-e^{-itb}}{it}\varphi_X(t)\,dt=\int_\R\Big(\frac{1}{\pi}\int_0^T \frac{\sin t(x-a)-\sin t(x-b)}{t}\,dt\Big)d\P_X(x), \tag{3}

Corollary5

Two random variables with the same characteristic function have the same law.

Proof

#Convergence in distribution and tightness

Lemma6

For every $u>0$ , $\P\big(\abs X\ge 2/u\big)\le\dfrac{1}{u}\displaystyle\int_{-u}^{u}\big(1-\Re\varphi_X(t) \big)\,dt$ .

Proof

By change of variables and Fubini,

\frac1u\int_{-u}^u\big(1-\Re\varphi_X(t)\big)\,dt=\int_\R\frac1u\int_{-u}^u\big(1-\cos tx\big)\,dt\, d\P_X(x)=2\int_\R\Big(1-\frac{\sin ux}{ux}\Big)d\P_X(x), \tag{4}

The estimate says a characteristic function close to $1$ near the origin forces the law to concentrate, because $1-\Re\varphi_X(t)$ small on $[-u,u]$ bounds the tail mass beyond $2/u$ .

#The Levy continuity theorem

Theorem7

Proof

[1]

R. Durrett, Probability: Theory and Examples, 5th ed. in Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 2019.

[2]

D. Williams, Probability with Martingales. Cambridge University Press, 1991.

Explore connections

see in the atlas

referenced by (8)

cite

@misc{characteristic-functions,
  author = {Zac Kienzle},
  title  = {Characteristic Functions},
  year   = {2026},
  month  = {05},
  url    = {https://zackienzle.com/blog/characteristic-functions}
}