Convex Sets and Functions

Convexity is the dividing line between optimisation problems that can be solved reliably and those that, in general, cannot. A convex function curves upward everywhere, so it has a single basin and no false minima to trap a descent. A closed convex set can be strictly separated from any outside point by a hyperplane, the geometric root of every duality theorem. This post builds the two ideas and their characterisations, deriving the separating hyperplane from the projection onto a closed convex set proved in the Hilbert space track [1], [2]. The setting is Euclidean space $\R^n$ with inner product $\ip xy$.

#Convex sets and functions

Definition 1

A set $C\subseteq\R^n$ is convex when it contains the segment between any two of its points, that is, $\lambda x+(1-\lambda)y\in C$ for all $x,y\in C$ and $\lambda\in[0,1]$. A function $f:\R^n\to\R$ is convex when

$$f\big(\lambda x+(1-\lambda)y\big)\le\lambda f(x)+(1-\lambda)f(y) \tag{1}$$

for all $x,y$ and $\lambda\in[0,1]$, and strictly convex when the inequality is strict for $x\neq y$ and $\lambda\in(0,1)$.
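As a rough numerical illustration of inequality (1), the sketch below samples random pairs of points and weights and looks for a violation. The helper `violates_convexity` and the two test functions are hypothetical names chosen for this example, not part of the text. A sampled check that finds no violation is evidence of convexity, not a proof.

```python
import math
import random

def violates_convexity(f, dim, trials=2000, seed=0, tol=1e-9):
    """Return True if some sampled (x, y, lam) breaks inequality (1) for f."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-3.0, 3.0) for _ in range(dim)]
        y = [rng.uniform(-3.0, 3.0) for _ in range(dim)]
        lam = rng.random()
        # Point on the segment between x and y.
        z = [lam * xi + (1.0 - lam) * yi for xi, yi in zip(x, y)]
        if f(z) > lam * f(x) + (1.0 - lam) * f(y) + tol:
            return True
    return False

def squared_norm(v):
    """Convex: the squared Euclidean norm."""
    return sum(vi * vi for vi in v)

def sine_sum(v):
    """Not convex: sin is concave on [0, pi]."""
    return sum(math.sin(vi) for vi in v)

print(violates_convexity(squared_norm, dim=3))
print(violates_convexity(sine_sum, dim=3))
```

The tolerance `tol` only absorbs floating-point rounding; it is an assumption of this sketch and has no counterpart in the definition.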

A function is convex exactly when the region above its graph is a convex set, since the epigraph $\{(x,t):t\ge f(x)\}$ contains the segment between any two of its points precisely when Equation (1) holds. Convexity of a function is therefore a special case of convexity of a set, and the finite form of the defining inequality is Jensen's inequality, $f(\sum_i\lambda_i x_i)\le\sum_i \lambda_i f(x_i)$ for weights $\lambda_i\ge 0$ summing to $1$, which follows from Equation (1) by induction on the number of points. For smooth functions convexity is read from derivatives.

Proposition 2

A differentiable $f$ is convex if and only if $f(y)\ge f(x)+\ip{\nabla f(x)}{y-x}$ for all $x,y$, that is, the graph lies above each of its tangent planes. A twice continuously differentiable $f$ is convex if and only if its Hessian $\nabla^2 f(x)$ is positive semidefinite for every $x$.

Proof

Suppose $f$ is convex. For $\lambda\in(0,1)$, Equation (1) with weight $1-\lambda$ on $x$ and $\lambda$ on $y$ rearranges to $\frac{f(x+\lambda(y-x)) -f(x)}{\lambda}\le f(y)-f(x)$, and letting $\lambda\to 0^+$ the left side tends to the directional derivative $\ip{\nabla f(x)}{y-x}$, giving the tangent inequality.

Conversely, if the tangent inequality holds, apply it at the point $z=\lambda x+(1-\lambda)y$ to both $x$ and $y$, giving $f(x)\ge f(z)+\ip{\nabla f(z)}{x-z}$ and $f(y)\ge f(z)+\ip{\nabla f(z)}{y-z}$. Take the weighted sum with weights $\lambda$ and $1-\lambda$: the gradient terms cancel because $\lambda(x-z)+(1-\lambda)(y-z)=0$, leaving $\lambda f(x)+(1-\lambda)f(y)\ge f(z)$, which is Equation (1).

For the Hessian, $f$ is convex if and only if its restriction $g(s)=f(x+sv)$ to every line is convex, and a $C^2$ function of one variable is convex if and only if $g''(s)=\ip{v}{\nabla^2 f(x+sv)v}\ge 0$ for all $s$. Ranging over all $x$ and $v$, this says exactly that the Hessian is positive semidefinite everywhere.
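Both criteria of the proposition can be tried on a quadratic $f(v)=\tfrac12\ip{v}{Qv}$, whose gradient is $Qv$ and whose Hessian is the constant matrix $Q$. The sketch below is a minimal illustration under that assumption. The matrices `Q_bowl` and `Q_saddle` and all function names are invented for the example, and the positive semidefiniteness test is specific to symmetric $2\times 2$ matrices.

```python
import random

def quad(Q, v):
    """f(v) = 1/2 * v^T Q v for a symmetric 2x2 matrix Q."""
    return 0.5 * (Q[0][0] * v[0] ** 2 + 2.0 * Q[0][1] * v[0] * v[1] + Q[1][1] * v[1] ** 2)

def grad(Q, v):
    """Gradient of quad at v, namely Q v."""
    return [Q[0][0] * v[0] + Q[0][1] * v[1], Q[1][0] * v[0] + Q[1][1] * v[1]]

def is_psd_2x2(Q):
    """A symmetric 2x2 matrix is PSD iff its diagonal entries and determinant are >= 0."""
    det = Q[0][0] * Q[1][1] - Q[0][1] * Q[1][0]
    return Q[0][0] >= 0 and Q[1][1] >= 0 and det >= 0

def tangent_inequality_holds(Q, trials=1000, seed=0, tol=1e-9):
    """Check f(y) >= f(x) + <grad f(x), y - x> on random pairs (x, y)."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-2.0, 2.0), rng.uniform(-2.0, 2.0)]
        y = [rng.uniform(-2.0, 2.0), rng.uniform(-2.0, 2.0)]
        g = grad(Q, x)
        tangent = quad(Q, x) + g[0] * (y[0] - x[0]) + g[1] * (y[1] - x[1])
        if quad(Q, y) < tangent - tol:
            return False
    return True

Q_bowl = [[2.0, 1.0], [1.0, 3.0]]    # determinant 5: positive definite
Q_saddle = [[1.0, 2.0], [2.0, 1.0]]  # determinant -3: indefinite

print(is_psd_2x2(Q_bowl), tangent_inequality_holds(Q_bowl))
print(is_psd_2x2(Q_saddle), tangent_inequality_holds(Q_saddle))
```

For a quadratic the gap $f(y)-f(x)-\ip{\nabla f(x)}{y-x}$ equals $\tfrac12\ip{y-x}{Q(y-x)}$, so the two tests agree here by construction.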

#Separating and supporting hyperplanes

The geometric heart of convexity is that a closed convex set can be separated from any point it does not contain. The closest point of the set provides the separating direction.

Theorem 3

Let $C\subseteq\R^n$ be nonempty, closed, and convex, and let $z\notin C$. Then there is a vector $a\neq 0$ and a scalar $c$ with $\ip ax<c<\ip az$ for all $x\in C$.

Proof

Let $p$ be the projection of $z$ onto $C$, the unique nearest point, which exists because $C$ is nonempty, closed, and convex and $\R^n$ is complete. Set $a=z-p$, which is nonzero because $z\notin C$. By the variational inequality $\ip{z-p}{x-p}\le 0$ for all $x\in C$, we have $\ip ax \le\ip ap$ for all $x\in C$. Moreover $\ip az-\ip ap=\ip a{z-p}=\norm a^2>0$, so $\ip ap<\ip az$. Choosing $c$ strictly between $\ip ap$ and $\ip az$ separates, since $\ip ax\le\ip ap<c<\ip az$ for all $x\in C$.
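The construction in the proof can be sketched for one concrete closed convex set, the box $[0,1]^n$, where the projection is a coordinate-wise clip. The box, the point `z`, and the function names are assumptions made for this illustration. The scalar $c$ is taken as the midpoint between $\ip ap$ and $\ip az$, one of many valid choices.

```python
from itertools import product

def project_onto_box(z, lo=0.0, hi=1.0):
    """Nearest point of the box [lo, hi]^n to z: clip each coordinate."""
    return [min(max(zi, lo), hi) for zi in z]

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def separating_hyperplane(z, lo=0.0, hi=1.0):
    """Return (a, c) with <a, x> < c < <a, z> for every x in the box."""
    p = project_onto_box(z, lo, hi)
    a = [zi - pi for zi, pi in zip(z, p)]  # separating direction a = z - p
    if all(ai == 0.0 for ai in a):
        raise ValueError("z lies in the box, so there is nothing to separate")
    c = 0.5 * (dot(a, p) + dot(a, z))      # midpoint between <a, p> and <a, z>
    return a, c

z = [2.0, 0.5, -1.0]
a, c = separating_hyperplane(z)

# A linear functional attains its maximum over a box at a corner,
# so checking the corners checks the whole box.
corners = list(product([0.0, 1.0], repeat=len(z)))
print(a, c)
print(all(dot(a, x) < c for x in corners), c < dot(a, z))
```

Here the projection is $p=(1,0.5,0)$, so $a=(1,0,-1)$, $\ip ap=1$, $\ip az=3$, and the midpoint is $c=2$.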

At a boundary point the separating hyperplane becomes a supporting one, touching the set.

Theorem 4

Let $C\subseteq\R^n$ be convex with nonempty interior and let $x_0$ be a boundary point. Then there is a vector $a\neq 0$ with $\ip a{x}\le\ip a{x_0}$ for all $x\in C$, that is, a supporting hyperplane at $x_0$.

Proof

Pick an interior point $w\in\operatorname{int}C$ and set $z_k=x_0+\tfrac1k(x_0-w)$, which converge to $x_0$ and lie outside the closure $\overline C$. Indeed, if $z_k\in\overline C$, then $x_0$ is a proper convex combination of the interior point $w$ and the point $z_k\in\overline C$, namely $x_0=\tfrac1{k+1}w +\tfrac k{k+1}z_k$, which places $x_0$ in $\operatorname{int}C$ because the open segment from an interior point to any point of the closure lies in the interior, contradicting $x_0$ being on the boundary.

That principle holds because, picking $r>0$ with $B(w,r)\subseteq C$ and $y'\in C$ near $z_k$, convexity gives $\lambda B(w,r)+(1-\lambda)y'=B(\lambda w+(1-\lambda)y',\lambda r)\subseteq C$ for $\lambda\in(0,1]$, and choosing $y'$ with $(1-\lambda)\norm{y'-z_k}<\lambda r$ keeps $\lambda w+(1-\lambda)z_k$ inside this ball, hence in the interior of $C$.

The closure of a convex set is convex, since for $x_n\to x$, $y_n\to y$ with $x_n,y_n\in C$ the combination $\lambda x_n+(1-\lambda)y_n\in C$ converges to $\lambda x+(1-\lambda)y\in\overline C$. Theorem 3 therefore applies to the closed convex set $\overline C$ and each $z_k$, giving after normalisation a unit vector $a_k$ with $\ip{a_k}{x}\le\ip{a_k}{z_k}$ for all $x\in\overline C$. The unit vectors lie in the compact unit sphere, so by the Bolzano-Weierstrass theorem a subsequence converges to a unit vector $a$. Passing to the limit along this subsequence in $\ip{a_k}{x}\le\ip{a_k}{z_k}$, with $z_k\to x_0$, gives $\ip a{x}\le\ip a{x_0}$ for all $x\in C$, the supporting hyperplane.
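The limiting construction can be traced on the unit square $[0,1]^2$, a hypothetical example chosen because its projection is a coordinate-wise clip. The sketch below builds $z_k$ from an interior point `w`, separates it from the square as in the proof, and normalises the direction. The function names and the sample points are assumptions of the illustration.

```python
import math
from itertools import product

def supporting_normal(x0, w, k):
    """Unit normal a_k obtained by separating z_k = x0 + (x0 - w)/k from [0,1]^2."""
    z = [x0i + (x0i - wi) / k for x0i, wi in zip(x0, w)]
    p = [min(max(zi, 0.0), 1.0) for zi in z]  # projection of z_k onto the square
    a = [zi - pi for zi, pi in zip(z, p)]     # separating direction z_k - p
    norm = math.hypot(a[0], a[1])
    return [ai / norm for ai in a]

def supports(a, x0, tol=1e-12):
    """Check <a, x> <= <a, x0> at the four corners, which suffices for the square."""
    rhs = sum(ai * xi for ai, xi in zip(a, x0))
    return all(sum(ai * xi for ai, xi in zip(a, x)) <= rhs + tol
               for x in product([0.0, 1.0], repeat=2))

w = [0.5, 0.25]          # an interior point of the square
edge_point = [1.0, 0.5]  # boundary point on the right edge
corner = [1.0, 1.0]      # boundary point at a corner

for x0 in (edge_point, corner):
    a = supporting_normal(x0, w, k=10)
    print(x0, a, supports(a, x0))
```

On the edge the normal is $(1,0)$ whatever `w` is, while at the corner it is proportional to $x_0-w$, so different interior points give different supporting hyperplanes there. In this example $a_k$ does not change with $k$, so no subsequence argument is needed; that is a feature of the square, not of the general proof.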

#Local minima are global

The payoff of convexity for optimisation is that it abolishes the distinction between local and global minima.

Theorem 5

If $f$ is convex, then every local minimum is a global minimum, and the set of minimisers is convex. If $f$ is strictly convex, there is at most one minimiser.

Proof

Let $x^\ast$ be a local minimum, so $f(x^\ast)\le f(x)$ for $x$ near $x^\ast$, and suppose some $y$ had $f(y)<f(x^\ast)$. Along the segment, convexity gives $f(\lambda y+(1-\lambda)x^\ast)\le\lambda f(y)+(1- \lambda)f(x^\ast)<f(x^\ast)$ for every $\lambda\in(0,1]$, and taking $\lambda$ small enough that the point lies in the neighbourhood contradicts local minimality. So $f(x^\ast)\le f(y)$ for all $y$, a global minimum. The minimisers form the sublevel set $\{f\le f(x^\ast)\}$ at the minimum value, which is convex because $f$ is: if $f(x_1)$ and $f(x_2)$ are both at most $f(x^\ast)$, Equation (1) bounds $f$ at every point of the segment by $f(x^\ast)$ as well. If $f$ is strictly convex and $x_1\neq x_2$ both minimised, the midpoint would have $f(\tfrac12 x_1+\tfrac12 x_2)<\tfrac12 f(x_1)+\tfrac12 f(x_2)=f(x^\ast)$, below the minimum, which is impossible, so there is at most one minimiser.
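A one-dimensional grid search shows the contrast the theorem describes. The sketch below counts grid points that lie strictly below both neighbours, first for a strictly convex function and then for a double-well quartic. Both functions, the interval $[-3,3]$, and the grid size are choices made for this illustration.

```python
def grid_local_minima(f, lo, hi, n=2001):
    """Return the grid points whose value is strictly below both neighbours."""
    xs = [lo + (hi - lo) * i / (n - 1) for i in range(n)]
    ys = [f(x) for x in xs]
    return [xs[i] for i in range(1, n - 1)
            if ys[i] < ys[i - 1] and ys[i] < ys[i + 1]]

def convex_f(x):
    """Strictly convex: a strictly convex quadratic plus the convex |x|."""
    return (x - 1.0) ** 2 + abs(x)

def double_well(x):
    """Not convex: the second derivative 12 x^2 - 6 is negative near 0."""
    return x ** 4 - 3.0 * x ** 2 + x

convex_minima = grid_local_minima(convex_f, -3.0, 3.0)
well_minima = grid_local_minima(double_well, -3.0, 3.0)

print(len(convex_minima), convex_minima)
print(len(well_minima), [(x, double_well(x)) for x in well_minima])
```

The convex function has a single grid minimum near $x=0.5$. The double well has two, near $x\approx-1.30$ and $x\approx 1.13$, and the second has a higher value, so a descent started on the right would stop at a local minimum that is not global.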

Convexity is the structural assumption that turns optimisation from a search through a landscape of traps into a problem in which any local minimum found by descent is the answer. The separating hyperplane theorem yields the geometric fact that every closed convex set is the intersection of the closed half-spaces that contain it. This duality between a set and the linear functionals that bound it is the lever that the Lagrangian duality of the next post pulls to certify optimality. The quadratic programs of mean-variance portfolio choice and the convex cost of optimal execution are convex problems whose solutions this theory locates and guarantees.

[1] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004.

[2] R. T. Rockafellar, Convex Analysis. Princeton University Press, 1970.


cite

@misc{convex-sets-and-functions,
  author = {Zac Kienzle},
  title  = {Convex Sets and Functions},
  year   = {2026},
  month  = {06},
  url    = {https://zackienzle.com/blog/convex-sets-and-functions}
}
