Convex Functions
1. Basic properties and examples
1.1 Definitions
A function is convex if is a convex set and if for all , and with , we have
We say is concave if is convex.
1.2 Extended-value extensions
It is often convenient to extend a convex function to all of by defining its value to be outside its domain. If is convex, define its extended-value extension by
is also convex.
1.3 First-order conditions
Suppose is differentiable (i.e., its gradient exists at each point in , which is open). Then is convex if and only if is convex and
holds for all . Similarly, is concave if and only if is convex and
for all .
1.4 Second-order conditions
We now assume is twice differentiable, that is, its Hessian or second derivative exists at each point of , which is open. Then is convex if and only if is convex and its Hessian is positive semidefinite: for all ,
The Hessian matrix of function is defined by
For a function on , this reduces to the simple condition , which means that the derivative is nondecreasing. The condition can be interpreted geometrically as the requirement that the graph of the function have positive curvature at .
1.5 Examples
Omitted
1.6 Sublevel sets
The -sublevel set of a function is defined as
Sublevel sets of a convex function are convex, for any value of .
Similarly, if is concave, its -superlevel set, given by , is a convex set.
1.7 Epigraph
The graph of a function is defined as
which is a subset of . The epigraph of a function is defined as
A function is convex if and only if its epigraph is a convex set. A function is concave if and only if its hypograph, defined as
is a convex set.
Many results for convex functions can be proved (or interpreted) geometrically using epigraphs, and applying results for convex sets. As an example, consider the first-order condition for convexity:
where is convex and . We can interpret this basic inequality geometrically in terms of . If , then
We can express this as:
This means that the hyperplane defined by supports at the boundary point .
1.8 Jensen's inequality and extensions
The basic inequality
is sometimes called Jensen's inequality.
As in the case of convex sets, the inequality extends to infinite sums, integrals, and expected values. For example, if on , , then
If is a random variable such that with probability one, and is convex, then we have
All of these inequalities are now called Jensen's inequality.
2. Operations that preserve convexity
2.1 Nonnegative weighted sums
If is convex in for every , and for every , then the function defined as
is convex in (provided that the integral exists).
2.2 Composition with an affine mapping
Suppose Define by
with . If is convex, then so is .
2.3 Pointwise maximum and supremum
If and are convex functions then their pointwise maximum , defined by
with is also convex.
2.4 Composition
Suppose and they are convex, then we discuss the convexity of their composition , defined as
Omitted.
3. The conjugate function
3.1 Definitions
The supermum of a subset of a partially ordered set is the least element in that is greater than or equal to each element of , if such an element exists.
Let . The function defined as
is called the conjugate of the function . The domain of the conjugate function consists of for which the supremum is finite, i.e., for which the difference is bounded above on .
We see immediately that is convex, since it is the pointwise supermum of a family of convex functions of . This is true whether or not is convex.
3.2 Basic properties
Fenchel's inequality
This is quite obvious.
Conjugate of the conjugate
The conjugate of the conjugate of a convex function is the original function.
Differentiable functions ★
The conjugate of a differentiable function is also called the Legendre transform of .
Suppose is convex and differentiable, with . Any maximizer of satisfies , and conversely, if satisfies , then maximizes . Therefore, if , we have
This allows us to determine for any for which we can solve the gradient equation for any .
We can express this another way. Let be arbitrary and define . Then we have
Scaling and composition with affine transformation
Suppose is nonsingular and . Then the conjugate of is
with .
Sums of independent functions
If , where and are convex functions with conjugates and , respectively, then
4. Quasiconvex functions
4.1 Definition
A function is called quasiconvex if its domain and all its sublevel sets
for all , are convex. A function is quasiconcave if is quasiconvex, i.e., every superlevel set is convex.
A function that is both quasiconvex and quasiconcave is called quasilinear. If a function is quasilinear, then its domain, and every level set is convex.
For a function on , quasiconvexity requires that each sublevel set to be an interval. All convex functions are quasiconvex, but not all quasiconvex functions are convex, so quasiconvexity is a generalization of convexity.
4.2 Basic properties
A function is quasiconvex if and only is convex and for any and ,
Quasiconvex functions on
A continuous function is quasiconvex if and only if at least one of the following conditions holds:
- is nondecreasing
- is nonincreasing
- there is a point such that for (and ), is nonincreasing, and for (and ), is nondecreasing.
4.3 Differentiable quasiconvex functions
First-order conditions
Suppose is differentiable. Then is quaisconvex if and only if is convex and for all
Proof It suffices to proove the result for a function on ; the general result follows by restrication to an arbitrary line.
Suppose is differentiable function on and satisfies
Suppose where . We assume , and show that for . Suppose there exists a with . Since is differentiable, we can choose a that also satisfies . Because
However, implies , which contradicts .
To prove sufficiency, assume is quasiconvex. Suppose . By the definition of quasiconvexity for . Diving both sides by , taking limit for , we obtain
The condition (4.1) has a simple geometric interpretation when . It states that defines a supporting hyperplane to the sublevel set , at the point .
Second-order conditions
Now suppose is twice differentiable. If is quasiconvex, then for all , and all , we have
For a quasicconvex function on , this reduces a simple condition
For a quasiconvex function on , when , the condition (4.2) means that is positive semidefinite on the -dimensional subspace . This implies that can have at most one negative eigenvalue.
As a (partial) converse, if satisfies
for and all , then is quasiconvex.
4.4 Operations that preserve quasiconvexity
Omitted
4.5 Representation via family of convex functions
In the sequel, it will be convenient to represent the sublevel sets of a quasiconvex function (which are convex) via inequalities of convex functions. We seek a family of convex functions , indexed by , with
To see that such a representable always exists, we can take
5. Log-concave and log-convex functions
Omitted
6. Convexity with respect to generalized inequalities
6.1 Monotonicity with respect to a generalized inequality
Suppose is a proper cone with associated generalized inequality . A function is called K-nondecreasing if
and K-increasing if
Gradient conditions with monotonicity
A differentiable function , with convex domain, is -nondecreasing if and only if
for all . The converse is not true.
6.2 Convexity with respect to a generalized inequality
Suppose is a proper cone with associated generalized inequality . We say is -convex if for all , and ,
Dual characterization of -convexity
A function is -convex if and only for every , the function is convex.
Differentiable -convex functions
A differentiable function is -convex if and only if its domain is convex, and for all ,
(Here is the derivative or Jacobian matrix of at ).
Suppose , the Jacobian matrix of is defined as
Composition theorem
If is -convex, is convex, and (the extended-value extensions of ) is -nondecreasing, then is convex.
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步
· 全程不用写代码,我用AI程序员写了一个飞机大战
· DeepSeek 开源周回顾「GitHub 热点速览」
· 记一次.NET内存居高不下排查解决与启示
· MongoDB 8.0这个新功能碉堡了,比商业数据库还牛
· .NET10 - 预览版1新功能体验(一)