We now introduce convex functions. For convenience, use
Let
The function
Geometrically, the line segment between
Let
The function
Note that an affine function
Let
Why? Suppose there exists
Why these functions are called convex? Someone may think their graphs are somehow concave. Actually, what we concern is the area above the graphs of convex functions.
Given a real-valued function
Let
Why do we consider convex functions? One of the most important properties of convex functions is that local minimum points must be global minimum.
If
Assume not. Then there exists
Suppose the domain of function
Note that the extended-value function of a convex function is still convex, since the epigraph remains the same.
It is easy to show the following generalization of Jensen’s inequality by induction.
Suppose
Intuitively we can generalize the inequality to the convex combination of infinite many variables. We actually have the following generalized form of the Jensen’s inequality but we should note that the proof of it is nontrivial since we cannot use induction!
Let
Now we would like to proof that
We verify the Jensen's inequality:
Technically we cannot use the weighted AM-GM inequality here, since the weighted version is usually proved by the Jensen's inequality and concavity of the logarithm, which is what we want to show!
We say a function
If
Prove by contradiction. Assume
Now let
There exists midpoint convex but not convex functions if we admit the axiom of choice. Such a function would have to be non-measurable.
Now we use this theorem to verify a more complicated example:
We admit the fact that
If
Combining all of above, we conclude that
We now consider some properties of convexity. Conversely, these properties also provide some criteria to verify convexity.
Let
Suppose
If
Suppose
The first order condition shows that convex functions have linear lower bounds.
We usually use
The first order condition also holds for the strict convexity if applying strict inequality. For the proof for strict convexity, the
Suppose
Let
Applying Jensen inequality (writing
An important corollary of the first order condition is the property of monotone gradient.
Let
The property of monotone gradients indicates that the second order derivative is somehow nonnegative. Assume
Suppose
Furthermore, if
For strict convexity, we can replace
Consider the function
Similarly, consider the function
However, for a series of special functions, the equivalent relation of strict convexity holds. Consider quadratic functions,
The following figures show the different convexity of