A function is said to be convex for any if,

This inequality is known as Jensen’s inequality.

Let us consider a convex funtion as shown in the figure below:

Convex Function Interpretation

Consider two points and with corresponding values of and as shown in the above figure. The blue curve between and in the above figure represent the left hand side of equation . The red curve connecting with represents the right hand side of equation . Then, in this simple scenario, equation can be interpreted as, “a function is convex if, for every pair , the blue curve between and is below the red line.”

From ,

Taking limit on both sides, we get,

where, is directional derivative.

The Taylor series expansion of is given by,

Taking limit on both sides, we get,

From equation and ,

Thus, if is differentiable, and its gradients exists for all , then,

Let us continue with our running example of and try to decipher the meaning of equation through the figure below:

gradient characterization

If we make the gradient point towards the same direction as , then we obtain the green arrow in the above figure. According to equation , “for a function to be convex, such green arrow at must be below the red line.”

Alpha-Strongly Convex Function

A function is said to be -strongly convex if,

Beta-Smooth Function

A function is said to be -smooth if,

The above condition is equivalent to the Lipschitz condition over gradients,

The equations and are also equivalent with,


Proof of :

From , we have,

Multiplying both sides by , we get,

From Cauchy-Schwarz inequality, we have,

From equation and , we get,

Equation can be written as,

The Taylor expansion of a function about a point is given by,

From equation , we have,

From equation and , we get,

Subtracting equation from , we get,

Let

and

Then, becomes,

and, becomes,

Solving equation and , we get,

From equation ,

Interchanging and , we get,

And this concludes our proof of .


Proof of :

To prove , we can rewrite as,

Then, again using Cauchy-Schwarz inequality as before,

If we solve it as before, then we obtain,

And this concludes our proof of .


Proof of :

Let for all . Then from , we have,

Multiplying equation by and equation by , we get,

Adding and , we get,

And this concludes our proof of .


Proof of :

The proof of follows similarly starting from .

Contents

ArkTetra © 2026