Convex Optimization

  • 2.1.1 Line

    Suppose x1x2 are two points in Rn. Points of the form

    y=θx1+(1θ)x2

    where θ ∈ R, form the line passing through x1 and x2.

  • 2.1.2 Affine sets

    (It is hyperplane (may not passing the origin))

    A set CRn is affine if the line through any two distinct points in C lies in C

    We refer to a point of the form θ1x1+···+θkxk, where θ1+···+θk=1, as an affine combination of the points x1,...,xk

    If C is an affine set and x0C, then the set

    V=Cx0={xx0|xC}

    is a subspace

    The set of all affine combinations of points in some set CR n is called the affine hull of C, and denoted affC:

    affC={θ1x1++θkxk|x1,,xkC,θ1++θk=1}

    The affine hull is the smallest affine set that contains C

  • 2.1.3 Affine dimension and relative interior

    We define the affine dimension of a set C as the dimension of its affine hull

    We define the relative interior of the set C, denoted relintC as its interior relative to affC:

    relint={xC|B(x,r)affCC for some r>0}

    where B(x,r)={y|yxr}, the ball of radius r and center x in the norm

    We can then define the relative boundary of a set C as clCrelintC, where clC is the closure of C.

  • 2.1.4 Convex sets

    A set C is convex iff

    θx1+(1θ)x2C,θ[0,1],x1,x2C

    Roughly speaking, a set is convex if every point in the set can be seen by every other point, along an unobstructed straight path between them, where unobstructed means lying in the set. Every affine set is also convex, since it contains the entire line between any two distinct points in it, and therefore also the line segment between the points.

    We call a point of the form θ1x1++θkxk, where θ1++θk=1 and θi0, a convex combination of the points x1,,xk. As with affine sets, it can be shown that a set is convex if and only if it contain contains every convex combination of its points. A convex combination of points can be thought of as a mixture or weighted average of the points, with θi the fraction of xi in the mixture.

    The convex hull of a set C, denoted convC, is the set of all convex combinations of points in C:

    convC={θ1x1++θkxk|xiC,θi0θ1++θk=1}

    As the name suggests, the convex hull convC is always convex. It is the smallest convex set that contains C

  • 2.1.5 Cones

    A set C is called a cone, or nonnegative homogeneous, if for every xC and θ0, we have θxC. A set C is a convex cone if it is convex and a cone, which means that for anyx1,x2C and θ1,θ20, we have

    θ1x1+θ2x2C.

    A point of the form θ1x1++θkxk with θ1,,θk0 is called a conic combination (or a nonnegative linear combination) of x1,,xk. If xi are in a convex cone C, then every conic combination of xi is in C. Conversely, a set C is a convex cone if and only if it contains all conic combinations of its elements.

    The conic hull of a set C is the set of all conic combinations of points in C, i.e.,

    {θ1x1++θkxk|xiC,θi0,i=1,,k}

    which is also the smallest convex cone that contains C.

Convex functions

  • 3.1 Basic properties and examples

  • 3.1.1 Definition

    A function f:RnR is convex if domf is a convex set and

    x,ydomf,0θ1,f(θx+(1θ)y)θf(x)+(1θ)f(y)

    We say f is concave if f is convex, and strictly concave if f is strictly convex.

    A function is convex if and only if it is convex when restricted to any line that intersects its domain. In other words f is convex if and only if for all xdomf and all v, the function g(t)=f(x+tv) is convex (on its domain, {t|x+tvdomf}). This property is very useful, since it allows us to check whether a function is convex by restricting it to a line.

  • 3.1.2 Extended-value extensions

    It is often convenient to extend a convex function to all of Rn by defining its value f to be outside its domain.

    If f is convex we define its extended-value extension f~:RnR{} by

    f~(x)={f(x)xdomf.xdomf.
  • 3.1.3 First-order conditions

    Suppose f is differentiable (i.e., its gradient f exists at each point in domf, which is open). Then f is convex if and only if domf is convex and f(y)f(x)+f(x)T(yx) holds for all x,ydomf

    Strict convexity can also be characterized by a first-order condition: f is strictly convex if and only if domf is convex and for x,ydomf,xy, we have

    f(y)>f(x)+f(x)T(yx).
  • 3.1.4 Second-order conditions

    We now assume that f is twice differentiable, that is, its Hessian or second derivative 2f exists at each point in domf, which is open. Then f is convex if and only if domf is convex and its Hessian is positive semidefinite: for all xdomf

    2f(x)0.

    Strict convexity can be partially characterized by second-order conditions. If 2f(x)0 for all xdomf, then f is strictly convex. The converse, however, is not true: for example, the function f:RR given by f(x)=x4 is strictly convex but has zero second derivative at x=0.

    Remark 3.1 The separate requirement that domf be convex cannot be dropped from the first- or second-order characterizations of convexity and concavity. For example, the function f(x)=1x2, with domf={xR|x0}, satisfies f(x)>0 for all xdomf, but is not a convex function.

  • 3.1.6 Sublevel sets

    The α-sublevel set of a function f:RnR is defined as

    Cα={xdomf|f(x)α}.

  • 3.1.7 Epigraph

    The epigraph of a function f:RnR is defined as

    epif={(x,t)|xdomf,f(x)t}
  • 3.1.8 Jensen’s inequality and extensions

  • 3.1.9 Inequalities