Proof of Seperating Hyperplane Theorem (4)

If there exists two disjoint convex sets $C,D$ then there exists a hyperplane $\vec{a}^\top \vec{x} = b$ that seperates the two sets.

Define the two points $\vec{c} \in C$ and $\vec{d} \in D$ that are closest to each other, and let it be the distance between the two sets:

dist(C,D) = \inf \{||\vec{c} - \vec{d}||_2 | \vec{c}\in C, \vec{d} \in D \}

The $\inf$ function denotes the infimum ⇒ the largest lower bound of $||\vec{c} - \vec{d}||_2$

Then there exists a seperating function:

f(x)=(\vec{d} - \vec{c})^\top (\vec{x} - \frac{\vec{d}+\vec{c}}{2}) = 0

And we will see that

\forall \vec{c} \in C, f(\vec{c}) \le 0 \\ \forall \vec{d} \in D, f(\vec{d}) \le 0

We will prove those constraint by contradiction:

Assue $\exists \vec{u} \in D$ such that $f(\vec{u}) < 0$

\begin{split} f(\vec{u}) &= (\vec{d}-\vec{c})^\top (u-\frac{\vec{d}+\vec{c}}{2}) \\ &= (\vec{d}-\vec{c})^\top ((\vec{u} - \vec{d}) + \frac{1}{2} (\vec{d} - \vec{c})) \\ &=<\vec{d}-\vec{c}, \vec{u}-\vec{d}> + \underbrace{\frac{1}{2} ||\vec{d} - \vec{c}||_2^2}_{\text{we know that this is $\ge 0$}} \\ \end{split}

So implies that for $f(\vec{u}) < 0$ , $<\vec{d} - \vec{c}, \vec{u} - \vec{d}>$ has to be lower than 0

Because this inner product is negative, so if I move along the vector $\vec{u} - \vec{d}$ for some distance $t$ , then I may get closer to $\vec{c}$

Let

\vec{p} = \vec{d} + t(\vec{u} - \vec{d}) = t\vec{u}+(1-t)\vec{d}

If $t \in (0,1)$ then $\vec{p} \in D$ (Since $D$ is convex)

Consider

\begin{split} ||\vec{c} - \vec{p}||_2^2 &= ||\vec{c} - \vec{d} - t(\vec{u}-\vec{d})||^2 \\ &= ((\vec{d} - \vec{d}) - t(\vec{u}-\vec{d}))^\top((\vec{c} - \vec{d})-t(\vec{u}-\vec{d})) \\ &= ||\vec{c} - \vec{d}||^2 + t^2 ||\vec{u}-\vec{d}||^2 - 2t<\vec{c}-\vec{d},\vec{u}-\vec{d}> \end{split}

We want to show that $\vec{p}$ is closer than $\vec{d}$ to $\vec{c}$

So let’s write the next two terms

\begin{split} & t^2 ||\vec{u} - \vec{d}||^2 - 2t<\vec{c}-\vec{d}, \vec{u}-\vec{d}> \\ =& t^2 ||\vec{u} - \vec{d}||^2 + 2t\underbrace{<\vec{d}-\vec{c}, \vec{u} - \vec{d}>}_{\text{strictly }<0} \\ \end{split}

But we can always choose a value of $t$ such that the first term is always smaller in magnitude

t < -\frac{2<\vec{d}-\vec{c},\vec{u}-\vec{d}>}{||\vec{u}-\vec{d}||^2}

So we have proved a contradiction (because now the distance between $\vec{c}$ and $\vec{d}$ is no longer the minimal)