Let
Then
Proof
Define functions
Also define
We now just need to show that
Suppose
Then
so we conclude
But by definition
It trivially follows that
Theorem
Let
Then Mathematical Entropy satisfies:
Proof
By Data Processing Property of the Mutual Information:
and the inequality will follow.
Corollary
Let
Then
Proof
Firstly, for any
so taking
This gives the bound.