Information Entropy
Entropy is a measure of the uncertainty in a random variable's possible outcomes.
It is highest when there are many equally likely outcomes. As you introduce more predictability (one of the possible values becomes more probable than the others), entropy decreases.
It measures how many yes-or-no "questions" you need to ask, on average, to pin down a value drawn from the distribution. Since you would start by asking the question most likely to get the correct answer, values from a low-entropy distribution require smaller message sizes on average to send.
The entropy of a variable $X$ with probability distribution $p$ is expressed as:

$$H(X) = \sum_{x} p(x) \log_2 \frac{1}{p(x)}$$

The expression is commonly inverted and rewritten like this:

$$H(X) = -\sum_{x} p(x) \log_2 p(x)$$
When using log base 2, the unit of entropy is the bit (one yes-or-no question).
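As a quick sanity check on the "questions" intuition, take a four-outcome distribution with probabilities 1/2, 1/4, 1/8, 1/8. Asking about the most likely outcome first resolves the four cases with 1, 2, 3, and 3 questions respectively, for an average of $\tfrac{1}{2}(1) + \tfrac{1}{4}(2) + \tfrac{1}{8}(3) + \tfrac{1}{8}(3) = 1.75$ questions, which is exactly the entropy of that distribution.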
In code:
import numpy as np

def entropy(dist):
    # Shannon entropy (in bits) of a discrete probability distribution
    return -np.sum(np.array(dist) * np.log2(dist))
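If SciPy happens to be available (an optional cross-check, not something the examples below rely on), scipy.stats.entropy computes the same quantity when given base=2:

from scipy.stats import entropy as scipy_entropy

scipy_entropy([0.5, 0.5], base=2)  # 1.0, same as entropy([0.5, 0.5])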
When the variable has a 50/50 distribution, you will always need to ask one question to find the answer:
dist = [1/2.]*2
dist
[0.5, 0.5]
entropy(dist)
1.0
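That matches the formula: $-\left(\tfrac{1}{2}\log_2\tfrac{1}{2} + \tfrac{1}{2}\log_2\tfrac{1}{2}\right) = -\left(\tfrac{1}{2}(-1) + \tfrac{1}{2}(-1)\right) = 1$ bit.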
When you have only one possible outcome, you don't need to ask any questions:
dist = [1.]
dist
[1.0]
entropy(dist)
-0.0
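(The -0.0 is just a signed floating-point zero from multiplying -1 by 0.0; the entropy is 0.) One caveat with the entropy function above: a distribution containing an exact 0 would produce nan, because 0 * np.log2(0) evaluates to 0 * -inf. If you need to handle that, here is a minimal sketch that applies the usual convention $0 \cdot \log_2 0 = 0$ by dropping zero-probability outcomes (the name entropy_nonzero is just illustrative):

def entropy_nonzero(dist):
    # Entropy in bits, treating 0 * log2(0) as 0 by convention
    p = np.asarray(dist, dtype=float)
    p = p[p > 0]  # drop zero-probability outcomes
    return -np.sum(p * np.log2(p))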
When you have a lot of equally likely possibilities, you have to ask a lot of questions and entropy is high:
dist = [1/100.]*100
dist[:5]
[0.01, 0.01, 0.01, 0.01, 0.01]
entropy(dist)
6.643856189774724
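For $n$ equally likely outcomes every term in the sum is $\frac{1}{n}\log_2 n$, so the total collapses to $H = \log_2 n$; here that is $\log_2 100 \approx 6.64$ bits.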
When you have one really likely possibility, you only need to ask a question in the rare case that the answer turns out to be the unlikely one, so entropy is low:
dist = [99/100] + [1/100.]
dist
[0.99, 0.01]
entropy(dist)
0.08079313589591118
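Plugging this distribution into the formula: $-(0.99 \log_2 0.99 + 0.01 \log_2 0.01) \approx 0.0144 + 0.0664 \approx 0.081$ bits, with almost all of the entropy contributed by the rare outcome.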
Claude Shannon borrowed the term Entropy from thermodynamics as part of his theory of communication.
Cover from How Claude Shannon Invented the Future.
References
Khan Academy Labs. "Information entropy | Journey into information theory | Computer Science | Khan Academy." April 2014. URL: https://www.youtube.com/watch?v=2s3aJfRr9gE.