An Interpretation of KL Divergence
Imagine a lottery game. Let
- For a bet of
on an outcome , the house pays you where is the probability they assign to the outcome . - This is the optimal way for them to make money, if they believe the true distribution is
.
Now, let’s talk about what you do as a player:
- Suppose you know the true distribution of outcomes
. - To maximize your winnings, you should bet proportional to
. - Suppose you bet 1 dollar total.
- Then you optimally bet
dollars for each outcome.
Your expected log-winnings are:
This is actually the formula for KL-divergence.
In other words,
To Do: review other interpretations.
Last Reviewed: 1/20/25