When we output a forecast, we're either explicitly or implicitly outputting a
probability distribution.
For example, if we forecast the AQI in Berkeley tomorrow to be "around" 30, plus
Cross-posted from the BAIR Blog
[https://bair.berkeley.edu/blog/2021/11/08/similarity/].
To understand neural networks, researchers often use similarity metrics to
measure how similar or different two neural networks are