Jacob Steinhardt

Jacob Steinhardt

ML Systems Will Have Weird Failure Modes

Previously, I've argued that future ML systems might exhibit unfamiliar, emergent capabilities [https://bounded-regret.ghost.io/p/1527e9dd-c48d-4941-9b14-4f7293318d5c/], and that thought experiments provide one approach [https://bounded-regret.ghost.io/p/a2d733a7-108a-4587-97fb-db90f66ce030/

Anchor Weights for ML

In the previous post [https://bounded-regret.ghost.io/thought-experiments-provide-a-third-anchor/], I talked about several "anchors" that we could use to think about future ML systems, including current ML systems, humans, ideal optimizers,