It was lots of fun to co-lead this with @kennypeng.bsky.social, with coauthors @nkgarg.bsky.social, Jon Kleinberg, and @emmapierson.bsky.social! Feel free to reach out if we can be helpful. Links:

Draft: https://arxiv.org/abs/2502.04382
Python package: https://github.com/rmovva/HypotheSAEs
Demo: https://hypothesaes.org

9/9