This simple pipeline works shockingly well: we substantially outperform (find more interpretable+predictive hypotheses) two recent baselines which use LLMs alone for hypothesis generation (no SAE), and also BERTopic, a classic embedding clustering method. 4/

Comments