đź’ˇNew preprint & Python package: We use sparse autoencoders to generate hypotheses from large text datasets.
Our method, HypotheSAEs, produces interpretable text features that predict a target variable, e.g. features in news headlines that predict engagement. đź§µ1/
Our method, HypotheSAEs, produces interpretable text features that predict a target variable, e.g. features in news headlines that predict engagement. đź§µ1/
Comments
I understand your method and approach :), my problem is convincing reviewer's about comparative interpretability methods being better)
e.g. in "Automated Annotation of Disease Subtypes"
https://www.sciencedirect.com/science/article/abs/pii/S1532046424000686
https://github.com/rmovva/HypotheSAEs 7/