Pleased to share the latest version of my paper with Arthur Spirling and @lexipalmer.bsky.social on replication using LMs
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
We show:
1. current applications of LMs in political science research *don't* meet basic standards of reproducibility...
1 / 3
Comments
We're gonna set up a routine to test this.
I would say though that irrespective of model, we see drastically different downstream results *between* models + several ceased to exist (making model id moot)
We are finding very similar findings for LLM Agent research.
Would anyone be interested in a collaboration on reproducibility on that?