If we recorded model evaluations like we record git commits, I'd look like a 10x engineer.

Comments