My high tech LLM eval methodology is seeing how good it is at making confetti. Currently, not bad... still kinda jank tho

Comments