I was curious to see how much I could speed up the decode function of https://github.com/vanniktech/blurhash/ without looking at the bytecode/assembly. I got it down to ~4x faster on a Pixel 6. Would you be interested in a write-up about how I got there step-by-step?
But yes, we all want to read about how you got a 4x improvement. Or, who knows, maybe even better?
It's currently used in Tusky and Jellyfin for Android TV, and it would be nice to make it even faster.