I have noticed the same thing. Everybody talks about how many GPUs, but consider the nuance that goes into the HF part of RLHF. The leverage of that interaction is incredible.

Comments