What are good (recent) examples of model families where smaller models are distilled from the bigger ones?

Comments