The return of the Autoregressive Image Model: AIMv2 now going multimodal.
Excellent work by @alaaelnouby.bsky.social & team with code and checkpoints already up:

https://arxiv.org/abs/2411.14402
