(1/8) Excited to share some new work: TESS 2!
TESS 2 is an instruction-tuned diffusion LM that performs close to its autoregressive (AR) counterparts on general QA tasks, trained by adapting an existing pretrained AR model.
📜 Paper: https://arxiv.org/abs/2502.13917
🤖 Demo: https://huggingface.co/spaces/hamishivi/tess-2-demo

More below ⬇️
