(1/8) Excited to share some new work: TESS 2!
TESS 2 is an instruction-tuned diffusion LM that can perform close to AR counterparts for general QA tasks, trained by adapting from an existing pretrained AR model.
📜 Paper: https://arxiv.org/abs/2502.13917
🤖 Demo: https://huggingface.co/spaces/hamishivi/tess-2-demo
More below ⬇️
TESS 2 is an instruction-tuned diffusion LM that can perform close to AR counterparts for general QA tasks, trained by adapting from an existing pretrained AR model.
📜 Paper: https://arxiv.org/abs/2502.13917
🤖 Demo: https://huggingface.co/spaces/hamishivi/tess-2-demo
More below ⬇️
Comments
It may be that instruction-tuning mixtures need to be adjusted for diffusion models (we just used Tulu 2/3 off the shelf).
(1) Using more diffusion steps
(2) Using reward guidance
Explained below 👇
📜 Paper: https://arxiv.org/abs/2502.13917
🧑💻 Code: https://github.com/hamishivi/tess-2
🤖 Demo: https://huggingface.co/spaces/hamishivi/tess-2-demo
🧠 Models: https://huggingface.co/collections/hamishivi/tess-2-677ea36894e38f96dfc7b590