🚀 New #ICLR2025 Paper Alert! 🚀 Can Audio Foundation Models like Moshi and GPT-4o truly engage in natural conversations? 🗣️🔊 We benchmark their turn-taking abilities and uncover major gaps in conversational AI. 🧵👇 📜: arxiv.org/abs/2503.01174