πŸš€ New #ICLR2025 Paper Alert! πŸš€

Can Audio Foundation Models like Moshi and GPT-4o truly engage in natural conversations? πŸ—£οΈπŸ”Š

We benchmark their turn-taking abilities and uncover major gaps in conversational AI. πŸ§΅πŸ‘‡

πŸ“œ: https://arxiv.org/abs/2503.01174
Post image

Comments