Google study a flight recommendation task where an LLM acts as a flight booking assistant and interacts with a user over multiple rounds. To make good recommendations, the LLM needs to form and update its beliefs about the user's preferences.
Post image

Comments