The logic of this comp is weird. They have the same method applied to all 25k test sets. It’s not a typical application, where we usually know smtg about the object of inquire and could choose algorithm that fit best to our problem. They measure best universal naive method, not real problem solving.
Comments
And yes, no prior expert knowledge was taken into account.