some high level reflections now that I’ve sufficiently reviewed the output of Deep Research and compared it against my own desk research:

the output is too long, does too much “interpreting”, and is really weird (though not wrong exactly) about sources.
having used several of them now I just want to make sure my fellow AI skeptics/haters are aware that there are _drastic_ differences between various the models and expressions/applications of those models

Comments