In theory the object detector just looks for some pattern in an image then tries to localise it, so I guess yes? If a specific call is quite visually distinct on the spectrogram, I don't see why not. Would be cool if it would works!

Comments