Text this: Synchronized audio-visual transients drive efficient visual search for motion-in-depth.