Text this: Visual behavior modelling for robotic theory of mind