Text this: Monocular Human Depth Estimation Via Pose Estimation