A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images