Text this: A curated dataset for data-driven turbulence modelling