Descriptor: A curated dataset for data-driven turbulence modelling