Describir: A data driven learning approach for the assessment of data quality