Text this: Estimating disease prevalence in large datasets using genetic risk scores