Text this: Automated detection of poor-quality data: case studies in healthcare