Describe: Optimal compressed representation of high throughput sequence data via light assembly