Unsupervised learning with high-throughput sequencing data