Some of the material in is restricted to members of the community. By logging in, you may be able to gain additional access to certain collections or items. If you have questions about access or logging in, please use the form on the Contact Page.
Li, K., Lu, Y., Deng, L., Wang, L., Shi, L., & Wang, Z. (2020). Deconvolute Individual Genomes From Metagenome Sequences Through Short Read Clustering. Peerj. Retrieved from https://purl.lib.fsu.edu/diginole/FSU_libsubv1_wos_000524530100013
Metagenome assembly from short next-generation sequencing data is a challenging process due to its large scale and computational complexity. Clustering short reads by species before assembly offers a unique opportunity for parallel downstream assembly of genomes with individualized optimization. However, current read clustering methods suffer either false negative (under-clustering) or false positive (over-clustering) problems. Here we extended our previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem. Using synthetic and real-world datasets we demonstrated that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage. The improved read clustering in turn leads to improved downstream genome assembly quality.
Li, K., Lu, Y., Deng, L., Wang, L., Shi, L., & Wang, Z. (2020). Deconvolute Individual Genomes From Metagenome Sequences Through Short Read Clustering. Peerj. Retrieved from https://purl.lib.fsu.edu/diginole/FSU_libsubv1_wos_000524530100013