Publications

Bioinformatics (Oxford, England). 2018-09-01; 34.17: 3032-3034.

GGRaSP: a R-package for selecting representative genomes using Gaussian mixture models

Clarke TH, Brinkac LM, Sutton G, Fouts DE

PMID: 29668840

Abstract

The vast number of available sequenced bacterial genomes occasionally exceeds the facilities of comparative genomic methods or is dominated by a single outbreak strain, and thus a diverse and representative subset is required. Generation of the reduced subset currently requires a priori supervised clustering and sequence-only selection of medoid genomic sequences, independent of any additional genome metrics or strain attributes.

Metrics