Coronavirus Information for the UC San Diego Community

Our leaders are working closely with federal and state officials to ensure your ongoing safety at the university. Stay up to date with the latest developments. Learn more.

Identification of microbial dark matter in Antarctic environments

TitleIdentification of microbial dark matter in Antarctic environments
Publication TypeJournal Article
Year of Publication2018
AuthorsBowman J.S
JournalFrontiers in Microbiology
Date Published2018/12
Type of ArticleArticle
ISBN Number1664-302X
Accession NumberWOS:000453855200002
Keywords16S rRNA; antarctica; bacterial community; beneath; cryoconite; diversity; glacier; ice; microbiology; permafrost; rare; sea; sea ice; sediment; snow

Numerous studies have applied molecular techniques to understand the diversity, evolution, and ecological function of Antarctic bacteria and archaea. One common technique is sequencing of the 16S rRNA gene, which produces a nearly quantitative profile of community membership. However, the utility of this and similar approaches is limited by what is known about the evolution, physiology, and ecology of surveyed taxa. When representative genomes are available in public databases some of this information can be gleaned from genomic studies, and automated pipelines exist to carry out this task. Here the paprica metabolic inference pipeline was used to assess how well Antarctic microbial communities are represented by the available completed genomes. The NCBI's Sequence Read Archive (SRA) was searched for Antarctic datasets that used one of the Illumine platforms to sequence the 16S rRNA gene. These data were quality controlled and denoised to identify unique reads, then analyzed with paprica to determine the degree of overlap with the closest phylogenetic neighbor with a completely sequenced genome. While some unique reads had perfect mapping to 16S rRNA genes from completed genomes, the mean percent overlap for all mapped reads was 86.6%. When samples were grouped by environment, some environments appeared more or less well represented by the available genomes. For the domain Bacteria, seawater was particularly poorly represented with a mean overlap of 80.2%, while for the domain Archaea glacial ice was particularly poorly represented with an overlap of only 48.0% for a single sample. These findings suggest that a considerable effort is needed to improve the representation of Antarctic microbes in genome sequence databases.

Student Publication: 
Research Topics: