Data from Manuscripts

Bertrand, Erin M. and McCrow, John P. and Moustafa, Ahmed and Zheng, Hong and McQuaid, Jeffrey B and Delmont, Tom O. and Post, Anton F. and Sipler, Rachel E. and Spackeen, Jenna L. and Xu, Kai and Bronk, Deborah A. and Hutchins, David A. and Allen, Andrew E. 2015. Phytoplanktontextendashbacterial interactions mediate micronutrient colimitation at the coastal Antarctic sea ice edge. Proceedings of the National Academy of Sciences. 10.1073/pnas.1501615112


Assembled contigs, nucleotides and amino acids for predicted proteins, and annotation files can be found here.

Unassembled reads for the individual libraries sequenced in this study can be found here


Dupont, Chris L. and McCrow, John P. and Valas, Ruben and Moustafa, Ahmed and Walworth, Nathan and Goodenough, Ursula and Roth, Robyn and Hogle, Shane L and Bai, Jing and Johnson, Zackary I. and Mann, Elizabeth and Palenik, Brian and Barbeau, Katherine A. and Craig Venter, J. and Allen, Andrew E. 2015. Genomes and gene expression across light and productivity gradients in eastern subtropical Pacific microbial communities. ISME J. 9:1076-1092. : International Society for Microbial Ecology 10.1038/ismej.2014.198


Pelagomonas sp. CCMP1756 transcriptome and contigs, nucleotides and amino acids for predicted peptides and annotation of Sanger and 454 metagenomic data and total and polyadenylated RNA metatranscriptomic data can be found here.


Díez, B., Nylander, J.A.A., Allen, A.E., Ininbergs, K., Dupont, C.L., Yooseph, S. Rusch, D.B., Bergman, B. Metagenomic anlaysis of the Indian Ocean picocyanobacterial community: structure, potential function and evolution . PLoS One (In Review).

Genes encoding Chl-binding proteins and phycobiliproteins can be found here.


Databases and Collections

1. PhyloDB 1.075. PhyloDB is custom database suitable for comprehensive annotation of metagenomics and metatranscriptomics data. It is comprised of peptides obtained from KEGG, GenBank, JGI, ENSEMBL, and various other repositories. Version 1.075 of the database consists of 24,509,327 peptides from 19,962 viral, 230 archaeal, 4910 bacterial, and 894 eukaryotic taxa. The database also contains all data from the Marine Microbial Eukaryotic Transcriptome Sequencing Project which is represented by 8,807,335 peptides from 409 taxa.

2. Marine Microbial Eukaryotic Transcriptome Sequence Project (MMETSP).

MMETSP metadata, contgs, nucleotides and amino acids from predicted proteins and associated annotation and expression spreadsheets for each of the 410 singleton or combined assemblies can be found here.

Phylo-METAREP provides a suite of high-performance web based tools to view, query, browse and compare annotated transcriptomes in real time. Phylo-METAREP can be accessed with username and password GUEST. For Phylo-Metarep source code and installation information please visit our the Allen Lab GitHub page.

3. NOAA CalCOFI Genomics Project (NCOG)

The NOAA-California Cooperative Oceanic Fisheries Investigations (CalCOFI) Ocean Genomics (NCOG) Project employs genome-based ocean assessments to complement and augment observations currently collected in the CalCOFI, California Current Ecosystem Long Term Ecological Research (CCE-LTER), and Southern California Coastal Observing System (SCCOOS) programs.

4. Other microbial diversity data sets