A curated 28S rRNA database for quick species identification
MIMt-28S is composed of sequences from two different sources, sequences directly taken from the manually curated repository Targeted Loci from Refseq and sequences extracted from fully sequenced genomes deposited in Genbank and Refseq with the tool RNAmmer 1.2. For every genome, all 28S sequences were extracted and only kept in the database if they were not identical to another copy.
For the curated version (M2c), 28S sequences from Targeted Loci were joint to sequences extracted from Refseq genomes with RNAmmer software, and those sequences 100% redundant with another one present in the database was included in the redundancy file indicating the sequence code and all the species represented by that sequence.
The full version of the MIMt 28S database is composed by all sequences contained in the curated version plus sequences from genomes of new species available only at Genbank.
In total, MIMt-28S_M2c contains 85,182 sequences belonging to 13,207 different species.
Full version of MIMt-28S contains in total 580,092 sequences belonging to 25,871 species.
BioStudies accession: https://www.ebi.ac.uk/biostudies/studies/S-BSST2015. Â https://doi.org/10.6019/S-BSST2015