A curated 28S rRNA database for quick species identification
MIMt-28S is composed of sequences from two sources: sequences taken directly from the manually curated RefSeq Targeted Loci repository, and sequences extracted with RNAmmer 1.2 from fully sequenced genomes deposited in GenBank and RefSeq. For every genome, all 28S sequences were extracted, and a sequence was kept in the database only if it was not identical to another copy.
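The per-genome filtering of identical copies can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `unique_copies`, the toy sequences, and the choices to compare copies within a single genome and to ignore letter case are all assumptions made for illustration.

```python
def unique_copies(copies):
    """Keep the first occurrence of each distinct sequence, preserving order.

    `copies` is a list of (copy_id, sequence) pairs extracted from ONE genome.
    Comparing case-insensitively is an assumption, not a documented rule.
    """
    seen = set()
    kept = []
    for copy_id, seq in copies:
        key = seq.upper()
        if key not in seen:
            seen.add(key)
            kept.append((copy_id, seq))
    return kept


# Toy input: three 28S copies from one hypothetical genome, two of them identical.
genome_copies = [
    ("copy_1", "ACGTACGT"),
    ("copy_2", "ACGTACGT"),
    ("copy_3", "ACGTTCGT"),
]

kept = unique_copies(genome_copies)
kept_ids = [copy_id for copy_id, _ in kept]
print(kept_ids)
```

Only exact matches are collapsed here; copies that differ by even one base are both retained, which matches the "not identical" criterion stated above.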
For the curated version (M2c), 28S sequences from Targeted Loci were combined with sequences extracted from RefSeq genomes with RNAmmer. Sequences 100% redundant with another sequence already present in the database were recorded in the redundancy file, which gives the sequence code and all the species represented by that sequence.
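One way such a redundancy record could be built is sketched below in Python. The function `collapse_redundant`, the record layout, the placeholder codes and species labels, and the rule that the first sequence encountered becomes the representative are hypothetical; the actual format of the MIMt redundancy file is not described here.

```python
def collapse_redundant(records):
    """Collapse 100% identical sequences across records.

    `records` is an iterable of (seq_code, species, sequence) tuples.
    Returns (representatives, redundancy), where `redundancy` maps the code
    of each representative that absorbed at least one duplicate to the list
    of all species it represents.
    """
    rep_for_seq = {}      # sequence -> code of its representative
    species_by_rep = {}   # representative code -> species list (ordered)
    has_duplicates = set()
    representatives = []

    for seq_code, species, seq in records:
        rep = rep_for_seq.get(seq)
        if rep is None:
            # First time this exact sequence is seen: it stays in the database.
            rep_for_seq[seq] = seq_code
            species_by_rep[seq_code] = [species]
            representatives.append((seq_code, species, seq))
        else:
            # Exact duplicate: only note it against the representative.
            has_duplicates.add(rep)
            if species not in species_by_rep[rep]:
                species_by_rep[rep].append(species)

    redundancy = {rep: species_by_rep[rep] for rep in species_by_rep
                  if rep in has_duplicates}
    return representatives, redundancy


# Toy input with placeholder codes and species labels.
records = [
    ("S1", "Species A", "ACGTACGT"),
    ("S2", "Species B", "ACGTACGT"),
    ("S3", "Species C", "GGCCGGCC"),
]
representatives, redundancy = collapse_redundant(records)
rep_codes = [code for code, _, _ in representatives]
print(rep_codes, redundancy)
```

Each entry of `redundancy` could then be written as one line of a tab-separated file (code, then species list), though that layout is likewise an assumption.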
The full version of the MIMt-28S database is composed of all sequences contained in the curated version plus sequences from genomes of additional species available only in GenBank.
In total, MIMt-28S_M2c contains 67,217 sequences belonging to 12,425 different species.
The full version of MIMt-28S contains a total of 419,076 sequences belonging to 20,327 species.