- Research article
- Open Access
The complete mitochondrial genome of the stomatopod crustacean Squilla mantis
BMC Genomics volume 6, Article number: 105 (2005)
Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum.
I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function.
I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness.
The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon.
The mitochondria are extranuclear organelles present in all metazoans. They contain a circular genome, usually around 16 kilobases in length, with 37 genes (13 protein-coding, two ribosomal RNA genes, and 22 transfer RNA genes). This gene content is widely conserved, but gene order and the DNA sequences of the genes themselves are variable. Because of their small size many more mitochondrial genomes than nuclear genomes have been sequenced, and comparisons among them may serve as models for the evolution of the much larger nuclear genomes . In addition, gene order rearrangements and mitochondrial gene sequences have been widely used for phylogenetic inference [2–7].
At present there are about 650 complete mitochondrial genomes available in public databases. Of these, about 75 percent are of vertebrates. By contrast only 129 complete mitochondrial genomes are available from arthropods, which are the most diverse and speciose phylum of animals. In addition, there is considerable taxonomic bias among the available arthropod sequences; 86, (67 percent) are hexapods. The subphylum Crustacea includes over 50,000 named species and is ecologically and morphologically the most diverse of the arthropod groups, and therefore of all the animals. Crustaceans occupy marine, terrestrial, and fresh water habitats from the deep sea to high mountains; range in adult size from less than one millimeter to more than four meters (leg span); and exhibit extensive variability in body plans when compared to other arthropod groups . At present there are 23 complete crustacean mitochondrial genomes available. Within the Crustacea members of the class Malacostraca, which include crabs, lobsters, and shrimp, are perhaps the most well known to non-scientists. Due to their economic importance this group is often the focus of scientific enquiry. At present there are nine complete malacostracan mitochondrial genomes available, including that of the stomatopod shrimp Squilla mantis. In this paper I describe this genome and compare it to eight other mitochondrial genomes that are available from other Malacostraca.
Mantid shrimps, or stomatopods, are benthic predators distributed in the shallow waters of tropical and subtropical seas. They are best known for their raptorial appendages – pointed or clubbed – which they use to make lightning-fast attacks that disable prey animals by spearing or blunt trauma. Large individuals with the club-type appendages have been known to shatter the sides of aquaria . Squilla mantis (Linnaeus, 1758) (Crustacea: Malacostraca: Stomatopoda), with a maximum length of around 20 cm, is distributed in shallow waters throughout the Mediterranean Sea and Eastern Atlantic . S. mantis is widely consumed by humans throughout its range; UN Food and Agriculture Administration statistics indicate that total catches in the Mediterranean are currently in excess of 6500 tonnes per annum so this species is of some commercial importance .
Results and discussion
Mitochondrial genome composition
The mitochondrial genome of Squilla mantis (GenBank accession number AY639936) is a circular molecule of 15994 nucleotides that contains the same 13 protein-coding genes, 22 transfer RNA genes (tRNA), and two ribosomal RNA genes (rRNA) found in other metazoans [12, 13]; the majority strand (i.e., the strand encoding the majority of genes) encodes nine protein-coding genes and 14 tRNAs while the minority strand encodes four protein-coding genes, eight tRNAs and both rRNA genes (Table 1). The S. mantis genome, like that reported for other arthropod genomes, is AT-rich and has an overall AT content of 70%. This frequency, as expected, varies for different regions of the genome. First and second codon positions average 62% and 63% AT, respectively, tRNA and rRNA genes average 72%, third codon positions average 79%, and putative non-coding regions reach up to 87% AT content (Table 2). There are no significant differences in AT frequency for genes encoded on the majority or minority strands. These values are within the range of 60–87% reported for other arthropods and are not unusual [14, 15].
The predicted structures of the 22 S. mantis tRNAs are shown in Figure 1. Twenty one of these genes were identified by tRNAscan-SE  and have secondary structures similar to those of other published metazoan tRNA genes. Two genes, trnS1 and trnQ, have single T-T mismatches in the acceptor stem, and one gene, trnM, has a single C-A mismatch in the stem of the TψC loop. The trnS1 gene was not identified by the tRNAscan software; rather, it was located by its conserved position in the genome. The variable loop of this gene, with nine nucleotides, is longer than the average of four or five for mitochondrial tRNA genes. This feature is characteristic of type 2 transfer RNA genes, which are uncommon in animal mitochondria but are the norm for bacterial and eukaryotic trnS genes.
The large and small subunit ribosomal RNA (rRNA) genes (rnl, rns) have an AT content of 67%, within the range reported for other arthropod ribosomal RNA genes. Alignments of these genes with other arthropod homologues (not shown) as expected show both conserved and unconserved regions that correspond with the putative stems and loops within these genes. There are thus no unusual features to report for the two rRNA genes.
All of the 13 protein-coding genes, except cox1, have putative ATR methionine or ATT isoleucine start codons. The putative first codon of the cox1 gene is ACG threonine. The lack of a standard initiation codon in cox1 genes is common in arthropod mitochondria, so S. mantis is not unusual [17, 18]. Two of the protein-coding genes, cox1 and nad6, lack a full TAA or TAG stop codon. These genes appear to terminate with a single T from which a stop codon is created by polyadenylation of the mRNA during processing. Again, this phenomenon has been observed in other arthropod mitochondrial genomes and is not unusual [17, 19, 20].
Arthropod mitochondrial genomes typically have a long region that has an AT content higher than that of mitochondrial coding regions. This AT-rich region, ranging from 263 to 4601 base pairs in length and usually located between the rns and trnI genes, is often termed the control region because it contains a number of regulatory elements including the origin of replication for the heavy strand of the mitochondrial genome [21, 22]. In some arthropods the AT-rich region is reported to have any or all of these four different motifs: tandemly repeated sequences, a long sequence of T's, a subregion of even higher AT richness, and stem-loop structures [23, 24].
In S. mantis there are two AT-rich regions, numbered 1 and 2 on Tables 2 and 3. AT-rich region 2 corresponds to the conventional arthropod region between rns and trnI; it is 862 base pairs long, well within the reported range for other arthropods, with an AT content of 77% compared to 70% for the entire S. mantis mitochondrion. However, this region has none of the four motif types that have been reported for arthropods, and I was not able to identify any putatively functional motifs.
I therefore examined the shorter AT-rich region of 230 nucleotides between the trnS2 and nad1 genes for possible functional motifs. Most arthropod mitochondrial genomes have a few short non-coding regions between some genes, usually from a few bases to 20 bases long, but longer non-coding regions, such as AT-rich region 1 in S. mantis, are rare. It therefore seemed possible that this region might have taken over some of the functions putatively assigned to the longer AT-rich region. However, AT-rich region 1, like region 2, contains none of the motifs listed above. Furthermore, the AT content of this region, at 87%, is similar to that calculated collectively for other unassigned nucleotides in the S. mantis genome (Table 2) and is consistent with the hypothesis that this region has no function.
Unusual genomic features, such as this non-coding region or gene order rearrangements, can be useful as characters for reconstructing evolutionary relationships [13, 25]. A second AT-rich region is notably absent even in Harpiosquilla harpax, which is also a member of the family Squillidae. A survey of other members of the genus Squilla for the presence of a similar region would perhaps enhance our understanding of the history of this unusual genomic feature.
Comparison with other malacostracan crustaceans
A number of features of mitochondrial genomes can be used to infer relationships among taxa. These include phylogenetic analysis using DNA and protein sequences, relative rates of sequence evolution, gene order, and the effective number of codons (Nc). I present a phylogenetic analysis and a discussion of rates of sequence evolution in arthropods (including S. mantis) elsewhere , and discuss gene order and Nc below.
Rearrangements of the mitochondrial genome are relatively rare events in evolutionary history. Such rare events can be used to infer relationships among taxa, and mitochondrial gene order rearrangements have proven useful in understanding some aspects of arthropod evolution [27–29]. Figure 2 shows the mitochondrial gene order for the nine Malacostraca for which there are complete mitochondrial genomes. Five of these genomes share the gene order considered ancestral for the Pancrustacea (Crustacea + Hexapoda) . Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas share a single translocation of trnH compared to the ancestral gene order. The mitochondrial genome of Cherax destructor is considerably rearranged and evidences at least seven translocation events compared to the ancestral pancrustacean arrangement . C. sapidus, P. trituberculatus and P. gigas are all decapods within the infraorder Brachyura (Table 1). The trnH translocation shared by these three taxa is therefore not surprising. It is possible that this character is shared among all of the Brachyura, and could therefore serve as a marker for membership in this group and might aid in rapid identification of unidentified individuals, such as larvae or processed materials in markets.
The effective number of codons used in a gene, Nc, is a statistic developed by Wright  to quantify how far codon usage in a gene departs from the equal use of all synonymous codons. The value of Nc can range from 20, the theoretical extreme in which only one codon is used for each amino acid, to 61 when the use of all synonymous codons is equally likely. This statistic, initially developed to compare codon usage between different genes in the same genome, can also be used to compare codon usage between genomes. I calculated Nc for each of the nine malacostracan mitochondrial genomes using the program CodonW . Table 4 shows Nc and GC content for majority strand, minority strand, and all protein genes in the malacostracan mitochondrial genomes. GC rather than AT content is presented to conform to the convention for these comparisons. The Nc values, which range from 38 to 53, are all below the value of 61 that indicates random codon usage, so codon usage in all nine genomes is non-random. There are no obvious similarities in the values for related taxa (i.e., the decapods), but extensive additional sampling among the Malacostraca would be necessary to confirm this observation.
In Figure 3 Nc and GC values are plotted. The distribution of the points suggests a linear relationship between Nc and GC content. A similar association of Nc and GC content was observed by Negrisolo et al. . Equations representing a least squares linear regression analysis are shown for all three data sets in Table 2. Only the regression line for the all genes data set is shown on Figure 3 to prevent clutter. These equations are not statistically probative, but the distribution of points around the line shown in Figure 3 does add to the qualitative impression of a relationship between Nc and GC content. I also calculated the coefficient of determination (R2) for each data set. The values for the majority and minority strand columns are near 0.5, suggesting that around 50% of the variation in one variable is associated with the other. That is, if GC content is taken as independent then 50% of the codon bias in the majority and minority strands is due to the influence of the bias towards low GC values. When both data sets are combined R2 rises to 0.91, suggesting that codon bias and GC content are very closely associated. This discordance between R2 for each strand separately and R2 for all protein-coding genes is puzzling and merits additional study. However, if one assumes that GC content is driven by other biochemical factors then it is clear that much, if not most, of the codon bias observed in these mitochondrial genomes is a consequence of this nucleotide bias.
This is the first formal description of the mitochondrial genome of a stomatopod crustacean. This genome maintains the same genes and gene order that are inferred as ancestral in the Pancrustacea, but does contain one unusual feature: a 230 base pair AT-rich region between the trnS2 and nad1 genes. This feature has no discernable function, but it may prove useful as a character in understanding the evolutionary history of the genus Squilla. Three other arthropod mitochondrial genomes have two large non-coding regions; the ostracod crustacean Vargula hilgendorfii, a millipede Thyropygus sp., and the tick Boophilus microplus [15, 33, 34]. Of these the latter two are clearly duplications of the orginal control region. Only V. hilgendorfii has, like S. mantis, two apparently unrelated non-contiguous AT-rich regions.
A comparison of nine malacostracan genomes, including that of S. mantis, shows that all nine exhibit the nucleotide composition bias favoring A and T nucleotides that is commonly observed for arthropod genomes, and that this bias is responsible at least in part for the observed codon usage bias in these genomes. However, there are no observable patterns of nucleotide composition bias or codon usage bias that unite particular taxa into common groups. These nine taxa represent only a small fraction of the diversity of the Malacostraca, and additional sequencing from across the diversity of this taxon would provide additional data for understanding the evolution of mitochondrial genomes of the class.
Samples and DNA extraction
A single freshly caught specimen of S. mantis was purchased from the fish market at Heraklion, Crete, Greece. Six grams of abdominal muscle were dissected from the specimen and immediately frozen at -70°C. Approximately one half gram of the frozen tissue was shaved from the specimen using a sterile razor blade and genomic DNA extracted using a QIAGEN genomic-tip 20 and the associated buffer set according to the manufacturer's protocol.
PCR, sequencing, and annotation
Short fragments (300–1000 nucleotides) of the mitochondrial genome were amplified at low stringency (50–55 degree annealing temperatures) using primers designed to work on all arthropods (Table 4). Amplification products were cloned into the T-Easy vector (Promega) and at least three clones from each PCR product were sequenced with vector-specific primers using ABI Big-Dye chemistry. Squilla-specific primers were designed and used to amplify longer fragments of 1000–4000 nucleotides that spanned the gaps between the short fragments. Longer fragments were also ligated into the T-Easy vector and at least three clones from each ligation were isolated. Each clone was sequenced using a primer-walking strategy initiated with vector-specific primers. Sequences were assembled using Sequencher v. 3.1 (GeneCodes Corp.). Protein-coding genes were identified using BLAST searches  and by comparison with other arthropod mitochondrial genome sequences. Transfer RNA genes were identified using tRNAscan-SE . Transfer RNA sequences were folded by eye, but made use of the tRNAscan-SE server output when that was available. The effective number of codons, Nc was calculated using the software package CodonW .
Protein genes: cox1, cox 2, cox 3, cytochrome oxidase subunits I, II, and III; cob, cytochrome oxidase b; atp6, atp8, ATP synthase subunits 6 and 8; nad1, nad2, nad3, nad4, nad4L, nad5, nadD6, NADH dehydrogenase subunits 1–6 and 4L. Large and small subunit ribosomal RNA genes are abbreviated rnl and rns. Transfer RNA genes are listed as trnA, trnC, etc., where the final letter is the single letter abbreviation for that amino acid. This nomenclature follows that of Lavrov et al. .
Boore J: Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura. BMC Genomics. 2004, 5: 67-10.1186/1471-2164-5-67.
García-Machado E, Pempera M, Dennebouy N, Oliva-Saurez M, Mounolou JC, Monnerot M: Mitochondrial genes collectively suggest the paraphyly of Crustacea with respect to Insecta. J Mol Evol. 1999, 49: 142–149-
Arnason U, Adegoke JA, Bodin K, Born EW, Esa YB, Gullberg A, Nilsson M, Short RV, Xu X, Janke A: Mammalian mitogenomic relationships and the root of the eutherian tree. Proc Natl Acad Sci U S A. 2002, 99: 8151-8156. 10.1073/pnas.102164299.
Nardi F, Spinsanti G, Boore JL, Carapelli A, Dallai R, Frati F: Hexapod origins: monophyletic or paraphyletic?. Science. 2003, 299: 1887-1889. 10.1126/science.1078607.
Phillips MJ, Penny D: The root of the mammalian tree inferred from whole mitochondrial genomes. Mol Phylogenet Evol. 2003, 28: 171-185. 10.1016/S1055-7903(03)00057-5.
Harrison GLA, McLenachan PA, Phillips MJ, Slack KE, Cooper A, Penny D: Four new avian mitochondrial genomes help get to basic evolutionary questions in the late Cretaceous. Mol Biol Evol. 2004, 21: 974-983. 10.1093/molbev/msh065.
Negrisolo E, Minelli A, Valle G: The mitochondrial genome of the house centipede Scutigera and the monophyly versus paraphyly of myriapods. Mol Biol Evol. 2004, 21: 770-780. 10.1093/molbev/msh078.
Martin JW, Davis GE: An Updated Classification of the Recent Crustacea. Science Series No 39. 2001, Los Angeles, Natural History Museum of Los Angeles County
Caldwell RL, Dingle H: Stomatopods. Scientific American. 1976, 234: 80–89-
Crustikon: http://www.tmu.uit.no/crustikon. [http://www.tmu.uit.no/crustikon]
Fishbase: http://fishbase.sinica.edu.tw/. [http://fishbase.sinica.edu.tw/]
Clary DO, Wolstenholme DR: The mitochondrial DNA molecule of Drosophila yakuba: nucleotide sequence, gene organization, and genetic code. J Mol Evol. 1985, 22: 252-271.
Boore JL: Animal mitochondrial genomes. Nucleic Acids Res. 1999, 27: 1767-1780. 10.1093/nar/27.8.1767.
Machida RJ, Miya MU, Nishida M, Nishida S: Complete mitochondrial DNA sequence of Tigriopius japonicus (Crustacea: Copepoda). Mar Biotechnol. 2002, 4: 406—417-10.1007/s10126-002-0033-x.
Ogoh K, Ohmiya Y: Complete mitochondrial DNA sequence of the sea-firefly, Vargula hilgendorfii (Crustacea, Ostracoda) with duplicate control regions. Gene. 2004, 327: 131-139. 10.1016/j.gene.2003.11.011.
Lowe TM, Eddy SR: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997, 25: 955-964. 10.1093/nar/25.5.955.
Stewart JB, Beckenbach AT: Phylogenetic and genomic analysis of the complete mitochondrial DNA sequence of the spotted asparagus beetle Crioceris duodecimpunctata. Mol Phylogenet Evol. 2003, 26: 513-526. 10.1016/S1055-7903(02)00421-9.
Wilson K, Cahill V, Ballment E, Benzie J: The complete sequence of the mitochondrial genome of the crustacean Penaeus monodon: are malacostracan crustaceans more closely related to insects than to branchiopods?. Mol Biol Evol. 2000, 17: 863-874.
Yamauchi M, Miya M, Nishida M: Complete mitochondrial DNA sequence of the Japanese spiny lobster, Panulirus japonicus (Crustacea: Decapoda). Gene. 2002, 295: 89-10.1016/S0378-1119(02)00824-7.
Miller AD, Nguyen TT, Burridge CP, Austin CM: Complete mitochondrial DNA sequence of the Australian freshwater crayfish, Cherax destructor (Crustacea: Decapoda: Parastacidae): a novel gene order revealed. Gene. 2004, 331: 65-72. 10.1016/j.gene.2004.01.022.
Brown WM: The mitochondrial genome of animals. Molecular Evolutionary Genetics. Edited by: RJ MI. 1985, New York, Plenum, 95–130-
Goddard JM, Wolstenholme DR: Origin and direction of replication in mitochondrial DNA molecules from the genus Drosophila. Nucleic Acids Res. 1980, 8: 741–757-
Shao R, Barker SC: The highly rearranged mitochondrial genome of the plague thrips, Thrips imaginis (Insecta: Thysanoptera): convergence of two novel gene boundaries and an extraordinary arrangement of rRNA Genes. Mol Biol Evol. 2003, 20: 362-370. 10.1093/molbev/msg045.
Zhang DX, Hewitt GM: Nuclear integrations: challenges for mitochondrial DNA markers. Trends in Ecology and Evolution. 1996, 11: 247-251. 10.1016/0169-5347(96)10031-8.
Lavrov DV, Brown WM, Boore JL: Phylogenetic position of the Pentastomida and (pan)crustacean relationships. Proc Biol Sci. 2004, 271: 537–544-
Cook CE, Yue Q, Akam M: Mitochondrial genomes suggest that hexapods and crustaceans are mutually paraphyletic. Proc Biol Sci. 2005, 272: 1295–1304-
Boore JL, Lavrov DV, Brown WM: Gene translocation links insects and crustaceans. Nature. 1998, 392: 667–668-10.1038/33577.
Boore JL, Collins TM, Stanton D, Daehler LL, Brown WM: Deducing the pattern of arthropod phylogeny from mitochondrial DNA rearrangements. Nature. 1995, 376: 163–165-10.1038/376163a0.
Rokas A, Holland PWH: Rare genomic changes as a tool for phylogenetics. TREE. 2001, 15: 454–459-
Wright F: The effective number of codons used in a gene. Gene. 1990, 87: 23–29-10.1016/0378-1119(90)90491-9.
codonw: Correspondence Analysis of Codon Usage. [http://bioweb.pasteur.fr/seqanal/interfaces/codonw.html]
Negrisolo E, Minelli A, Valle G: Extensive gene order rearrangement in the mitochondrial genome of the centipede Scutigera coleoptrata. J Mol Evol. 2004, 58: 413-423. 10.1007/s00239-003-2563-x.
Lavrov DV, Boore JL, Brown WM: Complete mtDNA sequences of two millipedes suggest a new model for mitochondrial gene rearrangements: duplication and nonrandom loss. Mol Biol Evol. 2002, 19: 163-169.
Campbell NJH, Barker SC: The novel mitochondrial gene arrangement of the cattle tick, Boophilus microplus: fivefold tandem repetition of a coding region. Mol Biol Evol. 1999, 16: 732-740.
BLAST NCBI. [http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/BLAST/]
Server RNASE. [http://www.genetics.wustl.edu/eddy/tRNAscan-SE/]
Homepage NCBIT: http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/Taxonomy/taxonomyhome.html/. [http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/Taxonomy/taxonomyhome.html/]
Folmer O, Black M, Hoeh W, Lutz R, Vrijenhoek R: DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotechnol. 1994, 3: 294-299.
Palumbi SR, Martin A, Romano S, McMillan WO, Stice L, Grabowki G: The Simple Fool's Guide to PCR. 1996, Honolulu, Kewalo Marine Laboratory and University of Hawaii
This work was funded by the UK Biotechnology and Biology Research Council. CEC is currently funded by the UK Natural Environment Research Council. Many thanks to M. Averof for purchasing an individual S. mantis in Crete and to J. Boore for primer sequences and helpful discussions regarding methodology, annotation, and data presentation.
About this article
Cite this article
Cook, C.E. The complete mitochondrial genome of the stomatopod crustacean Squilla mantis. BMC Genomics 6, 105 (2005) doi:10.1186/1471-2164-6-105
- Mitochondrial Genome
- Codon Usage
- Synonymous Codon
- Codon Usage Bias
- Complete Mitochondrial Genome