- Research article
- Open Access
Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing
BMC Genomics volume 16, Article number: 1 (2015)
The honey bee is an important model system for increasing understanding of molecular and neural mechanisms underlying social behaviors relevant to the agricultural industry and basic science. The western honey bee, Apis mellifera, has served as a model species, and its genome sequence has been published. In contrast, the genome of the Asian honey bee, Apis cerana, has not yet been sequenced. A. cerana has been raised in Asian countries for thousands of years and has brought considerable economic benefits to the apicultural industry. A cerana has divergent biological traits compared to A. mellifera and it has played a key role in maintaining biodiversity in eastern and southern Asia. Here we report the first whole genome sequence of A. cerana.
Using de novo assembly methods, we produced a 238 Mbp draft of the A. cerana genome and generated 10,651 genes. A.cerana-specific genes were analyzed to better understand the novel characteristics of this honey bee species. Seventy-two percent of the A. cerana-specific genes had more than one GO term, and 1,696 enzymes were categorized into 125 pathways. Genes involved in chemoreception and immunity were carefully identified and compared to those from other sequenced insect models. These included 10 gustatory receptors, 119 odorant receptors, 10 ionotropic receptors, and 160 immune-related genes.
This first report of the whole genome sequence of A. cerana provides resources for comparative sociogenomics, especially in the field of social insect communication. These important tools will contribute to a better understanding of the complex behaviors and natural biology of the Asian honey bee and to anticipate its future evolutionary trajectory.
The honey bee, a social insect, has received considerable attention as a model for studying neurobiology , development , social behavior  and, most recently, epigenomics . To facilitate studies of this important biological model, the genomic sequence of the western honey bee, Apis mellifera, was published in 2006, providing a wealth of information for understanding the molecular basis of social behavior and eusocial evolution .
Honey bees are classified in the order Hymenoptera, which also includes ants, bees, sawflies, and wasps . The genus Apis consists of eight Asian species and one western species . Ancient bees first emerged 120–130 million years ago (mya) coincident with the emergence of early angiosperms . The ancestors of modern bees were geographically isolated in the Middle East during the late Pleistocene approximately 1–2 mya. It was from this population that the ancestral lineage of A. mellifera spread to Africa, and the ancestral lineages of A. cerana expanded throughout Europe and Asia [7, 9, 10]. As a result of allopatric speciation, these two honey bee species may have evolved in different ecological environments, which gave rise to different behavioral and physiological characteristics observed in the present day .
Compared to A. mellifera, the Asian honey bee, A. cerana, has several distinguishing behavioral traits (Table 1). It is able to adapt to extreme weather conditions  and has long flight duration , effective grooming and hygienic behaviors , and cooperative group-level defenses . A well-known behavior of A. cerana is aggregation when a colony is exposed to dangers, such as predators or intruders. In these situations, guard bees produce alarm pheromones that dictate group behavior [13–15]. In addition, A. cerana provides considerable economic benefits to the apicultural industry through its high quality bi-products, perhaps even more so than A. mellifera.
In recent years, population decreases similar to those documented in the western honey bee have also been seen in A. cerana colonies . In Korea, more than ninety percent of A. cerana colonies were destroyed by sacbrood virus (SBV) infection . However, few studies have been conducted on the underlying molecular mechanisms and immune responses to this virus .
This study reports the A. cerana genome produced with deep sequence coverage using next-generation technologies. We generated gene sets of A. cerana using transcriptome data from seven tissues. Then, we focused on the characterization of important genes related to chemosensory reception and immunity. This genome sequence will provide invaluable information on the novel characteristics of the honey bee species indigenous to eastern and Southern Asia. The data will also provide a resource for comparative sociogenomic studies with the seven ants and western honey bee species for which genomes are already available [5, 27–30].
Results and discussion
Genomic features of A. cerana
Sequencing and assembly
We performed whole genome sequencing of the Asian honey bee using seven drones derived from a single colony. Because the honey bee has a haplodiploid mating system, males (drones) are haploid and females (workers and queens) are diploid. To minimize possible contamination from foreign genomes such as bacteria and viruses, we eliminated mid-gut tissues from the individual drone bees prior to sequencing. Genomic sequence libraries were constructed with a combination of short reads (500 bp) and two longer insert libraries (3 and 10 Kb), using Illumina sequencing technology (152-fold coverage) (Table 2). The assembly consisted of 2,430 scaffolds with a total length of 228 Mb, which covered 96% of the estimated genome size (238 Mb) . General information on the genome assembly is presented in Table 3. The N50 scaffold size was 1,421 kb (Table 3), much longer than the N50 scaffold size found in the initial and recently improved assemblies of A. mellifera (359 kb and 997 kb, Amel_4.0 and Amel_4.5, respectively; Additional file 1: Table S1) . To assess the accuracy of the scaffolds, we compared the genome of A. mellifera and A. cerana to identify genomic synteny (Additional file 1: Figure S1). Results revealed several scaffolds of A. cerana and chromosome 3 of A. mellifera that showed syntenic relationships with no large-scale rearrangement. Additionally, we found that the mitochondrial genome of A. mellifera (NCBI GQ162109, ) and one contig of A. cerana had high sequence similarity, ~99% (Additional file 1: Figure S2). This contig, covering the entire A. cerana mitochondrial genome, is 15,915 bp and includes 13 protein-coding genes (Additional file 1: Figure S3). All sequence information was submitted in the NCBI [Submission number: SUB582139].
Guanine plus cytosine (GC) content
The A. cerana assembly contains 30% GC (Table 3), similar to the mean GC content of A. mellifera (33%). In addition, six ant species (Linepithema humile, Camponotus floridanus, Pogonomyrmex barbatus, Solenopsis invicta, Atta cephalotes, and Acromyrmex echinatior) have similar GC contents ranging from 33% to 38% [5, 27–30, 34, 35]. In contrast, Drosophila melanogaster (42%), Nasonia vitripennis (42%), and Harpegnathos saltator (45%) have higher GC contents compared to A. cerana[34, 36]. According to comparative studies of two ant species, C. floridanus and H. saltator, organisms with more complex social traits may have AT-biased genomes [30, 34].
Relative AT bias correlates with DNA methylation, as DNA methyltransferases (Dmnts) are almost entirely targeted to cytosine residues followed by guanines in the 5′ to 3′ orientation (CpG dinucleotides). Methylcytosine tends to mutate to thymine (T), thus the gradational accumulation of mutation that convert CpG dinucleotides to TpG dinucleotides lead to AT-rich genomes [37–39]. In particular, normalized CpG observed/expected (CpG o/e) values have a negative relationship with levels of DNA methylation . DNA methylation is one of the major part of epigenetic regulation and has functional roles in gene expression regulation in vertebrates and insects . In contrast to vertebrate genomes, which are depleted of CpG dinucleotides , most hymenopteran insects, including A. cerana (1.61), A. mellifera (1.65), C. floridanus (1.58), H. saltator (1.49), and N. vitripennis (1.35), exhibit high levels of CpG o/e in their genomes . Another intriguing discovery is that normalized CpG o/e value within protein coding sequences of A. cerana showed bimodal distribution, similar to A. mellifera (Figure 1, Additional file 1: Figure S4) and the pea aphid Acyrthosiphon pisum[5, 42]. Interestingly, two distinct classes of genes are documented to perform different functions , which low-CpG genes are mainly involved in housekeeping function and high-CpG genes are involved in development. Indeed, we found that genes that are represented low-CpG classes are categorized with metabolic process, and transcriptional and translational regulation. In contrast, high-CpG genes represented GO categories that specific to biological functions.
The genome of A. cerana, A. mellifera, and A. pisum encode full complements of DNA methylation proteins (Dmnts) [5, 42], but, according to a recent discovery, several insects possess a full set of Dnmts without any striking depletion pattern of coding exons [27, 36, 44]. Thus, this genomic feature may not be species specific, but mechanisms of epigenetic regulation may be conserved in both honey bee species.
The A. cerana assembly comprised 6.48% (14.79 Mb) repetitive elements, consisting of 3.58% (8.16 Mb) simple repeats and 1.95% (4.44 Mb) interspersed repeats (Additional file 1: Table S2). Seventy-five A. cerana-specific repeat elements were found using the de novo repeat finding program, RepeatModeler (version 1.0.7). Since the A. cerana genome assembly contained 9.79% N’s, we assumed that repetitive sequences in the current assembly may be underestimated. In comparison to A. mellifera, only long terminal repeat elements were over-represented in A. cerana, which accounted for 0.1% (218 kb) of the genome, compared to 0.02% (49.6 kb) in A. mellifera. In contrast, long interspersed elements and short interspersed elements were not detected in the genome of A. cerana. Both were found in the A. mellifera genome at frequencies of 0.04% (83.1 kb) and 0.03% (70 kb), respectively. DNA transposons constitute 0.11% (247 kb) of the A. cerana genome and 0.57% (1.34 Mb) of the A. mellifera genome. Mariner transposable elements, first discovered in the fruit fly, have been found across honey bee species [45, 46]. The genome of the western honey bee, A. mellifera, contained multiple copies of mariner transposons, ranging from AmMar1 to AmMar6. In contrast, the A. cerana genome contained orthologs of AmMar1 and AmMar3-6, but AmMar2 ortholog was not found. This is consistent with the speculation that AmMar1 and AmMar2 were transferred to the A. mellifera genome relatively recently [45, 46].
Although genome-wide repeat analyses need to be investigated further, the results of this study showed a striking reduction of transposable elements (TEs) and retrotransposons in the A. cerana genome compared to A. mellifera. Lack of TEs is one of the major features of the honey bee genome, compared to other sequenced Hymenoptera [30, 32]. Some evidence suggests that grooming and hygienic behaviors in eusocial organisms reduced insertion of TEs from foreign genomes . However, both social and non-social hymenopteran insect genomes, including seven ants and the parasitoid Nasonia, have been sequenced, and they include significantly different quantities of repetitive elements comprising 11% to 28% of genomes [30, 36]. Therefore, hygienic behavior is not the only factor influencing the accumulation of repetitive elements in genomes.
Analysis of the A. ceranagene set
Due to limited data on expressed sequence tags (ESTs) and complementary DNAs (cDNAs) available for A. cerana, we established a gene annotation pipeline using multiple evidence data (Table 4). First, we generated 213,327 transcripts covering 515,809,639 bp using a de novo assembly from 68 Gb of A. cerana RNA-seq reads. Second, RNA-seq data was aligned to scaffold sequences, which resulted in 31,027 gene models representing 96,495,948 bp. Third, we performed computational gene prediction based on the scaffold sequence information, which generated 24,579 genes covering 18,397,306 bp. We also employed A. mellifera gene sequences collected from the National Center for Biotechnology Information Reference Sequence Database (NCBI RefSeq, ) as a model to obtain homology-based gene annotation. Subsequently, we merged all predicted gene models using the MAKER  program to generate a primary gene set. All genes were queried with the NCBI non-redundant database using BLASTX. Lastly, we manually checked for missing genes, partial genes, or separated genes. Chemoreceptor genes, including gustatory receptors (Grs), odorant receptors (Ors), and ionotropic receptors (Irs), were investigated more carefully using analyses of functional sequence domains. Finally, 10,651 genes were annotated as the official gene set (OGS) of A. cerana, OGS version 1.0 (Table 4), of which approximately 84% of genes were annotated with NCBI non-redundant database and 70% were annotated in the Uniprot database . Overall, the total number of genes in the A. cerana OGS v1.0 was comparable to the number in the A. mellifera OGS v1.0 (10,157 genes). However, the number is less than the current release of the A. mellifera genome, OGS v3.2 (15,314 genes; Table 5) [5, 32].
We classified genes by function using the gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases [50, 51]; 6,338 genes (60%) had more than one GO term and 1,696 enzymes were categorized into 125 pathways (Additional files 2 and 3). Here, several interesting molecular pathways that could represent honey bee-specific molecular mechanisms were revealed. For example, the fatty acid biosynthesis, glutathione metabolism, and cytochrome P450 pathways may be involved in nest-mate recognition and detoxification of pesticides (Additional file 1: Figure S5) [52, 53]. The surface of the honey bee is composed of fatty acids and hydrocarbons, which reflect identity, and guard bees recognize these compounds to discriminate colony members from intruders . KEGG analyses revealed that classes of enzymes involved in fatty acid biosynthesis are shared between A. cerana and A. mellifera, and A. cerana has fewer detoxification enzymes compared to fly and mosquito but similar numbers to A. mellifera. The contribution of pesticides to global colony losses of A. mellifera is still a controversial issue, but data indicate that A. mellifera is unusually sensitive to various insecticides [54, 55]. Interestingly, colonies of A. cerana have not shown similar levels of collapse to A. mellifera, but this could be explained by other differences that may reduce exposure to pesticides, such as frequent absconding behavior, small nest architecture, and foraging in high altitude regions [7, 56].
Genes unique to A. ceranaand orthologous to honey bee
To investigate whether non-orthologous genes are associated with features of A. cerana biology, we compared three hymenopteran insects, Apis mellifera (social), Apis cerana (social), Nasonia vitripennis (non-social), and one dipteran insect, Drosophila melanogaster (non-social) by orthology based clustering. Amongst 2,182 unique genes in A. cerana (Figure 2), most of the significantly enriched GO-terms were involved in neuromuscular junction, neuromuscular process, regulation of muscle organ development, muscle cell differentiation, and muscle tissue development (p < 0.05, Additional file 4). A. cerana has a higher wing beat frequency (306 beats/s) compared to A. mellifera (235 beats/s) and quick, impetuous, and unpredictable flight patterns, therefore some of the enriched proteins involved in muscle movement might account for A. cerana-specific flight patterns . Future studies should be performed to dissect this relationship.
Notably, neural signaling-related GO-terms, including neuron recognition, signaling receptor activity, transmembrane receptor signaling pathway, ionotropic glutamate receptor signaling pathway, and active transmembrane transporter activity, which are closely related with chemosensory reception and chemical signaling, were also enriched (p < 0.05) in the A. cerana unique gene set (Additional file 4). Genes involved in chemical signaling have evolved rapidly, especially in eusocial organisms [57, 58]. Neural signaling processes play major roles mediating social communication in honey bee society. A. cerana shows a number of group-level behaviors distinct from A. mellifera, including a unique defense behaviors against a hornets. A. cerana guard bees raise their abdomens and shake or flutter, producing alarm pheromones, when hornets approach the hive [7, 56]. Additional research is required to determine whether molecular regulation mechanisms found uniquely in A. cerana may be responsible for these unique social defense behaviors.
Since A. mellifera and A. cerana diverged recently, we hypothesized there would be protein orthologs conserved in both honey bee species that explain shared honey bee characteristics. A total of 1,061 A. cerana proteins were identified with orthologs in A. mellifera but no other non-social species. These orthologs were categorized with GO-terms “sensory perception of smell” (p < 1.75E−04) and “sensory perception of chemical stimulus” (p < 7.55E−04), which are crucial features for social communication and physical interaction . In addition, the GO-term “carbohydrate transporter activity” (p < 1.87E−02), which describes cuticular hydrocarbon detection , and “regulation of short-term neuronal synaptic plasticity” (p < 2.21E−02) and “transmembrane signaling receptor activity” (p < 3.04E−02), which are involved in neuronal signaling during social interaction, were also enriched in orthologs shared by the two honey bee species (Additional file 1: Table S3).
Chemoreceptor gene family
Chemoreceptors play important roles in communication and social behaviors, in part by mediating detection of chemical signals from nest-mates . Major groups of chemoreceptor genes include gustatory receptors (Grs), odorant receptors (Ors), and ionotropic receptors (Irs) [59–63]. In social insects, such as ants and honey bees, chemical communication is crucial for colony maintenance and cooperation [64–66]. Here, we have characterized 10 new Grs, 119 new Ors, and 10 new Irs in the A. cerana genome. Gene expression patterns, examined using RNA-seq data, revealed that annotated chemoreceptor genes were well organized and comparable to those of A. mellifera and N. vitripennis, although they were slightly underrepresented compared to the A. mellifera genome .
The gustatory receptor family
The gustatory receptor family plays an important role in taste and is used to collect nectar and pollen for energy and brood care . In honey bee society, colony members have division labor and perform different tasks. Nurse bees take care of the brood and the queen, and they clean inside the nest. Forager bees collect food or resin from outside and bring it to the hive . Peripheral and internal regulation of Gr gene expression is involved in this behavioral transition .
According to Robertson and Wanner , the western honey bee, A. mellifera has 13 Grs (H. M. Robertson, personal communication), a small number compared to the fruit fly D. melanogaster (68 Grs, ), the mosquito Aedes aegypti (79 Grs, ), the parasitoid wasp N. vitripennis (58 Grs, ), and the ant Linepithema humile (116 Grs, ). Similar to A. mellifera, 10 Gr genes were identified in the A. cerana genome. They were named based on their orthology to A. mellifera Grs (AmGrs). All identified Grs in A. cerana showed simple orthologous relationships with Grs in A. mellifera, and AcGr1, 2, 3, 6, 7, 9, and 10 also had orthologs in N. vitripennis (Additional file 1: Figure S6). These data indicated that Gr genes are highly conserved among hymenopteran species. Similar to the A. mellifera Gr repertoire, AcGr1 and AcGr2 were positioned in expanded lineages to sugar receptors of D. melanogaster, including DmGr5a, DmGr61a and DmGr64a/f (Figure 3A) [71–73]. In addition, AcGr3 shared a clade with DmGr43a, which functions as a fructose receptor in the periphery and a nutrient sensor in the brain of Drosophila (Figure 3A) . In contrast, AcGr6, 7, 9, 10, and X lineages showed no apparent relationships with DmGrs, implying that they may be unique to the honey bee. Bitter taste receptors also seem to be lost in the A. cerana genome, which may be related to the evolution of flower preference in the honey bee  compared to other social insects such as the ant in which bitter receptors are preserved [27, 28]. Additionally, orthologs to Drosophila carbon dioxide (CO2) receptors, Gr21a and Gr63a[28, 75], were not present in the A. cerana genome, similar to A. mellifera. However, honey bees are known to detect CO2, indicating that they may have evolved novel molecular mechanisms similar to the acid sensing mechanism found in Drosophila for detecting high CO2 concentrations . Partial sequences of A. cerana Gr4 and Gr5 orthologs were located using TBLASTN searches. A Gr8 ortholog could not be found in the A. cerana genome.
Expression patterns of Gr orthologs in A. cerana and A. mellifera were determined by relative gene expression analyses (Figure 3B). Surprisingly, expression patterns of Gr orthologs between the two honey bee species were distinct. Candidate sugar receptors, Gr1 and Gr2, were expressed higher in A. cerana compared to A. mellifera (Figure 3B), suggesting that A. cerana may have a greater ability to sense sugars. Similarly, Gr5 and Gr7 were highly expressed in A. cerana compared to A. mellifera. In contrast, Gr3, 6, 9, and 10 were more highly expressed in A. mellifera compared to A. cerana. Gr4 and GrX were not detected in antennal transcriptome of A. cerana (data not shown), implying that Gr4 and GrX might be expressed at undetectable levels or in other tissues, such as the tongue or legs. Future functional studies on Grs may reveal taste sensing and internal regulation differences between the species.
The odorant receptor family
Insect odorant receptors play important roles in environmental signal recognition and inter- and intra-species communication . Honey bees use odorant receptors in contexts, including kin recognition, food navigation, and pheromone detection [59, 65, 78]. However, despite the importance of odorant receptors, the functional identification of Ors in honey bees is lacking compared to other model insects, including fly and mosquito species .
In the A. cerana genome, 119 AcOrs, including a few 5′- or 3′- partial sequences containing the odorant receptor domain, were identified. We named A. cerana Ors by sequence positions in scaffolds. Most AcOrs were not spread evenly across scaffolds, but were clustered at a few locations in the genome. For example, clusters of 37 Ors, 15 Ors, and 17 Ors were arrayed on scaffold 3, 103, and 139, respectively (Additional file 1: Figure S7). In A. mellifera, the largest tandem array of 60 Ors was found on chromosome 2 . This expansion of Ors implies unequal crossing-over by neighboring genes occurred. The large number of Or paralogs indicate diverse roles for odorant recognition in honey bee society, such as pheromone blends, cuticular hydrocarbons, and floral odor cocktails . Since A. mellifera and A. cerana diverged recently , it was hypothesized that there may be synteny between Or clusters. Regions of A. mellifera chromosome 2 with microsynteny conservation were identified by comparing the Or gene arrangement in the A. cerana genome to the A. mellifera genome. Consistent with the hypothesis, conserved microsynteny and clear orthologs of A. cerana Ors to A. mellifera Ors were found (Figure 4C, Additional file 5), suggesting that honey bee Or paralogs are clustered in conserved genomic regions [80, 81].
Insects have a number of variable Ors, which form a chaperone with olfactory receptor co-receptor (Orco) in vivo[82–85]. In the present study, A. cerana Or5 shared orthology with insect Orcos including D. melanogaster Or83b, N. vitripennis Or1, and A. mellifera Or2 (Figure 4A). Overall, identified AcOrs showed simple orthologous relationships with AmOrs, such as 1:1, 1:2, and 1:3 (AcOrs : AmOrs).
Among 177 A. mellifera Ors, AmOr11 was functionally characterized as a queen pheromone receptor responding to 9-oxo-2-decenoic acid (9-ODA) . In our study, AcOr30 showed 1:1 orthology to AmOr11 with 98.7% identity (Additional file 1: Figure S7), implying that queen pheromone components may be conserved between A. mellifera and A. cerana.
Transcriptome data revealed that Or homologs are differentially expressed between A. cerana and A. mellifera (Figure 4B). Forty-four Or homologs were more highly expressed in A. mellifera, and 56 Or homologs were more highly expressed in A. cerana. Different expression patterns support the idea that coding sequences are well conserved among Or homologs, but their promoter sequences have diverse regulatory motifs. These data imply that the two honey bee species express different odorant spectra. Specifically, seven AcOrs (AcOr21, 38, 40, 45, 56, 58, and 116) were expressed only in A. cerana, indicating functions specific to A. cerana. Functional studies using heterologous expression systems are needed to better understand the various functions of Ors in honey bees.
The ionotropic receptor family
Recently, a new family of chemosensory receptors, the ionotropic receptor (Ir) family, was identified in D. melanogaster. Irs in D. melanogaster constitute distinct and divergent subfamilies of ionotropic glutamate receptors (iGluRs) . Sixty-six Ir homologs have been identified in D. melanogaster, and 16 were expressed specifically in antennae [61, 63]. This suggested that Irs belong to two subgroups: conserved antennal Irs and species-specific divergent Irs. These subgroups represent classes of Ors and Grs, respectively . In contrast to Ors, which respond broadly to alcohols, ketones, and esters, Irs primarily respond to acids, amines, and carbon dioxide, which can be physiologically important in many insect species [77, 86–88]. Although the functions of these receptors are not yet known, Irs may have more general function in the detection of environmental chemicals including odorants and tastes .
The number of identified Irs in insects is increasing [63, 90, 91], and a large complement of Irs has been described in the complete genomes of four hymenopteran species: A. mellifera (10 Irs), N. vitripennis (10 Irs), L. humile (32 Irs), and P. barbatus (24 Irs) [5, 27, 28, 36]. In this study, 10 Ir homologs were found in the A. cerana genome (Figure 5A). Sequence comparison and phylogenetic analyses of Irs with D. melanogaster and A. mellifera identified putative orthologs of conserved Irs in the A. cerana genome: Ir8a, Ir25a, Ir68a, Ir75a, Ir76a, and Ir93a. As expected, highly conserved orthologs of antennal Irs were identified in the A. cerana genome. These results support the hypothesis that antennal expression of Ir orthologs has been conserved for over 350 mya since dipteran and hymenopteran insects diverged . Other Irs in A. cerana with low similarity to orthologs of other insect receptors appear to be honey bee-specific. These Irs may be used for species-specific recognition, including candidates for cuticular hydrocarbon receptors and brood pheromone receptors. However, expression patterns for the vast majority of Irs are unknown and no ligands for honey bee Irs have been identified. In this study, AcIr expression profiles were different in A. mellifera and A. cerana (Figure 5B). Their functions and evolutionary basis for diversity remain to be investigated.
Honey bees are invaluable models for studying social defense dynamics and individual molecular and behavioral defense mechanisms . In contrast to A. mellifera, A. cerana is not susceptible to the ectoparasitic mite, Varroa destructor, one of the major vectors of bee pathogens. In contrast, A. cerana has suffered greatly from viral and bacterial diseases in recent years . A recent report indicated that more than 90% of Asian honey bee colonies collapsed due to sacbrood virus (SBV) infection in Korea . Many Asian countries also suffered declines in A. cerana colonies for several reasons. However, the molecular defense mechanisms of A. cerana are still unknown. Therefore, we investigated immune genes present in the A. cerana genome by comparing genomic information to other sequenced insect genomes .
Using multiple TBLASTN searches, 160 immune gene orthologs were identified in A. cerana and 11 additional genes were detected by manual annotation. All major pathways were identified in A. cerana, including components of the Toll, Imd, Jak/Stat, and JNK pathways . Notably, the FADD, Dredd, and Kenny, components of the Imd pathway and Pelle of the Toll pathway were not detected in the A. cerana genome (Figure 6). The total number of innate immune genes in A. cerana is similar to other social Hymenoptera  (Additional file 1: Table S4), and most immune genes in A. cerana shared higher sequence similarity with A. mellifera compared to other sequenced insect species. This may be explained by conservation of the innate immune system between A. cerana and A. mellifera. Eusocial insects have additional social immune systems, such as cleaning behaviors (hygienic behavior, grooming, and undertaking), thermal defenses (A. mellifera lacks this behavior), and antibiotic nest architecture (resin collection), which may contribute to reducing exposure to pathogens .
Previous studies indicate more antimicrobial proteins are encoded in the A. cerana genome compared to A. mellifera. Indeed, defensive peptides, including venomous peptides, in A. cerana are more strongly expressed than those in A. mellifera. In addition, some reports show the uniqueness of strong behavioral defenses of A. cerana, such as hygienic and grooming behaviors [11, 12, 94]. Together, these data indicate that A. cerana, through a combination of elaborate molecular and behavioral mechanisms, may have a more effective social defense system compared to A. mellifera. Functional studies of immune genes will inform knowledge of A. cerana-specific disease control methods and provide a valuable model for comparative studies of social insect immune systems.
The Asian honey bee, A. cerana, a close relative of the western honey bee, A. mellifera, provides considerable economic benefits to the apicultural industry and serves as a new model for biodiversity in eastern and southern Asia. Despite similarities in Asian and western honey bees, differences in biological traits led us to explore the genome and transcriptome of A. cerana. This is the first report to elucidate the whole genome sequence of the Asian honey bee, A. cerana, by employing de novo assembly of the A. cerana genome and computational gene prediction followed by manual annotation. A. cerana has unique features for muscle movement and neural signaling, reflecting the wild nature of A. cerana. In addition, chemosensory receptors and immune-related genes, which might be responsible for sophisticated and well-organized life styles in honey bees, were identified. Comparative studies of chemoreceptor gene families showed similarities in receptor family expansions relative to A. mellifera. However, significant differences in gene expression were also identified, potentially reflecting different capabilities for odor perception. In addition, immune-related genes in A. cerana exhibited expression patterns that may reflect an advanced social immune system, advancing understanding of the molecular basis of social immunity. This genome analysis will provide invaluable information on the novel characteristics of A. cerana, and contribute to the understanding of comparative sociogenomics with other sequenced social insect genomes.
Sample collection and DNA and RNA extraction
Adult drone bees of A. cerana (n = 4) were collected from a single colony at the College of Agriculture and Life Sciences, Seoul National University (SNU), Seoul, Korea during summer 2012. Mid-gut tissues were removed to reduce contamination with genomes of intestinal microorganisms. Genomic DNA was extracted from individual drone bees using the Wizard Genomic DNA Purification kit (Promega, MI, US) according to the manufacturer’s protocol. For RNA sequencing, A. mellifera and A. cerana worker bees were collected from 3 different colonies at SNU and stored immediately in liquid nitrogen. After flash freezing, samples were stored at −80°C until dissection. Tissues from A. cerana (brain, antenna, hypopharyngeal gland, gut, fat body, and venom gland, and larvae) and from A. mellifera (antenna) were dissected in cold RNase free PBS (pH = 7.4). Total RNA was isolated using a QIAGEN RNeasy Mini kit (Qiagen, CA, US) according to the manufacturer’s protocol.
DNA and cDNA library construction
Genomic DNA for shotgun sequencing was sheared with average fragment sizes of 500 bp, 3 kb, and 10 kb. Three DNA libraries were constructed using the Illumina sequencing platform according to the manufacturer’s protocol (Illumina CA, US). Briefly, 5, 10, and 20 μg of genomic DNA were used for whole genome random shearing to construct 500 bp, 3 kb, and 10 kb libraries, respectively. Fragments were ligated with 3′ and 5′ adapters to produce libraries for PCR amplification.
For mRNA sequencing, cDNA libraries were prepared for each tissue using the Illumina mRNA sequencing kit (Illumina, CA, US) and the Clontech SMART cDNA Library Construction Kit (Invitrogen). Libraries were sequenced using the Illumina HiSeq2000 and Roche 454 Genome Sequencer FLX instrument (Roche, UK).
De novogenome assembly and sequence analyses
Raw sequencing reads with phred scores ≤ 20 were filtered out using the CLC_quality_trim (CLC 3.22.55705). Duplicate sequences were removed with the remove_duplicate program (CLC-bio) using the default options. After filtration, genome libraries with inserts of 500 bp, 3 kb, and 10 kb were assembled using the AllPaths-LG (version 42411, ) algorithm with default parameters. The A. cerana genome sequence is available from the NCBI with project accession PRJNA235974. Repeat elements in the A. cerana genome were identified using RepeatModeler (version 1.0.7, ) with default options. Subsequently, RepeatMasker (version 4.03, ) was used to screen DNA sequences against RepBase (update 20130422, ), the repeat database, and mask all regions that matched known repetitive elements. Comparison of experimental mitochondrial DNA to published mitochondrial DNA (NCBI accession GQ162109) was performed using the CGView Server with the default options . The percent identity shared between the A. cerana mitochondrial genome assembly and NCBI GQ162109 was determined by BLAST2 . To examine the distribution of observed to expected (o/e) CpG ratios in protein coding sequences of A. cerana, we used in-house perl scripts to calculate normalized CpG o/e values . Normalized CpG was calculated using the formula:
where freq(CpG) is the frequency of CpG, freq(C) is the frequency of C and freq(G) is the frequency of G observed in a CDS sequence.
Evidence-based gene model prediction
Assembly of RNAseq data was performed using de novo assembler software, Trinity (version r2013-02-25, ). Alignment of RNAseq reads against genome assemblies was performed using Tophat and transcript assemblies were determined using Cufflinks (version 2.1.1, ). Gene set predictions were generated using GeneMark.hmm (version 2.5f, ). Homolog alignments were made using NCBI RefSeq and A. mellifera as a reference gene set (Amel_4.5). A final gene set was created synthetically by integrating evidence-based data using the gene modeling program, MAKER (version 2.26-beta), including the exonerate pipeline with default options [48, 104]. Subsequently, we performed blast searches with the NCBI non-redundant dataset to annotate combined gene models. All gene predictions were provided as input to the Apollo genome annotation editor (version 1.9.3, ), and genes included in phylogenetic analyses were manually checked against transcript information generated by Cufflinks to correct for 1) missing genes, 2) partial genes, and 3) separated genes.
Gene orthology and ontology analysis
The protein sets of four insect species were obtained from A. cerana OGS v1.0, A. mellifera OGS v3.2 , N. vitripennis OGS v1.2 , and D. melanogaster r5.54 . We used OrthoMCL v 2.0  to perform ortholog analysis with default parameter for all steps in the program. GO annotation proceeded in Blast2GO (version 2.7) with default Blast2GO parameters. Enrichment analysis for statistical significance of GO annotation between two groups of annotated sequences was performed using Fisher’s Exact Test with default parameters.
Gene family identification and phylogenetic analysis
Total 10,651 sequences of OGS v1.0 were classified with Gene Ontology (GO) and KEGG database using blast2GO (version 2.7) with MySQL DBMS (version 5.0.77). To search the sequence of A. cerana odorant receptors (Ors), gustatory receptors (Grs), and ionotropic receptors (Irs), we prepared three sets of query protein sequences: 1) first set includes Or and Gr protein sequences from A. mellifera (provided by Dr. Robertson H. M. at the University of Illinois, USA), 2) second set includes Or, Gr, and Ir protein sequences of previously known insects from NCBI Refseq , 3) third set includes functional domain of chemoreceptor from Pfam (PF02949, PF08395, PF00600) . The TBLASTN of these three sets of receptor proteins was performed against A. cerana genome. Candidate chemoreceptor sequences from the results of TBLASTN were compared with ab initio gene predictions (see Gene annotation section) and verified its functional domain using the MOTIF search program . Annotated Or, Gr, and Ir proteins were aligned with ClustalX  to corresponding proteins of A. mellifera and were manually corrected. Alignments were performed iteratively and each sequence was refined based on alignments to make complete Or, Gr, and Ir sequences for A. cerana. Sequences were aligned with ClustalX , and a tree was built with MEGA5  using the maximum likelihood method. Bootstrap analysis was performed using 1000 replicates.
Tissue expression analysis
To compare tissue transcriptomes of A. cerana and A. mellifera, RNAseq reads from antenna tissue of each species were mapped against annotated Gr, Or, and Ir gene sequences using RSEM . TMM (Trimmed Mean of M-values)-normalized FPKM (fragments per kilobase per million) expression values were calculated by running perl scripts included in Trinity assembler (version r2013-02-25) and were used to draw a heat map .
Analysis of microsynteny between AmOrs and AcOrs
Microsynteny analyses of orthologous Or genes between A. cerana and A. mellifera were carried out based on a phylogenetic tree. In addition, we performed reciprocal BLASTZ searches  between two species in syntenic regions and identified clear orthologs based on E-values (cut-off: 1e−40) and phylogenetic analyses.
Immune related gene family
Holometabola immune gene orthologs obtained from Immunodb  were used as queries against the A. cerana genome using TBLASTN. Genomic sequences encoding immune gene candidates were compared with ab initio gene predictions to confirm completeness of the gene models.
Availability of supporting data
The data sets supporting the results of this article are available through a BioProject (accession number PRJNA235974) at the NCBI.
The Illumina transcript data generated for A. cerana are available in the NCBI SRA  with accessions SRR1380970 (brain), SRR1380976 (antenna), SRR1380979 (hypopharyngeal gland), SRR1380984 (gut), SRR1388774 (fat body), SRR1406762 (venom gland), and the accession for 454 transcript data is SRR1408614 (head). The Illumina raw data from A. mellifera are available at the NCBI SRR1386316 (brain), SRR1407793 (hypopharyngeal gland), and SRR1408090 (antenna). Whole genome sequences of A. cerana are submitted in the NCBI Whole Genome Shotgun database (SUB582139). The genome and gene information is also available at A. cerana database (http://mnbldb.snu.ac.kr/genome_seq.php).
Million years ago
Mate paired end
Guanine plus cytosine
Cytosine followed by guanine in 5' to 3' orientation
Thymine followed by guanine in 5' to 3' orientation
DNA methylation protein
Expressed sequence tag
National Center for Biotechnology Information
Official gene set
Kyoto Encyclopedia of Genes and Genomes
Ionotropic glutamate receptors
Galizia CG, Eisenhardt D, Giurfa M, Menzel R: Honeybee Neurobiology and Behavior : A Tribute to Randolf Menzel. Dordrecht Netherlands. New York: Springer; 2012.
Begna DHB, Feng M, Li J: Differential expression of nuclear proteomes between honeybee (Apis melliferaL.) queen and worker larvae: a deep insight into caste pathway decisions.J Proteome Res 2012, 11:1317–1329. 10.1021/pr200974a
Zayed ARG: Understanding the relationship between brain gene expression and social behavior: lessons from the honey Bee.Annu Rev Genet 2012, 46:591–615. 10.1146/annurev-genet-110711-155517
Foret S, Kucharski R, Pellegrini M, Feng S, Jacobsen SE, Robinson GE, Maleszka R: DNA methylation dynamics, metabolic fluxes, gene splicing, and alternative phenotypes in honey bees.Proc Natl Acad Sci U S A 2012, 109:4968–4973. 10.1073/pnas.1202392109
Honeybee: Insights into social insects from the genome of the honeybee Apis mellifera.Nature 2006, 443:931–949. 10.1038/nature05260
Monica C, Munoz-Torres JTR, Christopher P, Childers Anna K, Bennett Jaideep P, Sundaram Kevin L, Childs Juan M, Anzola Natalia M, Elsik CG: Hymenoptera genome database: integrated community resources for insect species of the order hymenoptera.Nucleic Acids Res 2011, 39:D658-D662. 10.1093/nar/gkq1145
Oldroyd BP, Wongsiri S: Asian Honey Bees : Biology, Conservation, and Human Interactions. Cambridge, Mass: Harvard University Press; 2006.
Engel MS: A monography of the Baltic amber bees and evolution of the Apoidea (Hymenoptera).Bull Am Mus Nat Hist 2001, 259:5–192.
Arias MC, Sheppard WS: Molecular phylogenetics of honey bee subspecies (Apis melliferaL.) inferred from mitochondrial DNA sequence.Mol Phylogenet Evol 1996, 5:557–566. 10.1006/mpev.1996.0050
Arias MC, Sheppard WS: Phylogenetic relationships of honey bees (Hymenoptera:Apinae:Apini) inferred from nuclear and mitochondrial DNA sequence data.Mol Phylogenet Evol 2005, 37:25–35. 10.1016/j.ympev.2005.02.017
Xu P, Shi M, Chen XX: Antimicrobial peptide evolution in the Asiatic honey beeApis cerana.Plos one 2009, 4:e4239. 10.1371/journal.pone.0004239
YS Peng YF, Xu S, Ge S: The resistance mechanism of the Asian honey Bee,apis ceranafabr., to an ectoparasitic mite,varroa jacobsonioudemans.J Invertebr Pathol 1987, 49:54–60. 10.1016/0022-2011(87)90125-X
Ono M, Okada I, Sasaki M: Heat protection by balling in the japanese honeybeeApis cerana japonicaas a defensive behavior against the hornet,Vespa simillima xanthoptera(Hymenoptera: Vespidae).Experientia 1987, 43:1031–1032. 10.1007/BF01952231
Morse RA, Shearer DA, Bosh SR, Benton AW: Observation on alarm substances in the genusApis.J Api Res 1967, 6:113–118.
Ono M, Garashi T, Ohno E, Sasaki M: Unusual thermal defence by a honeybee against mass attack by hornets.Nature 1995, 377:334–336. 10.1038/377334a0
Wang Zi L, Liu TT, Huang Zachary Y, Wu Xiao B, Wei Yu Y, Zhi Jiang Z: Transcriptome analysis of the Asian honey Beeapis cerana cerana.Plos one 2012, 7:e47954. 10.1371/journal.pone.0047954
Dyer FC, Seeley TD: Interspecific comparisons of endothermy in honeybees (Apis): deviations from the expected size-related patterns.J Exp Biol 1987, 122:1–26.
Seeley TD, Seeley RH: Colony defense strategies of the honeybee in Thailand.Ecol Monogr 1982, 52:43–63. 10.2307/2937344
Seeley TD, Morse RA: Dispersal behavior of honey Bee swarms.Psyche 1977, 84:199–209. 10.1155/1977/37918
Hadisoesilo S, Otis G: Differences in drone cappings ofApis ceranaandApis nigrocincta.J Apic Res 1998, 37:11–15.
Atwal AS, Dhaliwal GS: Some behavioural characteristics ofApis indicaF andApis melliferaL.Indian Bee J 1969, 31:83–90.
Koetz AH: Ecology, behaviour and control ofapis ceranawith a focus on relevance to the Australian incursion.Insects 2013, 4:558–592. 10.3390/insects4040558
Li JKFM, Zhang L, Zhang ZH, Pan YH: Proteomics analysis of major royal jelly protein changes under different storage conditions.J Proteome Res 2008, 7:3339–3353. 10.1021/pr8002276
Miyamoto S: Biological studies on Japanese Bees. X Defferences in flower relationship between a Japanese and a European honeybee.Sci Rep Hyogo Univ Agric Ser agric Biol 1958, 3:99–101.
Hongxia Ai XY, Richou H: Occurrence and prevalence of seven bee viruses inApis melliferaandApis ceranaapiaries in China.J Invertebr Pathol 2012, 109:160–164. 10.1016/j.jip.2011.10.006
Choe SE, Nguyen TT, Hyun BH, Noh JH, Lee HS, Lee CH, Kang SW: Genetic and phylogenetic analysis of South Korean sacbrood virus isolates from infected honey bees (Apis cerana).Vet Microbiol 2012, 157:32–40. 10.1016/j.vetmic.2011.12.007
Smith CD, Zimin A, Holt C, Abouheif E, Benton R, Cash E, Croset V, Currie CR, Elhaik E, Elsik CG, Fave MJ, Fernandes V, Gadau J, Gibson JD, Graur D, Grubbs KJ, Hagen DE, Helmkampf M, Holley JA, Hu H, Viniegra AS, Johnson BR, Johnson RM, Khila A, Kim JW, Laird J, Mathis KA, Moeller JA, Muñoz-Torres MC, Murphy MC, et al.: Draft genome of the globally widespread and invasive Argentine ant (Linepithema humile).Proc Natl Acad Sci U S A 2011, 108:5673–5678. 10.1073/pnas.1008617108
Smith CR, Smith CD, Robertson HM, Helmkampf M, Zimin A, Yandell M, Holt C, Hu H, Abouheif E, Benton R, Cash E, Croset V, Currie CR, Elhaik E, Elsik CG, Favé MJ, Fernandes V, Gibson JD, Graur D, Gronenberg W, Grubbs KJ, Hagen DE, Viniegra AS, Johnson BR, Johnson RM, Khila A, Kim JW, Mathis KA, Munoz-Torres MC, Murphy MC, et al.: Draft genome of the red harvester antPogonomyrmex barbatus.Proc Natl Acad Sci U S A 2011, 108:5667–5672. 10.1073/pnas.1007901108
Wurm Y, Wang J, Riba-Grognuz O, Corona M, Nygaard S, Hunt BG, Ingram KK, Falquet L, Nipitwattanaphon M, Gotzek D, Dijkstra MB, Oettler J, Comtesse F, Shih CJ, Wu WJ, Yang CC, Thomas J, Beaudoing E, Pradervand S, Flegel V, Cook ED, Fabbretti R, Stockinger H, Long L, Farmerie WG, Oakey J, Boomsma JJ, Pamilo P, Yi SV, Heinze J, et al.: The genome of the fire antSolenopsis invicta.Proc Natl Acad Sci U S A 2011, 108:5679–5684. 10.1073/pnas.1009690108
Gadau J, Helmkampf M, Nygaard S, Roux J, Simola DF, Smith CR, Suen G, Wurm Y, Smith CD: The genomic impact of 100 million years of social evolution in seven ant species.Trends Genet 2012, 28:14–21. 10.1016/j.tig.2011.08.005
Gnerre S, MacCallum I, Przybylski D, Ribeiro F, Burton J, Walker B, Sharpe T, Hall G, Shea T, Sykes S, Berlin A, Aird D, Costello M, Daza R, Williams L, Nicol R, Gnirke A, Nusbaum C, Lander ES, Jaffe DB: High-quality draft assemblies of mammalian genomes from massively parallel sequence data.Proc Natl Acad Sci U S A 2010, 108:1513–1518.
Elsik CG, Worley KC, Bennett AK, Beye M, Camara F, Childers CP, de Graaf DC, Debyser G, Deng J, Devreese B, Elhaik E, Evans JD, Foster LJ, Graur D, Guigo R, Hoff KJ, Holder ME, Hudson ME, Hunt GJ, Jiang H, Joshi V, Khetani RS, Kosarev P, Kovar CL, Ma J, Maleszka R, Moritz RF, Munoz-Torres MC, Murphy TD, HGSC production teams, et al.: Finding the missing honey bee genes: lessons learned from a genome upgrade.BMC Genomics 2014, 15:86. 10.1186/1471-2164-15-86
Tan HW, Liu GH, Dong X, Lin RQ, Song HQ, Huang SY, Yuan ZG, Zhao GH, Zhu XQ: The complete mitochondrial genome of the Asiatic cavity-nesting honeybeeapis cerana(hymenoptera: apidae).Plos one 2011, 6:e23008. 10.1371/journal.pone.0023008
Bonasio R, Zhang G, Ye C, Mutti NS, Fang X, Qin N, Donahue G, Yang P, Li Q, Li C, Zhang P, Huang Z, Berger SL, Reinberg D, Wang J, Liebig J: Genomic comparison of the antsCamponotus floridanusandHarpegnathos saltator.Science 2010, 329:1068–1071. 10.1126/science.1192428
Suen G, Teiling C, Li L, Holt C, Abouheif E, Bornberg-Bauer E, Bouffard P, Caldera EJ, Cash E, Cavanaugh A, Denas O, Elhaik E, Favé MJ, Gadau J, Gibson JD, Graur D, Grubbs KJ, Hagen DE, Harkins TT, Helmkampf M, Hu H, Johnson BR, Kim J, Marsh SE, Moeller JA, Muñoz-Torres MC, Murphy MC, Naughton MC, Nigam S, Overson R, et al.: The genome sequence of the leaf-cutter antAtta cephalotesreveals insights into its obligate symbiotic lifestyle.PLoS Genet 2011, 7:e1002007. 10.1371/journal.pgen.1002007
Werren JH, Richards S, Desjardins CA, Niehuis O, Gadau J, Colbourne JK, Rechtsteiner A, Reese JT, Reid JG, Riddle M, Robertson HM, Romero-Severson J, Rosenberg M, Sackton TB, Sattelle DB, Schlüns H, Schmitt T, Schneider M, Schüler A, Schurko AM, Shuker DM, Simões ZL, Sinha S, Smith Z, Solovyev V, Souvorov A, Springauf A, Stafflinger E, Stage DE, Stanke M, et al.: Functional and evolutionary insights from the genomes of three parasitoidnasoniaspecies.Science 2010, 327:343–348. 10.1126/science.1178028
Coulondre C, Miller JH, Farabaugh PJ, Gilbert W: Molecular basis of base substitution hotspots inEscherichia coli.Nature 1978, 274:775–780. 10.1038/274775a0
Yi SV, Goodisman MA: Computational approaches for understanding the evolution of DNA methylation in animals.Epigenetics Off J DNA Methylation Soc 2009, 4:551–556. 10.4161/epi.4.8.10345
Bird AP: DNA methylation and the frequency of CpG in animal DNA.Nucleic Acids Res 1980, 8:1499–1504. 10.1093/nar/8.7.1499
Glastad KM, Hunt BG, Yi SV, Goodisman MA: DNA methylation in insects: on the brink of the epigenomic era.Insect Mol Biol 2011, 20:553–565. 10.1111/j.1365-2583.2011.01092.x
Okamura K, Matsumoto KA, Nakai K: Gradual transition from mosaic to global DNA methylation patterns during deuterostome evolution.BMC Bioinform 2010,7(11 Suppl):S2.
International Aphid Genomics Consortium: Genome sequence of the pea aphidAcyrthosiphon pisum.PLoS Biol 2010, 8:e1000313. 10.1371/journal.pbio.1000313
Simola DF, Wissler L, Donahue G, Waterhouse RM, Helmkampf M, Roux J, Nygaard S, Glastad KM, Hagen DE, Viljakainen L, Reese JT, Hunt BG, Graur D, Elhaik E, Kriventseva EV, Wen J, Parker BJ, Cash E, Privman E, Childers CP, Muñoz-Torres MC, Boomsma JJ, Bornberg-Bauer E, Currie CR, Elsik CG, Suen G, Goodisman MA, Keller L, Liebig J, Rawls A, et al.: Social insect genomes exhibit dramatic evolution in gene composition and regulation while preserving regulatory features linked to sociality.Genome Res 2013, 23:1235–1247. 10.1101/gr.155408.113
International Silkworm Genome C: The genome of a lepidopteran model insect, the silkwormBombyx mori.Insect Biochem Mol Biol 2008, 38:1036–1045. 10.1016/j.ibmb.2008.11.004
Robertson HM: Themarinertransposable element is widespread in insects.Nature 1993, 362:241–245. 10.1038/362241a0
Lampe DJ, Witherspoon DJ, Soto-Adames FN, Robertson HM: Recent horizontal transfer of mellifera subfamily mariner transposons into insect lineages representing four different orders shows that selection acts only during horizontal transfer.Mol Biol Evol 2003, 20:554–562. 10.1093/molbev/msg069
Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O’Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, DiCuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM: RefSeq: an update on mammalian reference sequences.Nucleic Acids Res 2014, 42:D756-D763. 10.1093/nar/gkt1114
Brandi L, Cantarel IK, Sofia MC, Robb G, Parra Eric R, Barry M, Carson H, Alejandro Sánchez A, Mark Y: MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes.Genome Res 2008, 18:188–196.
Consortium UP: Ongoing and future developments at the Universal Protein Resource.Nucleic Acids Res 2011, 39:D214-D219.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.Nat Genet 2000, 25:25–29. 10.1038/75556
Kanehisa M, Goto S: KEGG: kyoto encyclopedia of genes and genomes.Nucleic Acids Res 2000, 28:27–30. 10.1093/nar/28.1.27
Breed MD, Seth Perry P, Bjostad LB: Testing the blank slate hypothesis: why honey bee colonies accept young bees.Insect Sociaux 2004, 51:12–16. 10.1007/s00040-003-0698-9
Claudianos C, Ranson H, Johnson RM, Biswas S, Schuler MA, Berenbaum MR, Feyereisen R, Oakeshott JG: A deficit of detoxification enzymes: pesticide sensitivity and environmental response in the honeybee.Insect Mol Biol 2006, 15:615–636. 10.1111/j.1365-2583.2006.00672.x
Devillers J, Pham-Delegue MH, Decourtye A, Budzinski H, Cluzeau S, Maurin G: Structure-toxicity modeling of pesticides to honey bees.SAR QSAR Environ Res 2002, 13:641–648. 10.1080/1062936021000043391
Stefanidou M, Athanaselis S, Koutselinis A: The toxicology of honey bee poisoning.Vet Hum Toxicol 2003, 45:261–265.
Ruttner F: Biogeography and Taxonomy of Honeybees. Berlin New York: Springer-Verlag; 1988.
Woodard SH, Fischman BJ, Venkat A, Hudson ME, Varala K, Cameron SA, Clark AG, Robinson GE: Genes involved in convergent evolution of eusociality in bees.Proc Natl Acad Sci U S A 2011, 108:7472–7477. 10.1073/pnas.1103457108
Fischman BJ, Woodard SH, Robinson GE: Molecular evolutionary analyses of insect societies.Proc Natl Acad Sci U S A 2011,108(Suppl 2):10847–10854.
Robertson HM, Wanner KW: The chemoreceptor superfamily in the honey bee,Apis mellifera: expansion of the odorant, but not gustatory, receptor family.Genome Res 2006, 16:1395–1403. 10.1101/gr.5057506
Robertson HM, Warr CG, Carlson JR: Molecular evolution of the insect chemoreceptor gene superfamily inDrosophila melanogaster.Proc Natl Acad Sci U S A 2003, 100:14537–14542. 10.1073/pnas.2335847100
Benton R, Vannice KS, Gomez-Diaz C, Vosshall LB: Variant ionotropic glutamate receptors as chemosensory receptors indrosophila.Cell 2009, 136:149–162. 10.1016/j.cell.2008.12.001
Robertson HM, Gadau J, Wanner KW: The insect chemoreceptor superfamily of the parasitoid jewel waspNasonia vitripennis.Insect Mol Biol 2010, 19:121–136.
Croset V, Rytz R, Cummins SF, Budd A, Brawand D, Kaessmann H, Gibson TJ, Benton R: Ancient protostome origin of chemosensory ionotropic glutamate receptors and the evolution of insect taste and olfaction.PLoS Genet 2010, 6:e1001064. 10.1371/journal.pgen.1001064
Morel RKVMaL: NESTMATE RECOGNITION IN ANTS. 1998.
Dani FR, Jones GR, Corsi S, Beard R, Pradella D, Turillazzi S: Nestmate recognition cues in the honey Bee: differential importance of cuticular alkanes and alkenes.Chem Senses 2005, 30:477–489. 10.1093/chemse/bji040
Châline N, Sandoz JC, Martin SJ, Ratnieks FL, Jones GR: Learning and discrimination of individual cuticular hydrocarbons by honeybees (apis mellifera).Chem Senses 2005, 30:327–335. 10.1093/chemse/bji027
de Brito Sanchez MG: Taste perception in honey bees.Chem Senses 2011, 36:675–692. 10.1093/chemse/bjr040
Robinson GE: Genomics and integrative analyses of division of labor in honeybee colonies.Am Nat 2002,160(Suppl 6):S160-S172.
Wang Y, Brent CS, Fennern E, Amdam GV: Gustatory perception and fat body energy metabolism are jointly affected by vitellogenin and juvenile hormone in honey bees.PLoS Genet 2012, 8:e1002779. 10.1371/journal.pgen.1002779
Kent LB, Walden KK, Robertson HM: The Gr family of candidate gustatory and olfactory receptors in the yellow-fever mosquitoAedes aegypti.Chem Senses 2008, 33:79–93. 10.1093/chemse/bjm067
Peter J, Clyne CGW, John R, Carlson: Candidate taste receptors inDrosophila.Science 2000, 287:1830–1834. 10.1126/science.287.5459.1830
Montell C: A taste of theDrosophilagustatory receptors.Curr Opin Neurobiol 2009, 19:345–353. 10.1016/j.conb.2009.07.001
Tetsuya Miyamoto YC, Jesse S, Hubert A: Identification of adrosophilaglucose receptor using Ca2+ imaging of single chemosensory neurons.Plos one 2013, 8:e56304. 10.1371/journal.pone.0056304
Miyamoto T, Slone J, Song X, Amrein H: A fructose receptor functions as a nutrient sensor in theDrosophilabrain.Cell 2012, 151:1113–1125. 10.1016/j.cell.2012.10.024
Suh GS, Wong AM, Hergarden AC, Wang JW, Simon AF, Benzer S, Axel R, Anderson DJ: A single population of olfactory sensory neurons mediates an innate avoidance behaviour inDrosophila.Nature 2004, 431:854–859. 10.1038/nature02980
Stange G: The response of the honeybee antennal CO 2 -receptors to N 2 O and Xe.J Comp Physiol 1973, 86:139–158. 10.1007/BF00702534
Ai M, Min S, Grosjean Y, Leblanc C, Bell R, Benton R, Suh GS: Acid sensing by the Drosophila olfactory system.Nature 2010, 468:691–695. 10.1038/nature09537
Wanner KW, Nichols AS, Walden KK, Brockmann A, Luetje CW, Robertson HM: A honey bee odorant receptor for the queen substance 9-oxo-2-decenoic acid.Proc Natl Acad Sci U S A 2007, 104:14383–14388. 10.1073/pnas.0705459104
Hallem EA, Dahanukar A, Carlson JR: Insect odor and taste receptors.Annu Rev Entomol 2006, 51:113–135. 10.1146/annurev.ento.51.051705.113646
SCHMIDT J: Toxinology of venoms from the honeybee genus apis.Toxicon 1995, 33:917–927. 10.1016/0041-0101(95)00011-A
Raffiudin R, Crozier RH: Phylogenetic analysis of honey bee behavioral evolution.Mol Phylogenet Evol 2007, 43:543–552. 10.1016/j.ympev.2006.10.013
Krieger J, Klink O, Mohl C, Raming K, Breer H: A candidate olfactory receptor subtype highly conserved across different insect orders.J Comp Physiol A 2003, 189:519–526. 10.1007/s00359-003-0427-x
Larsson MC, Domingos AI, Jones WD, Chiappe ME, Amrein H, Vosshall LB: Or83b encodes a broadly expressed odorant receptor essential forDrosophilaolfaction.Neuron 2004, 43:703–714. 10.1016/j.neuron.2004.08.019
Jones WD, Nguyen TA, Kloss B, Lee KJ, Vosshall LB: Functional conservation of an insect odorant receptor gene across 250 million years of evolution.Curr Biol 2005, 15:R119-R121.
Benton R, Sachse S, Michnick SW, Vosshall LB: Atypical membrane topology and heteromeric function ofDrosophilaodorant receptors in vivo.PLoS Biol 2006, 4:e20. 10.1371/journal.pbio.0040020
Yao CA, Ignell R, Carlson JR: Chemosensory coding by neurons in the coeloconic sensilla of theDrosophilaantenna.J Neurosci 2005, 25:8359–8367. 10.1523/JNEUROSCI.2432-05.2005
Hallem EA, Carlson JR: Coding of odors by a receptor repertoire.Cell 2006, 125:143–160. 10.1016/j.cell.2006.01.050
Silbering AF, Rytz R, Grosjean Y, Abuin L, Ramdya P, Jefferis GS, Benton R: Complementary function and integrated wiring of the evolutionarily distinctDrosophilaolfactory subsystems.J Neurosci 2011, 31:13357–13375. 10.1523/JNEUROSCI.2360-11.2011
Rytz R, Croset V, Benton R: Ionotropic receptors (IRs): chemosensory ionotropic glutamate receptors inDrosophilaand beyond.Insect Biochem Mol Biol 2013, 43:888–897. 10.1016/j.ibmb.2013.02.007
Chao Liu RJP, Jonathan D, Bohbot JD, Jones Patrick L, Wang G, Zwiebel LJ: Distinct olfactory signaling mechanisms in the malaria vector mosquitoAnopheles gambiae.PLoS Biol 2010, 8:e1000467. 10.1371/journal.pbio.1000467
Olivier V, Monsempes C, François MC, Poivet E, Jacquin-Joly E: Candidate chemosensory ionotropic receptors in a Lepidoptera.Insect Mol Biol 2010, 20:189–199.
Wiegmann BM, Trautwein MD, Kim JW, Cassel BK, Bertone MA, Winterton SL, Yeates DK: Single-copy nuclear genes resolve the phylogeny of the holometabolous insects.BMC Biol 2009, 7:34. 10.1186/1741-7007-7-34
Evans JD, Spivak M: Socialized medicine: Individual and communal disease barriers in honey bees.J Invertebr Pathol 2010, 103:S62-S72.
Li J, Qin H, Wu J, Sadd BM, Wang X, Evans JD, Peng W, Chen Y: The prevalence of parasites and pathogens in Asian honeybeesApis Ceranain china.Plos one 2012, 7:e47955. 10.1371/journal.pone.0047955
Yoo MS, Yoon BS: Incidence of honey bee disease in Korea 2009.Korean J Apic 2009, 24:273–278.
Evans JD, Aronstein K, Chen YP, Hetru C, Imler JL, Jiang H, Kanost M, Thompson GJ, Zou Z, Hultmark D: Immune pathways and defence mechanisms in honey beesApis mellifera.Insect Mol Biol 2006, 15:645–656. 10.1111/j.1365-2583.2006.00682.x
Park D, Jung JW, Lee MO, Lee SY, Kim B, Jin HJ, Kim J, Ahn YJ, Lee KW, Song YS, Hong S, Womack JE, Kwon HW: Functional characterization of naturally occurring melittin peptide isoforms in two honey bee species,Apis melliferaandApis cerana.Peptides 2014, 53:185–193.
Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, Macmanes MD, Ott M, Orvis J, Pochet N, Strozzi F, Weeks N, Westerman R, William T, Dewey CN, Henschel R, Leduc RD, Friedman N, Regev A: De novotranscript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.Nat Protoc 2013, 8:1494–1512. 10.1038/nprot.2013.084
Lukashin AV, Borodovsky M: GeneMark.hmm: new solutions for gene finding.Nucleic Acids Res 1998, 26:1107–1115. 10.1093/nar/26.4.1107
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J: Repbase Update, a database of eukaryotic repetitive elements.Cytogenet Genome Res 2005, 110:462–467. 10.1159/000084979
Grant JR, Stothard P: The CGView Server: a comparative genomics tool for circular genomes.Nucleic Acids Res 2008, 36:W181-W184. 10.1093/nar/gkn179
Altschul SFGW, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool.J Mol Biol 1990, 215:403–410. 10.1016/S0022-2836(05)80360-2
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, Pimentel H, Salzberg SL, Rinn JL, Pachter L: Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.Nat Protoc 2012, 7:562–578. 10.1038/nprot.2012.016
Slater GS, Birney E: Automated generation of heuristics for biological sequence comparison.BMC Bioinformatics 2005, 6:31. 10.1186/1471-2105-6-31
Lee E, Harris N, Gibson M, Chetty R, Lewis S: Apollo: a community resource for genome annotation editing.Bioinformatics 2009, 25:1836–1837. 10.1093/bioinformatics/btp314
St Pierre SE, Ponting L, Stefancsik R, McQuilton P, FlyBase Consortium: FlyBase 102–advanced approaches to interrogating FlyBase.Nucleic Acids Res 2014, 42:D780-D788. 10.1093/nar/gkt1092
Fischer S, Brunk BP, Chen F, Gao X, Harb OS, Iodice JB, Shanmugam D, Roos DS, Stoeckert CJ Jr: Using OrthoMCL to Assign Proteins to OrthoMCL-DB Groups or to Cluster Proteomes into new Ortholog Groups.Current Protocols in Bioinformatics /Editoral Board, Andreas D Baxevanis 2011. Chapter 6:Unit 6 12 11–19
Stephen F, Altschul TLM, Alejandro A, Schäffer Jinghui Z, Zheng Zhang WM, David J, Lipman: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.Nucleic Acids Res 1997, 25:3389–3402. 10.1093/nar/25.17.3389
Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD: The Pfam protein families database.Nucleic Acids Res 2012, 40:290–301. 10.1093/nar/gkr717
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG: Clustal W and Clustal X version 2.0.Bioinformatics 2007, 23:2947–2948. 10.1093/bioinformatics/btm404
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.Mol Biol Evol 2011, 28:2731–2739. 10.1093/molbev/msr121
Li B, Dewey CN: RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.BMC Bioinformatics 2011, 12:323. 10.1186/1471-2105-12-323
Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W: PipMaker–a web server for aligning two genomic DNA sequences.Genome Res 2000, 10:577–586. 10.1101/gr.10.4.577
Kodama Y, Shumway M, Leinonen R, International Nucleotide Sequence Database C: The sequence read archive: explosive growth of sequencing data.Nucleic Acids Res 2012, 40:D54-D56. 10.1093/nar/gkr854
We appreciate RDA (Rural Development Administration) of Korea, Korean honey bee association, and Korea Beekeeping Association for helpful advice on beekeeping. We also thank Dr. Hugh Robertson at the University of Illinois for helpful comments on honey bee chemosensory receptors. We also appreciate NICEM of Seoul National University and two bioinformatics companies, MACROGEN and PHYZEN, for fast and accurate sequencing service and helpful discussion. Many thanks go to the members of the Kwon Lab at Seoul National University for assisting bee hive maintenance and collection of bees. This work was supported by Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ010487) from the RDA of the Republic of Korea to HK.
The authors declare that they have no competing interests.
BC, MJ, JLim, IC, YP, and TY carried out the sequence analysis and bioinformatics work from genome sequence. Specifically, BC conducted genome analysis and DP and MJ analysed RNAseq data. JLee participated in phylogenetic analysis and YY performed CpG analysis. DP, JJ, and YC, and ML collected honey bees and isolated RNA and DNA from the samples and participated in the design of the study. DP, ORE, GN, and HK conceived of the study, and participated in its design and coordination. DP, GN, and HK wrote the manuscript. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Table S1: Statistics of the honey bee genome assembly, Apis mellifera and Apis cerana.Table S2. Summary of repetitive elements in the A. cerana genome. Table S3. GO terms enriched in honey bee shared orthologs. Table S4. Immune related gene set counts in social and non-social insect, Apis cerana, Apis mellifera, Nasonia vitripennis, Linepithema humile, and Drosophila melanogaster. Figure S1. Syntenic view of the A. mellifera chromosome 3 and A. cerana scaffolds. Figure S2. Syntenic view of the mitochondrial genome of A. mellifera and A. cerana. Figure S3. The mitochondrial genome of A. cerana. Figure S4. Nomarlized CpG dinucleotide in protein-coding sequences in A. cerana. Figure S5. KEGG pathways in A. cerana. Figure S6. Phylogenetic tree of the gustatory receptor family. Figure S7. Phylogenetic tree of the odorant receptor family. Figure S8. Amino acid alignment between AmOr11 and AcOr30. (PDF 2 MB)
Additional file 2:A. ceranaOGS v 1.0.(TXT 3 MB)
Additional file 3:A. ceranaOGS v 1.0.(XLSX 66 KB)
Additional file 4:A. ceranaspecific orthologs.(XLSX 21 KB)
Additional file 5:A. ceranaandA. mellifera Orgenes.(XLSX 17 KB)
About this article
Cite this article
Park, D., Jung, J.W., Choi, BS. et al. Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing. BMC Genomics 16, 1 (2015). https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2164-16-1
- Apis cerana
- Asian honey bee
- Social insect
- Chemosensory receptors
- Honey bee immunity