

An advanced bioinformatics approach for analyzing RNA-seq data reveals sigma H-dependent regulation of competence genes in Listeria monocytogenes




Alternative σ factors are important transcriptional regulators in bacteria. While σB has been shown to control a large regulon and play important roles in stress response and virulence in the pathogen Listeria monocytogenes, the function of σH has not yet been well defined in Listeria, even though σH controls a large regulon in the closely related non-pathogenic Bacillus subtilis.


Using RNA-seq characterization of a L. monocytogenes strain with deletions of all 4 genes encoding alternative σ factors (ΔBCHL), which was further modified to overexpress sigH (ΔBCHL::P rha -sigH), we identified 6 transcription units (TUs) that are transcribed from σH-dependent promoters. Five of these TUs had not been previously identified. Identification of these promoters was facilitated by use of a bioinformatics approach that compared normalized RNA-seq coverage (NRC) between ΔBCHL::P rha -sigH and a ΔBCHL control, using sliding windows of 51 nt along the whole genome rather than comparing NRC calculated only for whole genes. Interestingly, we found that three operons that encode competence genes (comGABCDEFG, comEABC, coiA) are transcribed from σH-dependent promoters. While these promoters were highly conserved in L. monocytogenes, none of them were found in all Listeria spp., and coiA and its σH-dependent promoter were only found in L. monocytogenes.


Our data indicate that a number of L. monocytogenes competence genes are regulated by σH. This σH-dependent regulation of competence-related genes is conserved in the pathogen L. monocytogenes, but not in other non-pathogenic Listeria strains. Combined with prior data that indicated a role of σH in virulence in a mouse model, this suggests a possible novel role of σH-dependent competence genes in L. monocytogenes virulence. Development and implementation of a sliding window approach to identify differential transcription using RNA-seq data not only allowed for identification of σH-dependent promoters, but also provides a general approach for sensitive identification of differentially transcribed promoters and genes, particularly for genes that are transcribed from multiple promoter elements, only some of which show differential transcription.


At the transcriptional level, bacterial gene expression under rapidly changing environmental conditions is controlled by changes in associations between different alternative σ factors and the catalytic core of RNA polymerase. Alternative σ factors are important contributors to gene expression under stress conditions in both Gram-positive and Gram-negative bacteria [1]. In the foodborne pathogen Listeria monocytogenes, four alternative σ factors (σB, σH, σL, σC) play a role in transcriptional regulation. σB and the σB regulon have been well defined in this pathogen through a variety of different approaches [2–4]. A number of different studies have indicated that σB controls a large regulon that plays important roles in both stress response and virulence. The regulons controlled by the other alternative σ factors in L. monocytogenes have been less well defined.

While σH specifically has been shown to play an important role in regulating spore formation and competence in B. subtilis [5], the function of σH in Listeria has not yet been well defined. In B. subtilis, σH has been reported to regulate >400 genes (approx. 240 genes positively and approx. 180 negatively) [6]. B. subtilis σH is also involved in the transition from exponential phase to stationary phase, nutrient transport, and the regulation of many other transcription factors and cell-wall-binding proteins [7]. In L. monocytogenes, characterization of a sigH null mutant suggested a role for σH in growth on minimal medium and under alkaline conditions as well as a role in virulence, as assessed in an intraperitoneally inoculated mouse model [8]. While previous microarray experiments identified 56 genes as being directly upregulated by σH, these experiments also found that the largest category of co-regulated genes was represented by genes that showed transcript levels affected by both the σB and σH deletions [9]. Based on this considerable overlap between the σB and σH regulons, we surmised that different approaches are needed to further define σH-dependent genes in L. monocytogenes. In order to eliminate redundancies between transcriptional regulation by σH and other alternative σ factors, we chose to perform RNA-seq based comparisons of transcript levels between a L. monocytogenes mutant with deletions of all 4 genes encoding alternative σ factors (ΔBCHL), and one with the same background that was modified to overexpress sigH (ΔBCHL::P rha -sigH). We also implemented an advanced bioinformatics approach where we compare normalized RNA-seq coverage (NRC) between these two strains using a sliding window of 51 nt and 25 nt overlap along the whole genome, rather than comparing NRC calculated only for whole genes; a similar approach was detailed independently in a recent study [10].
We surmised that this approach would allow for more sensitive identification of differentially regulated genes and gene fragments, particularly for genes that may be preceded by multiple promoters (e.g., a σH and a σA dependent promoter), where differential transcript levels may only be detectable downstream of the σH-dependent promoter, but not in the actual open reading frame (ORF) where transcription from the σA-dependent promoter may obscure the differential transcription from the σH-dependent promoter.

Results and discussion

A sliding window method for identification of differentially regulated genes and promoters provides a sensitive approach for identification of σH dependent genes

Traditionally, analysis of RNA-seq data is performed by calculating the normalized RNA-seq coverage (NRC) for a given annotated gene and comparing NRCs between strains with different genetic backgrounds or strains grown under different conditions. When this approach was used here, we initially identified 5 genes that showed significantly higher transcript levels (FDR < 0.05; FC > 2.0) in the L. monocytogenes strain overexpressing sigH (ΔBCHL::P rha -sigH), as compared to the control strain ΔBCHL::P rha , which does not contain sigH. These 5 genes were LMRG_00908, LMRG_00935 (comEA), LMRG_00937 (comEC), LMRG_01629 (lytG), and LMRG_01643 (coiA) (see Additional file 1: Table S1). A total of four σH-dependent promoters were identified upstream of these five genes, as comEA and comEC share the same promoter. Considering these promoters and previously reported operon structures in L. monocytogenes [11, 12], these 5 σH-dependent genes were found to represent 2 multigene operons, namely (i) LMRG_00908, LMRG_00907 (dnaG) and LMRG_00906 (rpoD) and (ii) LMRG_00935 (comEA), LMRG_00936 (comEB) and LMRG_00937 (comEC), and 2 monocistronic genes, coiA and lytG, for a total of 8 genes that are positively regulated by σH (Fig. 1a–d). We did not find any genes that were significantly downregulated by σH (FDR < 0.05; FC < 0.5).

Fig. 1

Schematic of σH-dependent transcriptional units identified here. a LMRG_00908-dnaG-rpoD operon; b comEABC operon; c coiA; d lytG; e comGABCDEFG operon; f LMRG_01010-LMRG_01005 operon. Lines show average RNA-seq coverage for a sliding window of 100 nt. Blue lines indicate RNA-seq coverage for the three replicate experiments with the 10403S::ΔBCHL P rha -sigH strain (which overexpresses sigH), while green lines indicate RNA-seq coverage for the three replicate experiments with the 10403S::ΔBCHL P rha strain (which does not contain sigH). Maximum average coverages are shown on the left side of each panel. Black arrows indicate the direction in which the RNA-seq reads were mapped to the chromosome. Genes significantly differentially expressed by the standard approach are labeled with a * next to the gene name. Positions of significant fragments are shown as dotted lines underneath the genes, with their maximum sliding window fold change shown underneath. Stem loop symbols indicate transcriptional terminators. Genes colored in gray are part of the operons found to be significantly differentially expressed by the sliding window approach and are drawn to scale. Genes colored in magenta are not part of the significant operon and are not drawn to scale. Promoters are indicated by stemmed arrows. A window size of 100 nt was used for the smoothing method. Values on the graph represent the center of these 100 nt windows.

Subsequent sliding window analysis, using the same RNA-seq data analyzed with the standard baySeq approach (described in Methods), identified two additional σH-dependent promoters, in addition to also identifying the promoters described above. The two newly identified promoters, along with previously reported operon structures in L. monocytogenes [11, 12], identified two additional multigene operons, (i) LMRG_00797 to LMRG_00791 and (ii) LMRG_01010 to LMRG_01005, as σH-dependent (Table 1). Overall, the new σH-dependent operons identified using the sliding window approach represent a total of 13 genes that are positively regulated by σH (Fig. 1e–f). The LMRG_01010-LMRG_01005 operon, which was identified only with the sliding window approach, showed both a σH- and two σA-dependent promoters (Fig. 1f). These data indicate that the sliding window RNA-seq analysis approach described here provides superior sensitivity for identification of differentially transcribed genes, particularly when multiple promoters are found upstream of a gene or operon.

Table 1 Results of Sliding Window approach for identification of differentially expressed fragments regulated by σH

Newly identified σH-dependent genes include three σH-dependent operons that encode competence proteins

A comparison of the σH-dependent genes identified here with σH-dependent genes and operons that were previously found with microarray based characterization of a L. monocytogenes ΔsigH strain [9] showed that we identified a number of σH-dependent genes and operons that had not been previously identified, including LMRG_01629 (lytG), LMRG_00935 (comEA), LMRG_00937 (comEC), and LMRG_01643 (coiA) (Table 1). The previous microarray study used for comparison used a ΔsigH strain in a wildtype background and used a standard probe based microarray (not a tiling array); no additional transcriptomic studies characterizing the σH regulon in Listeria were available for comparison. Interestingly, among the 56 genes identified as directly regulated by σH in the previous microarray based study of a ΔsigH strain [9], only 16 showed a FC >2.0 (the other 40 genes showed FCs between 1.5 and 2.0). Only one of these previously identified genes (LMRG_00908) was also found to be σH-dependent here. This may suggest that a number of genes recognized as σH-dependent in this previous study only show differential σH-dependent transcription in the presence of the other three alternative σ factors, which were not present in the L. monocytogenes ΔBCHL strain that was used here. These observations further highlight the importance of the alternative approach we have taken here (use of a ΔBCHL strain overexpressing sigH) to allow for identification of additional σH-dependent genes. Ectopic and artificial induction of sigma factors and other regulatory proteins has been used successfully in the past to study those regulators where physiological induction signals have not been found [13] and/or where mutants cannot be made [14]. In some cases, long periods of time have passed between linking a set of genes to a regulator (using artificial induction) and discovering a physiological condition under which a given regulator and its regulated genes are induced. For example, in S. aureus, σH-dependent transcription of competence genes after overexpression was first reported in 2003 [15], while functional confirmation of σH contributions to competence was only reported almost 10 years later [16]. While our data reported here thus provide important evidence for contributions of σH to regulation of competence genes in L. monocytogenes, future experiments will be needed to probe the phenotypic importance of this regulatory pathway and to define under which conditions competence gene expression is upregulated by σH in wildtype L. monocytogenes.

Importantly, we identified three σH-dependent transcription units that encode competence proteins, including (i) comGABCDEFG, (ii) comEABC and (iii) coiA (Fig. 1). In B. subtilis, a naturally competent bacterium, ComG proteins are required for exogenous DNA to reach the membrane bound receptor ComEA during transformation [17, 18]. Among the genes in the comEABC operon, only comEA and comEC (encoding a polytopic membrane protein that forms the membrane translocation channel) have been shown to be required for transformation in B. subtilis [19]. B. subtilis coiA has also been shown to be involved in the establishment of DNA transport [20]. Even though competence traditionally implies uptake of DNA, competence genes have been shown to also play roles in survival [21] and virulence [22], processes which do not necessarily require DNA uptake (reviewed in [23]).

In L. monocytogenes, competence is not well understood. While this organism seems to possess the machinery for competence, the gene encoding a key regulator of competence in B. subtilis, comK, is often interrupted by prophages in L. monocytogenes strains [24]. Excision of this prophage and restoration of an intact comK has been shown to be involved in phagosomal escape and virulence [22] but competence was not tested in this study. The one published study that experimentally tested for competence in L. monocytogenes did not find evidence for competence even among two L. monocytogenes strains that carried an intact comK [25].

Interestingly, S. aureus σH has also been shown to regulate competence-related genes [15]. Although S. aureus was originally thought not to be competent, it has now been reported that, facilitated by a complex regulatory mechanism, S. aureus cells are able to take up exogenous DNA, such as antibiotic resistance determinants, through horizontal gene transfer [16]. Even though the specific conditions for competence in L. monocytogenes have not been found yet, the fact that this pathogen has conserved σH-dependent mechanisms of regulation of competence genes suggests a specific physiological role of these genes in L. monocytogenes.

Among other genes identified as being directly regulated by σH are the housekeeping genes rpoD, encoding σA, and dnaG, encoding the DNA primase (Fig. 1). Another σH-dependent gene, lytG, encodes an enzyme with a mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase-like domain, which is found in enzymes such as lysozymes and the flagellar protein J that can hydrolyze peptidoglycan [26]. Two genes involved in oxidative stress, msrA and msrB, were also found to be directly regulated by σH, suggesting a possible role of σH in oxidative stress response [27]. Moreover, the σH-dependent promoter upstream of this operon has a long 5’ UTR (270 nt), which overlaps with the 5’ UTR of another gene, LMRG_01011, encoded on the opposite strand. LMRG_01011 encodes a hemolysin III protein (Hly III), and the expression of this protein could therefore be regulated by σH post-transcriptionally through RNA interference caused by the 5’ UTR transcribed from the LMRG_01010 promoter.

σH - dependent promoters are highly conserved among different L. monocytogenes genomes

The six σH-dependent promoters identified in L. monocytogenes 10403S were all found to be present in the 23 additional L. monocytogenes genomes analyzed here (Additional file 2: Table S2). In general, −35 and −10 promoter regions were highly conserved (Fig. 2, Table 2). For both the comGA and the lytG promoter sequence, the −35 region was completely conserved across all 24 genomes, while the −10 region presented two variants for each of these two promoters. The comEA promoter showed a completely conserved −10 region, while the −35 region showed three variants. On the other hand, the −35 and −10 regions for the three other σH-dependent promoters (upstream of LMRG_00908, LMRG_01010 and coiA) were each completely conserved across all 24 genomes. By comparison, sigH was found to be present in all 24 genomes with a high level of conservation (only 4 polymorphic amino acid residues over 201 amino acids). Overall, these data indicate considerable conservation of the σH-dependent promoters identified, even though some promoters show lineage or strain specific sequence features. Comparative analysis of the frequency of nucleotide changes within the −35 and −10 signal regions compared to the non-functional sequences (assumed here to be under neutral selection) between these two functional regions suggests that both −35 and −10 signals are under selective pressure among the L. monocytogenes strains analyzed (Table 3). This suggests that the σH-dependent promoters identified in this study are functional across all L. monocytogenes strains and, therefore, are probably important for L. monocytogenes physiology under certain conditions that remain to be established. Future studies are needed to explore the functional importance of the polymorphic sequence features. Interestingly, a previous study [28] found evidence for positive selection of a promoter region that regulates virulence gene expression in L. monocytogenes.
The sites found under positive selection in this previous study generated a putative σB-dependent promoter in some lineage I strains, which supports that strain and lineage specific promoter region polymorphisms may affect virulence gene expression in L. monocytogenes.
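The variant counts reported for the −35 and −10 regions amount to tallying the distinct sequences seen in a given block of aligned columns. A minimal sketch of that tally follows, assuming gap-free aligned promoter sequences; the function name, the column coordinates and the toy sequences are hypothetical and are not data from this study.

```python
from collections import Counter

def region_variants(aligned_promoters, start, end):
    """Count distinct variants of aligned columns [start, end) (0-based)."""
    return Counter(seq[start:end].upper() for seq in aligned_promoters)

# Hypothetical toy alignment: 9-nt -35 region, 4-nt spacer, 7-nt -10 region.
toy_alignment = [
    "GAAGGAATTCCCCAGAGAAT",
    "GAAGGAATTCCCCAGAGAAT",
    "GAAGGAATTCCCCAGCGAAT",
]

minus35 = region_variants(toy_alignment, 0, 9)    # one variant: fully conserved
minus10 = region_variants(toy_alignment, 13, 20)  # two variants
print(len(minus35), len(minus10))
```

A region is "completely conserved" in this sense when the returned counter holds a single key.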

Fig. 2

Sequence logos for σH-dependent promoters. a Sequence logos for the six σH-dependent promoters found in this study based on alignment of 24 L. monocytogenes strains. −35 and −10 regions are shown. b Sequence logo of the consensus sequence of σH-dependent promoters based on alignment of sequences from [9]

Table 2 σH promoter sequences for the σH-dependent operons identified in this study
Table 3 Conservation of identified σH-dependent promoters across Listeria monocytogenes genomes

σH promoters and the σH regulon have diverged among different Listeria species

Further analysis of the six σH-dependent promoters identified in L. monocytogenes showed that only the LMRG_00908 and LMRG_01010 promoters were found in all 5 Listeria species analyzed (i.e., L. monocytogenes, L. innocua, L. ivanovii, L. welshimeri and L. seeligeri). The comGA promoter was not found in the L. welshimeri genome (Additional file 2: Table S2). The comEA promoter was not found in the L. ivanovii and L. welshimeri genomes and the coiA and lytG promoters were only found in the L. monocytogenes genomes. Analyses of the genes transcribed by these promoters revealed the respective genes were also missing in the genomes where a promoter could not be found. Therefore, the competence genes represented by the comGABCDEFG operon, the comEABC operon and the coiA gene are not found across all Listeria species and the σH regulon is not conserved across these species. Taking into consideration that a sigH mutant has been shown to have reduced virulence in a mouse model [8], the absence of these σH-dependent genes in non-pathogenic Listeria spp. could indicate that these genes may have evolved to contribute σH-dependent virulence related functions in L. monocytogenes. By comparison, sigH was present in all Listeria species genomes; in addition to the 4 polymorphic amino acid residues found in L. monocytogenes, a further 18 polymorphic amino acid residues were found in the Listeria spp. genomes.

Among the promoters found in other Listeria species besides L. monocytogenes, only the LMRG_00908 promoter was highly conserved (Fig. 3). This promoter showed a completely conserved −35 region, while two variants of the −10 region were found. The comEA promoter, which was only found in L. monocytogenes, L. innocua and L. seeligeri, had a perfectly conserved −10 region across all three species. However, besides the variants of the −35 region found among the L. monocytogenes strains, the L. innocua and L. seeligeri strains each presented a distinct −35 sequence. The comGA promoter showed variable −35 and −10 regions across the Listeria species. The LMRG_01010 −35 sequence in L. innocua, L. ivanovii, L. welshimeri and L. seeligeri matched that of the L. monocytogenes strains. The only −10 region that diverged from the others was that found in the L. ivanovii strain. Therefore, in addition to a σH regulon that is not conserved across all Listeria species, there is also some variation in the σH-dependent promoters that are present in other Listeria species. While the σH-dependent promoters present generally seem to be conserved enough to suggest conservation of σH-dependent regulation, future experimental studies are needed to explore the potential functional importance of these polymorphic sequence features.

Fig. 3

Alignment of σH-dependent promoters found in Listeria species. L. monocytogenes strains 10403S (lineage II), F2365 (lineage I) and HCC23 (lineage III) are used for comparison. −35 and −10 regions are shown


Combined with prior data that indicated a role of σH in virulence in a mouse model, identification of a number of σH-dependent L. monocytogenes competence genes that are conserved in the pathogen L. monocytogenes, but not in other non-pathogenic Listeria strains, suggests a possible novel role of σH-dependent competence genes in L. monocytogenes virulence. The development and use of a sliding window approach to identify differential transcription using RNA-seq data not only allowed for identification of new σH-dependent promoters in L. monocytogenes, but also provides a general approach for sensitive identification of differentially transcribed promoters and genes. We predict that this approach will be particularly valuable for identification of differentially transcribed genes and genomic regions that are transcribed from multiple and possibly redundant promoter elements.


Bacterial strains, mutant construction, and growth conditions

The quadruple alternative σ factor mutant (ΔBCHL; FSL C3-135) of L. monocytogenes strain 10403S [29] was used as the background strain in this study. This strain was modified to overexpress sigH from a rhamnose inducible promoter. Briefly, the sigH gene was amplified from L. monocytogenes 10403S by PCR and cloned into the plasmid pLF1 [30] downstream of the rhamnose inducible promoter P rha . The plasmid construct was confirmed by PCR and sequencing. All the cloning steps were performed in E. coli DH5α (NEB). The final plasmid was transformed into E. coli strain SM10 to allow for conjugation of the plasmid into L. monocytogenes 10403S ΔBCHL, followed by chromosomal integration of the P rha -sigH construct (yielding strain 10403S::ΔBCHL P rha -sigH; FSL B2-426). A control strain (ΔBCHL-P rha ; FSL B2-429) was constructed by introducing the empty plasmid pLF1 into the chromosome of L. monocytogenes 10403S ΔBCHL through conjugation and chromosomal integration. Transconjugants were selected with 200 μg/ml streptomycin and 7.5 μg/ml chloramphenicol, and confirmed by PCR.

For RNA isolation, strains were streaked from frozen Brain Heart Infusion (BHI) stock, stored at −80 °C in 15 % glycerol, onto a BHI agar plate, followed by incubation at 37 °C for 24 h. A single colony was subsequently inoculated into 5 ml of BHI broth in 16 mm tubes, followed by incubation at 37 °C with shaking (230 rpm) for 18 h (Series 25 Incubator, New Brunswick Scientific, Edison, NJ). After 18 h, 50 μl BHI culture was inoculated into fresh 5 ml BHI broth and grown to OD600 0.4–0.5 at 37 °C.

Rhamnose induction

Induction of sigH transcription was performed by adding 250 μl of 1 M rhamnose stock solution to 5 ml OD600 0.4–0.5 bacterial cultures (for a final concentration of 50 mM rhamnose), followed by incubation at 37 °C for an additional 30 min. Induction with rhamnose was performed for both 10403S::ΔBCHL P rha -sigH and ΔBCHL-P rha . qRT-PCR using the SYBR Green Master Mix Reagent (Life Technologies) and the ABI Prism 7000 Sequence Detection System (Applied Biosystems, Foster City, CA) determined that the optimum rhamnose concentration for sigH induction was 50 mM rhamnose. Transcript levels were determined for sigH and the housekeeping gene rpoB in strain 10403S::ΔBCHL P rha -sigH. Expression level differences were determined by the ΔΔCt method [31] using the housekeeping gene rpoB as the reference gene.
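For readers unfamiliar with the ΔΔCt calculation, the following is a minimal sketch of the standard 2^(−ΔΔCt) formula with rpoB as the reference gene. It assumes roughly 100 % amplification efficiency for both primer pairs, and it assumes the comparison is between an induced sample and an uninduced calibrator; the function name and all Ct values are hypothetical and are not measurements from this study.

```python
def ddct_fold_change(ct_target_induced, ct_ref_induced,
                     ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by 2^-ddCt, assuming ~100 % PCR efficiency."""
    # Normalize the target gene (e.g., sigH) to the reference gene (e.g., rpoB).
    d_ct_induced = ct_target_induced - ct_ref_induced
    d_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    # Compare the normalized values between the two conditions.
    dd_ct = d_ct_induced - d_ct_calibrator
    return 2.0 ** (-dd_ct)

# Hypothetical Ct values for illustration only.
fold = ddct_fold_change(20.0, 18.0, 25.0, 18.0)
print(fold)  # 32.0
```

In this toy case the target crosses threshold 5 cycles earlier after normalization, which corresponds to a 2^5 = 32-fold higher transcript level.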

RNA isolation

RNA isolation was performed as previously described by our group [32], with minor modifications. Briefly, for each sample, 3 ml of RNAprotect bacteria reagent (Qiagen, Valencia, CA) was added to 3 ml of bacterial culture. The mix was incubated at room temperature for 10 min to ensure that the bacterial RNA was stabilized. Cells were pelleted by centrifugation (4,637 × g, 30 min) at 4 °C and suspended in nuclease free water with proteinase K (25 mg/ml) and lysozyme (50 mg/ml), followed by incubation at 37 °C for 30 min. After RNA isolation with TRI reagent, total RNA was incubated with Turbo DNase (Life Technologies) to remove remaining DNA in the presence of RNasin (Promega). Subsequently, RNA was purified using phenol-chloroform/chloroform extractions, followed by precipitation and resuspension. Purity of RNA and efficiency of the DNase treatment were assessed by UV spectrophotometry (Nanodrop, Wilmington, DE) and qRT-PCR for the housekeeping gene rpoB (all samples showed Ct > 35, indicating absence of DNA contamination at relevant levels), respectively. All experiments were performed in three biological replicates.

cDNA libraries and RNA-seq

Preparation of directional cDNA fragment libraries was performed using the ScriptSeq Complete Kit (Bacteria)-Low Input (Epicentre, Madison, WI). 16S and 23S rRNA were removed from total RNA with Ribo-Zero rRNA Removal Reagents (Bacteria)-Low Input and Magnetic Core Kit-Low Input. rRNA-depleted samples were run on the 2100 Bioanalyzer (Agilent Technology, Santa Clara, CA) to confirm reduction of 16S and 23S rRNA, followed by purification using the Agencourt RNAClean XP Kit (Beckman Coulter Inc, Brea, CA). Indexed RNA-seq libraries were quantified by digital PCR and sequencing was carried out on a HiSeq 2500 (single-end, 150 bp per read) at the Cornell Core Facility for RNA-sequencing.

RNA-Seq alignment, coverage and differential expression analysis

Sequence reads were aligned to a 10403S genome using the BWA mem algorithm in BWA version 0.7.3a [33] and the data for coverage per base on sense and antisense strands were analyzed separately using samtools [34].
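As an illustration of how per-base coverage can be held for downstream analysis, the sketch below converts three-column text in the style of `samtools depth` output (reference name, 1-based position, depth) into a dense per-position list. It assumes that strand-specific depth files were produced upstream (that step is not shown), and it fills in zeros because positions without coverage are omitted from the default output. The function name and the example lines are hypothetical.

```python
def dense_coverage(depth_lines, genome_length):
    """Turn 'chrom<TAB>pos<TAB>depth' lines (1-based pos) into a dense list."""
    coverage = [0] * genome_length
    for line in depth_lines:
        line = line.strip()
        if not line:
            continue
        _chrom, pos, depth = line.split("\t")[:3]
        coverage[int(pos) - 1] = int(depth)
    return coverage

# Hypothetical depth lines for an 8-nt toy reference.
example_lines = ["chr\t2\t5", "chr\t3\t7", "chr\t6\t1"]
toy_coverage = dense_coverage(example_lines, 8)
print(toy_coverage)  # [0, 5, 7, 0, 0, 1, 0, 0]
```

The same function would be called once for the sense-strand file and once for the antisense-strand file, yielding two independent coverage vectors.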

Differential expression of genes in the two different strains (ΔBCHL::P rha and ΔBCHL::P rha -sigH) was initially analyzed using the baySeq package (version 2.2.0) for R [35]. Genes were considered differentially expressed if the FDR (False Discovery Rate) was < 0.05 and the FC (Fold Change) was > 2.0 or < 0.5 (FC = average (ΔBCHL-P rha -sigH) / average (ΔBCHL-P rha )).
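The fold change definition and the two thresholds can be written out as a short sketch. The FDR itself is taken as an input here (in this study it comes from baySeq and is not recomputed); the function names, the handling of a zero control mean, and the example values are assumptions made for illustration only.

```python
from statistics import mean

def fold_change(sigh_values, control_values):
    """FC = average(overexpression strain) / average(control strain)."""
    control_mean = mean(control_values)
    if control_mean == 0:
        # Assumption: report infinity when the control shows no coverage.
        return float("inf")
    return mean(sigh_values) / control_mean

def is_differential(fdr, fc, fdr_cutoff=0.05, up=2.0, down=0.5):
    """Apply the FDR < 0.05 and (FC > 2.0 or FC < 0.5) criteria."""
    return fdr < fdr_cutoff and (fc > up or fc < down)

# Hypothetical normalized coverage values for three replicates per strain.
fc = fold_change([120, 100, 110], [20, 25, 21])
print(fc, is_differential(0.01, fc))  # 5.0 True
```

Both conditions must hold: a large fold change with an FDR above the cutoff is not called differential, and neither is a significant but small change.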

Promoter search using a Sliding Window approach

In order to identify further σH-dependent promoters that may not result in differential expression of the actual gene due to the coexistence of σA-dependent promoters regulating a given gene, a sliding window approach was implemented. Three different window sizes and window sliding values were used (window sizes = 26, 51 and 102 nt; window sliding values = 13, 25 and 50 nt, respectively), which all resulted in the identification of the same promoters. However, the combination of window size = 51 nt and window sliding = 25 nt resulted in a lower number of fragments that were not preceded by upstream promoters (which was considered a “false positive” finding); a window size of 51 nt yielded 3 false positive fragments as compared to 18 and 10 with window sizes of 26 and 102 nt, respectively. Therefore, results obtained with window size = 51 nt and window sliding = 25 nt are presented here. Briefly, the 10403S genome was divided into windows of 51 nt (window size) with 25 nt overlap (window sliding) and the RNA-Seq coverage was obtained for each of the 116,123 resulting windows. RNA-Seq coverage was obtained as described above and the coverage per nt was used to obtain the total coverage of each window (e.g., the coverage of the window ranging from nt 1 to nt 51 is the sum of the coverage of each nt between nt 1 and nt 51). baySeq version 2.2.0 was then used, as described above, to identify windows with significant differential expression between the ΔBCHL::P rha -sigH strain and the ΔBCHL::P rha strain. Fold change was also calculated as described above and windows presenting an FDR < 0.05 and FC > 2.0 were considered for further analysis. Windows matching the required thresholds were then sorted and overlapping windows were considered part of the same fragment. Fragments that were mapped to the same gene or operon were considered as being part of the same transcriptional unit (TU).
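The windowing and fragment-merging steps can be sketched as follows. This is an illustration under stated assumptions, not the original implementation: "window sliding" is interpreted as the offset between consecutive window starts (25 nt), which for a genome of roughly 2.9 Mb gives a window count close to the 116,123 reported; a trailing partial window is dropped; and the chromosome is treated as linear. The function names and toy inputs are hypothetical.

```python
def window_sums(coverage, size=51, step=25):
    """Return (start, end, total) per window; start/end are 1-based, inclusive."""
    windows = []
    # Assumption: only full-length windows are kept (no partial last window).
    for start in range(0, len(coverage) - size + 1, step):
        total = sum(coverage[start:start + size])
        windows.append((start + 1, start + size, total))
    return windows

def merge_overlapping(significant_windows):
    """Merge overlapping (start, end) windows into contiguous fragments."""
    fragments = []
    for start, end in sorted(significant_windows):
        if fragments and start <= fragments[-1][1]:
            # Overlaps the previous fragment: extend it.
            fragments[-1][1] = max(fragments[-1][1], end)
        else:
            fragments.append([start, end])
    return [tuple(fragment) for fragment in fragments]

toy_windows = window_sums([1] * 120)
print(toy_windows)  # [(1, 51, 51), (26, 76, 51), (51, 101, 51)]
toy_fragments = merge_overlapping([(26, 76), (1, 51), (151, 201)])
print(toy_fragments)  # [(1, 76), (151, 201)]
```

In a full analysis, the window totals for each replicate would be passed to the differential expression step, and only the coordinates of windows passing the FDR and FC thresholds would be merged.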
The 5’ ends of significant TUs were then manually scanned for promoter sequences located 5 to 30 nt upstream of the identified transcriptional start sites using the genome browser Artemis to identify σH-dependent promoters based on the consensus sequence (A/G)(A/C)AGG(A/G)(A/T)(A/T)(A/T) – N11-12nt – (A/G)(A/G)(A/C)GAA(A/T) [9] (Additional file 3: Table S3).
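The consensus above translates directly into a regular expression. The sketch below is an automated stand-in for the manual scan described here, not the procedure that was used: it reports every position on the given strand that matches the consensus exactly, does not restrict hits to the 5 to 30 nt upstream of a transcriptional start site, and does not tolerate mismatches. The pattern name, the function name and the test sequence are hypothetical.

```python
import re

# -35 element, 11-12 nt spacer, -10 element; lookahead allows overlapping hits.
SIGH_CONSENSUS = re.compile(
    r"(?=([AG][AC]AGG[AG][AT]{3}[ACGT]{11,12}[AG]{2}[AC]GAA[AT]))"
)

def find_sigh_like_sites(sequence):
    """Return (1-based start, matched sequence) for each exact consensus match."""
    sequence = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in SIGH_CONSENSUS.finditer(sequence)]

# Hypothetical sequence: flank + -35 element + 11-nt spacer + -10 element + flank.
toy_sequence = "TTT" + "GAAGGAATT" + "ACGTACGTACG" + "AGAGAAT" + "CCC"
hits = find_sigh_like_sites(toy_sequence)
print(hits)  # [(4, 'GAAGGAATTACGTACGTACGAGAGAAT')]
```

To scan the opposite strand, the same function would be applied to the reverse complement of the region of interest.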

Identification and characterization of σH-dependent promoters in additional L. monocytogenes and Listeria spp. genomes

A total of 28 finished genome sequences representing 10 L. monocytogenes lineage I strains, 10 L. monocytogenes lineage II strains (including 10403S), 4 L. monocytogenes lineage III strains, 1 L. innocua strain, 1 L. ivanovii strain, 1 L. welshimeri strain and 1 L. seeligeri strain were retrieved from the NCBI GenBank database (Additional file 2: Table S2). Standalone BLAST searches using the σH-dependent promoter sequences or their respective regulated genes as queries were carried out against two databases containing (i) all 24 L. monocytogenes genomes (L. monocytogenes database) and (ii) all 4 non-L. monocytogenes genomes (Listeria spp. database). BLAST searches against both databases were carried out with the following parameters: “Expectation Value (E)” set to 0.2, “Word Size” set to 8, and “Filter Query Sequence (DUST)” set to FALSE. For searches against the L. monocytogenes database, the “Penalty for Nucleotide Mismatch” was set to −4 while the “Reward for a Nucleotide Match” was set to 5. For searches against the Listeria spp. database, the mismatch penalty was set to −2 and the reward for a match was set to 2 to allow for matches against more divergent sequences. Matches against the query sequences were parsed from the output using in-house Perl scripts. Only the best match against each query was retrieved and sequences were aligned using standalone ClustalW [36]. Two or more equally best matches were not obtained in any of the BLAST searches. BLAST results and alignments are available and will be provided on request. Sequence logos were created using the WebLogo generator available online at [37].
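The "best match per query" step can be illustrated with a short sketch. The original parsing was done with in-house Perl scripts that are not shown, so what follows is only an assumed equivalent: it expects BLAST tabular output with the 12 standard columns (query, subject, percent identity, alignment length, mismatches, gap openings, query start, query end, subject start, subject end, E-value, bit score) and ranks hits by bit score. The function name, query names and hit lines are hypothetical.

```python
def best_hits(tabular_lines):
    """Keep the highest-bit-score hit per query from 12-column tabular lines."""
    best = {}
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank and comment lines
        fields = line.rstrip("\n").split("\t")
        query, subject, bitscore = fields[0], fields[1], float(fields[11])
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return best

# Hypothetical hits for illustration only.
toy_hits = [
    "P_comGA\tgenomeA\t100.0\t40\t0\t0\t1\t40\t500\t539\t1e-10\t80.0",
    "P_comGA\tgenomeA\t90.0\t40\t4\t0\t1\t40\t900\t939\t1e-04\t52.0",
    "P_coiA\tgenomeA\t97.5\t40\t1\t0\t1\t40\t1200\t1239\t1e-09\t74.0",
]
top = best_hits(toy_hits)
print(top)
```

When one database holds several genomes, keying the dictionary on the (query, subject) pair instead of the query alone would retain one best hit per genome.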

Availability of supporting data

RNA-seq data have been deposited in NCBI's Gene Expression Omnibus and are accessible through GEO Series accession number GSE73008 (



Abbreviations

BHI: Brain Heart Infusion
BWA: Burrows-Wheeler Aligner
FC: fold change
FDR: False Discovery Rate
NEB: New England Biolabs
NRC: normalized RNA-seq coverage
ORF: open reading frame
RNA-seq: RNA sequencing
TUs: transcription units
UTR: untranslated region


References

1. Feklistov A, Sharon BD, Darst SA, Gross CA. Bacterial sigma factors: a historical, structural, and genomic perspective. Annu Rev Microbiol. 2014;68:357–76.
2. Mujahid S, Orsi RH, Vangay P, Boor KJ, Wiedmann M. Refinement of the Listeria monocytogenes sigmaB regulon through quantitative proteomic analysis. Microbiology. 2013;159(Pt 6):1109–19.
3. Kazmierczak MJ, Mithoe SC, Boor KJ, Wiedmann M. Listeria monocytogenes sigma B regulates stress response and virulence functions. J Bacteriol. 2003;185(19):5722–34.
4. Palmer ME, Chaturongakul S, Wiedmann M, Boor KJ. The Listeria monocytogenes sigmaB regulon and its virulence-associated functions are inhibited by a small molecule. MBio. 2011;2(6). doi: 10.1128/mBio.00241-11.
5. Grossman AD. Genetic networks controlling the initiation of sporulation and the development of genetic competence in Bacillus subtilis. Annu Rev Genet. 1995;29:477–508.
6. Britton RA, Eichenberger P, Gonzalez-Pastor JE, Fawcett P, Monson R, Losick R, Grossman AD. Genome-wide analysis of the stationary-phase sigma factor (sigma-H) regulon of Bacillus subtilis. J Bacteriol. 2002;184(17):4881–90.
7. Chaturongakul S, Raengpradub S, Wiedmann M, Boor KJ. Modulation of stress and virulence in Listeria monocytogenes. Trends Microbiol. 2008;16(8):388–96.
8. Rea RB, Gahan CG, Hill C. Disruption of putative regulatory loci in Listeria monocytogenes demonstrates a significant role for Fur and PerR in virulence. Infect Immun. 2004;72(2):717–27.
9. Chaturongakul S, Raengpradub S, Palmer ME, Bergholz TM, Orsi RH, Hu Y, Ollinger J, Wiedmann M, Boor KJ. Transcriptomic and phenotypic analyses identify coregulated, overlapping regulons among PrfA, CtsR, HrcA, and the alternative sigma factors sigmaB, sigmaC, sigmaH, and sigmaL in Listeria monocytogenes. Appl Environ Microbiol. 2011;77(1):187–200.
10. Shell SS, Wang J, Lapierre P, Mir M, Chase MR, Pyle MM, Gawande R, Ahmad R, Sarracino DA, Ioerger TR, et al. Leaderless transcripts and small proteins are common features of the mycobacterial translational landscape. PLoS Genet. 2015;11(11):e1005641.
11. Toledo-Arana A, Dussurget O, Nikitas G, Sesto N, Guet-Revillet H, Balestrino D, Loh E, Gripenland J, Tiensuu T, Vaitkevicius K, et al. The Listeria transcriptional landscape from saprophytism to virulence. Nature. 2009;459(7249):950–6.
12. Oliver HF, Orsi RH, Ponnala L, Keich U, Wang W, Sun Q, Cartinhour SW, Filiatrault MJ, Wiedmann M, Boor KJ. Deep RNA sequencing of L. monocytogenes reveals overlapping and extensive stationary phase and sigma B-dependent transcriptomes, including multiple highly transcribed noncoding RNAs. BMC Genomics. 2009;10:641.
13. Taniguchi H, Wendisch VF. Exploring the role of sigma factor gene expression on production by Corynebacterium glutamicum: sigma factor H and FMN as example. Front Microbiol. 2015;6:740.
14. Leang C, Krushkal J, Ueki T, Puljic M, Sun J, Juarez K, Nunez C, Reguera G, DiDonato R, Postier B, et al. Genome-wide analysis of the RpoN regulon in Geobacter sulfurreducens. BMC Genomics. 2009;10:331.
15. Morikawa K, Inose Y, Okamura H, Maruyama A, Hayashi H, Takeyasu K, Ohta T. A new staphylococcal sigma factor in the conserved gene cassette: functional significance and implication for the evolutionary processes. Genes Cells. 2003;8(8):699–712.
16. Morikawa K, Takemura AJ, Inose Y, Tsai M, Nguyen Thi le T, Ohta T, Msadek T. Expression of a cryptic secondary sigma factor gene unveils natural competence for DNA transformation in Staphylococcus aureus. PLoS Pathog. 2012;8(11):e1003003.
17. Chung YS, Breidt F, Dubnau D. Cell surface localization and processing of the ComG proteins, required for DNA binding during transformation of Bacillus subtilis. Mol Microbiol. 1998;29(3):905–13.
18. Chung YS, Dubnau D. All seven comG open reading frames are required for DNA binding during transformation of competent Bacillus subtilis. J Bacteriol. 1998;180(1):41–5.
19. Provvedi R, Dubnau D. ComEA is a DNA receptor for transformation of competent Bacillus subtilis. Mol Microbiol. 1999;31(1):271–80.
20. Kramer N, Hahn J, Dubnau D. Multiple interactions among the competence proteins of Bacillus subtilis. Mol Microbiol. 2007;65(2):454–64.
21. Charpentier X, Polard P, Claverys JP. Induction of competence for genetic transformation by antibiotics: convergent evolution of stress responses in distant bacterial species lacking SOS? Curr Opin Microbiol. 2012;15(5):570–6.
22. Rabinovich L, Sigal N, Borovok I, Nir-Paz R, Herskovits AA. Prophage excision activates Listeria competence genes that promote phagosomal escape and virulence. Cell. 2012;150(4):792–802.
23. Johnston C, Martin B, Fichant G, Polard P, Claverys JP. Bacterial transformation: distribution, shared mechanisms and divergent control. Nat Rev Microbiol. 2014;12(3):181–96.
24. Loessner MJ, Inman RB, Lauer P, Calendar R. Complete nucleotide sequence, molecular analysis and genome structure of bacteriophage A118 of Listeria monocytogenes: implications for phage evolution. Mol Microbiol. 2000;35(2):324–40.
25. Borezee E, Msadek T, Durant L, Berche P. Identification in Listeria monocytogenes of MecA, a homologue of the Bacillus subtilis competence regulatory protein. J Bacteriol. 2000;182(20):5931–4.
26. Nambu T, Minamino T, Macnab RM, Kutsukake K. Peptidoglycan-hydrolyzing activity of the FlgJ protein, essential for flagellar rod formation in Salmonella typhimurium. J Bacteriol. 1999;181(5):1555–61.
27. Singh VK, Vaish M, Johansson TR, Baum KR, Ring RP, Singh S, Shukla SK, Moskovitz J. Significance of four methionine sulfoxide reductases in Staphylococcus aureus. PLoS One. 2015;10(2):e0117594.
28. Orsi RH, Maron SB, Nightingale KK, Jerome M, Tabor H, Wiedmann M. Lineage specific recombination and positive selection in coding and intragenic regions contributed to evolution of the main Listeria monocytogenes virulence gene cluster. Infect Genet Evol. 2008;8(5):566–76.
29. Mujahid S, Orsi RH, Boor KJ, Wiedmann M. Protein level identification of the Listeria monocytogenes sigma H, sigma L, and sigma C regulons. BMC Microbiol. 2013;13:156.
30. Fieseler L, Schmitter S, Teiserskas J, Loessner MJ. Rhamnose-inducible gene expression in Listeria monocytogenes. PLoS One. 2012;7(8):e43444.
31. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) Method. Methods. 2001;25(4):402–8.
32. Tang S, Orsi RH, den Bakker HC, Wiedmann M, Boor KJ, Bergholz TM. Transcriptomic analysis of the adaptation of Listeria monocytogenes to growth on vacuum-packed cold smoked salmon. Appl Environ Microbiol. 2015;81(19):6812–24.
33. Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26(5):589–95.
34. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.
35. Hardcastle TJ, Kelly KA. baySeq: empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics. 2010;11:422.
36. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947–8.
37. Crooks GE, Hon G, Chandonia JM, Brenner SE. WebLogo: a sequence logo generator. Genome Res. 2004;14(6):1188–90.



We thank Dr. Martin J. Loessner at ETH Zurich for the kind gift of plasmid pLF1. This project was supported by a grant from the National Institute of Allergy and Infectious Diseases (NIAID) of the National Institutes of Health (NIH) (2 RO1 AI052151-05A1).

Author information

Correspondence to Veronica Guariglia-Oropeza.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

YL performed the RNA-seq experiments and initial data analyses; RHO performed sliding window analysis of RNA-seq data and promoter identification; KJB, MW and VGO conceived the study and co-wrote the manuscript. All authors read and approved the final manuscript.

Additional files

Additional file 1: Table S1.

Genes identified as differentially expressed between L. monocytogenes overexpressing sigH (10403S::ΔsigBCHL Prha-sigH) and a ΔsigBCHL control strain, based on the RNA-seq coverage data calculated for complete ORFs. (DOCX 15 kb)

Additional file 2: Table S2.

Listeria genomes used for comparative analysis of SigmaH-dependent promoters. (XLSX 10 kb)

Additional file 3: Table S3.

Significant fragments found with the sliding window approach. (XLSX 25 kb)

Rights and permissions

Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated.


About this article


Cite this article

Liu, Y., Orsi, R.H., Boor, K.J. et al. An advanced bioinformatics approach for analyzing RNA-seq data reveals sigma H-dependent regulation of competence genes in Listeria monocytogenes. BMC Genomics 17, 115 (2016). doi:10.1186/s12864-016-2432-9



  • RNA-seq
  • Listeria monocytogenes
  • Sigma H
  • Competence