- Research article
- Open Access
Genome analysis and in vivo virulence of porcine extraintestinal pathogenic Escherichia coli strain PCN033
BMC Genomics volume 16, Article number: 717 (2015)
Strains of extraintestinal pathogenic Escherichia coli (ExPEC) can invade and colonize extraintestinal sites and cause a wide range of infections. Genomic analysis of ExPEC has mainly focused on isolates of human and avian origins, with porcine ExPEC isolates yet to be sequenced. To better understand the genomic attributes underlying the pathogenicity of porcine ExPEC, we isolated two E. coli strains PCN033 and PCN061 from pigs, assessed their in vivo virulence, and completed and compared their genomes.
Animal experiments demonstrated that strain PCN033, but not PCN061, was pathogenic in a pig model. The chromosome of PCN033 was 384 kb larger than that of PCN061. Among the PCN033-specific sequences, genes encoding adhesins, unique lipopolysaccharide, unique capsular polysaccharide, iron acquisition and transport systems, and metabolism were identified. Additionally, a large plasmid PCN033p3 harboring many typical ExPEC virulence factors was identified in PCN033. Based on the genetic variation between PCN033 and PCN061, corresponding phenotypic differences in flagellum-dependent swarming motility and metabolism were verified. Furthermore, the comparative genomic analyses showed that the PCN033 genome shared many similarities with genomic sequences of human ExPEC strains. Additionally, comparison of PCN033 genome with other nine characteristic E. coli genomes revealed 425 PCN033-special coding sequences. Genes of this subset included those encoding type I restriction-modification (R-M) system, type VI secretion system (T6SS) and membrane-associated proteins.
The genetic and phenotypic differences between PCN033 and PCN061 could partially explain their differences in virulence, and also provide insight towards the molecular mechanisms of porcine ExPEC infections. Additionally, the similarities between the genomes of PCN033 and human ExPEC strains suggest that some connections between porcine and human ExPEC strains exist. The first completed genomic sequence for porcine ExPEC and the genomic differences identified by comparative analyses provide a baseline understanding of porcine ExPEC genetics and lay the foundation for their further study.
Escherichia coli is a well-known prokaryotic organism that can exist both as a harmless intestinal inhabitant and a deadly pathogen . Pathogenic E. coli can be divided to intestinal pathogenic E. coli (IPEC) and ExPEC . ExPEC strains harbor diverse genomes and a wide range of virulence factors (VFs) that collectively confer the ability to colonize and invade extraintestinal sites, causing infections of the urinary tract, meningitis, pneumonia, osteomyelitis, and surgical site infections . ExPEC strains mainly include uropathogenic E. coli (UPEC), newborn meningitis-causing E. coli (NMEC), and avian pathogenic E. coli (APEC) . Infections caused by ExPEC strains occur worldwide and incur great economic cost .
The major VFs present in ExPEC strains are distinct from those found in IPEC strains. Characteristic ExPEC VFs include various adhesins (P and type I fimbriae), structural components of the bacterial outer membrane (capsule, lipopolysaccharide), iron acquisition and utilization systems (aerobactin and salmochelin siderophores), toxins (hemolysin, cytotoxic necrosis factor) and secretion systems [4, 5]. These VFs are frequently encoded on mobile genetic elements in unique organization . For example, possession of ColV plasmids are a defining trait of APEC and these plasmids are key players in APEC pathogenesis .
Some virulence-associated genes are shared between ExPEC independent of host species, suggesting that ExPEC may be zoonotically acquired . Previous studies have shown that some APEC strains shared similarities with human ExPEC strains, and a foodborne link between the two may exist . To date, twelve and two complete genomic sequences of human ExPEC and APEC strains are respectively available in GenBank (Additional file 1: Table S1); however, no porcine ExPEC have been sequenced yet. Recently, ExPEC infections have become epidemic in the Chinese pig industry, and these ExPEC could represent a public health threat . Therefore, a better understanding of porcine-source ExPEC genomics is needed to improve our understanding of its mechanisms of disease. In this study, PCN033 was chosen for study as its serotype O11 and phylogenetic group D were characteristic of porcine ExPEC strains [9–11]. E. coli strain PCN061, which was also isolated from extraintestinal site of diseased pig, was used to perform comparative genomic analyses with PCN033. Pig experiments were then conducted to compare the pathogenicity of these strains in an extraintestinal disease model. Additionally, comparative genomic analysis was conducted among PCN033 and other nine characteristic E. coli strains to identify genomic differences that may explain the ability of PCN033 to cause disease in the pig host.
Result and discussion
Choice of strain
Our lab previously performed an epidemiological analysis of porcine ExPEC in China [9, 11]. Among the porcine E. coli isolates, PCN033 was selected  as it was defined as a porcine ExPEC strain follow the criterion by Johnson et al.  and Ding et al. . PCN033 contained two of the ExPEC virulence markers, kpsMTII and iutA. Furthermore, it belonged to O11 serogroup which is one of the most prevalent among porcine ExPEC in China . Additionally, PCN033 belongs to phylogenetic group D (Fig. 1). Porcine ExPEC are mostly fell in phylogenetic groups A, B1 and D . More importantly, PCN033 presented as a highly virulent ExPEC strain in previous virulence assessment in the mouse model [9, 13]. In sum, PCN033 possessed characteristic traits of porcine ExPEC. Another porcine E. coli isolate PCN061 was chosen to perform comparative genomic analyses with PCN033 as it was also isolated from extraintestinal site of pig but contained none of the ExPEC virulence markers. PCN061 belongs to phylogenetic group A which is representaive of porcine E. coli strains. It was an O9 strain.
Four of the piglets in PCN033 group showed severe clinical signs, such as lying on the side, abdominal breathing, shaking, convulsion, lameness etc. and died successively within 5 days post-inoculation. All piglets in PCN061 group survived. Piglets in negative control group were all in good condition until euthanasia (Table 1). Meanwhile, our bacterial recovery test showed that the inoculated bacteria were re-isolated from blood samples of PCN033 group, however in PCN061 and negative control group, bacteria were not recovered. The test showed that PCN033 could cause pathological conditions and even death in pig model.
General features of PCN033 and PCN061 genomes
We sequenced the genomes of PCN033 and PCN061 using a Roche 454 GS-FLX sequencer. Sequencing of PCN033 and PCN061 generated 30-fold and 33-fold coverage of reads, and produced 113 and 148 large contigs (>500 bp), respectively. The gaps between large contigs were closed by PCR-based sequencing. PCN033 contained a circular chromosome of 4,987,958 bp (Fig. 2a) with an average G + C content of 50.7 %, and plasmids PCN033p1, PCN033p2 and PCN033p3 (Table 2). The PCN033 chromosome was predicted to harbor seven rRNA operons including a duplicate 5S rRNA gene, similar to that in other E. coli strains , and 85 tRNA genes. We observed 4,838 predicted coding sequences (CDSs) with an average length of 905 bp (Table 2) covering about 89.0 % of the PCN033 chromosome. Among all the protein-coding genes located on the chromosome, approximately 23.8 % (1152/4838) possessed no clear biological function, with 30 genes unique to the PCN033 genome. PCN061 contained a circular chromosome of 4,603,777 bp (Fig. 2b) with an average G + C content of 50.8 %, and plasmids PCN061p1, PCN061p2, PCN061p3, PCN061p4, PCN061p5, and PCN061p6 (Table 2). The PCN061 chromosome was predicted to harbor an extra tRNA gene and share 83 tRNAs compared with the PCN033 chromosome. In total, 4,432 CDSs with an average length of 883 bp covered about 88.0 % of the PCN061 chromosome (Table 2). Among all the protein-coding genes located on the chromosome, approximately 20.3 % (899/4432) genes were hypothetical proteins, with 40 genes unique to the PCN061 genome.
A phylogenetic tree was generated through sequence concatenation of 325 genes that were conserved in all 56 E. coli strains (Fig. 1; Additional file 1: Table S1; Additional file 2: Table S2). PCN033 was assigned to group D and clustered with enteroaggregative E. coli 042 and ExPEC UMN026, while PCN061 was assigned to group A and clustered with commensal E. coli HS. Among these E. coli, pathogenic strains mainly belonged to groups B2 and D, whereas the nonpathogenic strains mainly belonged to groups A and B1 [15, 16]. The phylogenetic positions of Shigella strains were mixed with E. coli strains (Fig. 1), which is in agreement with the phylogenetic tree produced by Zhang and Lin . Our findings support previous reports which suggested that Shigella strains should be included within the genus Escherichia [15, 18].
The PCN033 genome is 384,180 bp longer than that for PCN061. The MAUVE alignment of PCN033 and PCN061 at the nucleotide level identified eight Locally Co-linear Blocks (Fig. 3a). Approximately half of the PCN033 special regions were prophage-containing genomic islands (Fig. 3a). Besides prophage-associated genes, a number of VFs and metabolism-associated genes are located in the special regions of PCN033; these include genes for fimbrial biosynthesis, adhesin, O-antigen polysaccharide synthesis, capsular polysaccharide synthesis, type II secretion system (T2SS), type III secretion system (T3SS), T6SS, the type I R-M system, and the paa operon involved in glycolate metabolism. Those regions that likely represent the genetic basis for the virulence differences between PCN033 and PCN061 are further discussed below.
Comparison of PCN033 with other ExPEC and commensal E. coli strains
We chose other nine E. coli strains for comparative genomic analysis with PCN033: two commensal E. coli strains, K-12 strains MG1655 [GenBank: NC_000913]  and W3110 [GenBank: NC_007779] ; avian ExPEC strain, APECO1 [GenBank: NC_008563] ; six human ExPEC strains, including three UPEC strains, CFT073 [GenBank: NC_004431] , UTI89 [GenBank: NC_007946]  and UMN026 [GenBank: NC_011751.1] , and three NMEC strains, S88 [GenBank: NC_011742] , CE10 [GenBank: NC_017646]  and IHE3034 [GenBank: NC_017628] . These nine E. coli strains are representative of human and avian ExPEC and commensal E. coli strains. The main features of the above ten E. coli strains are shown in Table 3. Among the ExPEC strains, PCN033 has the smallest chromosome (Table 3). We used ANI value to analyse the pairwise similarities between PCN033 and each of the nine genomes  (Table 3). The result showed that PCN033 genome shared most similarities with human UPEC strain UMN026, followed by human MNEC strain CE10 and commensal E. coli strains, MG1655 and W3110. Interestingly, PCN033 shared least whole-genome sequence similarities with APEC O1. This supports the notion that some porcine ExPEC strains share more similarity with human ExPEC than avian ExPEC, and suggests a possible relationship between some human and porcine ExPEC. However, additional work is needed to confirm or reject this hypothesis.
To identify genomic features specific to PCN033, we initially compared this genome sequence with that of nonpathogenic E. coli strains MG1655 and W3110. The comparison identified 24 genomic islands (GIs) (>9 kb) absent in MG1655 and W3110 genomes (Additional file 3: Table S3). The size of the PCN033 special islands varies from 9 kb to 52 kb and 570 kb in all. Many of the islands harbor genes coding for putative virulence factors, phage components and metabolic enzyme, such as type VI secretion system (island I), lateral flagellar (island II), lipopolysaccharide (island X), type III secretion system (island XIII), polysialic acid capsular polysaccharide (island XIV), heme transporter system (island XVI), invasin (island XVII), adhesin (island XXII), prophage (island V-VII, XI, XII, XIX) and metabolic enzyme to break down hydroxyphenylacetic acid (hpa operons , island XXIV). All-against-all BLASTP comparison of all proteins among PCN033 and other nine E. coli strains mentioned above showed that PCN033 had 425 special CDSs (Additional file 4: Table S4, Fig. 4), 68.5 % of which encoded hypothetical proteins. Additionally, PCN033-specific CDSs included genes encoding a type I RM system, type VI secretion system, adhesins, membrane proteins, prophages and O-antigen (Additional file 4: Table S4). Those regions may contribute to this strain’s ability to cause extraintestinal infections and even death in pigs. However, their roles in the pathogenicity of porcine ExPEC will require further study and these special loci in PCN033 should be verified in a larger population of pig ExPEC strains.
Mobile genetic elements
Most ExPEC VFs that encode fimbriae or adhesins, and confer utilization of alternative nutrients, or resistance to biological stresses are clustered together on GIs known as pathogenicity islands (PAIs) [28, 29]. Using Island Viewer, PCN033 contains a total of 30 GIs (P33GI1–30; Additional file 5: Table S5). Among these, P33GI2, P33GI18 and P33GI19 contain T6SS, T3SS and T2SS, respectively. Additionally, P33GI1, P33GI3, P33GI8, P33GI27 and P33GI30 contain genes for fimbrial biosynthesis, along with P33GI19, P33GI21, P33GI22, and P33GI30, which are GIs containing genes that encode polysaccharide and lipopolysaccharide biosynthesis enzymes, and invasion and type I R-M enzymes. These regions constitute the major PAIs of PCN033. P33GI12 encodes mannose metabolism enzymes, contributing to alternative utilization of nutrients. P33GI14, P33GI23 and P33GI25 include genes associated with antibiotic resistance. In contrast with PCN033, PCN061contains 11 GIs (P61GI1–11; Additional file 5: Table S5). The T2SS is included within P61GI9, while P61GI2 contains tetracycline resistance-encoding genes. The genome of PCN061 contains two prophage regions (P61GI1, P61GI4).
In addition to the known chromosomally encoded VFs in human ExPECs and APEC [30, 31], many ExPEC VFs are also carried on plasmids [6, 32]. PCN033 harbored three plasmids (Table 2), including the largest (PCN033p3), which contained four replicons. Blast analysis demonstrated that the replicon regions belonged to Incompatibility (Inc) types Q1, FII, FIC and FIB, respectively . Plasmid PCN033p3 carried 241 predicted CDSs, among which 28.2 % (68/241) encoded hypothetical proteins. This plasmid also carried the genes encoding colicin V (ColV) (cvaA to cvaC, cvi), a representative characteristic of ExPEC ; a “constant” region of ColV plasmids which includes the RepFIB replicon [35, 36]; three different iron uptake and utilization systems (aerobactin, salmochelin, and the sitABCD genes) [37, 38]; an outer membrane protease-encoding gene ompT ; a novel ABC transport system, etsABC ; a hemolysin-encoding gene hlyF ; and the increased serum survival gene iss . Additionally, plasmid PCN033p3 also contained numerous resistance genes, including those encoding resistance to mercury (merR to merE), camphor (crcB), trimethoprim (dfrA17), kanamycin (aph), streptomycin (strAB), sulfonamide (sul2), bleomycin (ble), β-lactams (bla TEM1), tetracycline (tetB), chloramphenicol (cat) and olaquindox (oqxAB) (Fig. 5) . Many of these resistance-related genes were flanked by insertion sequence (IS) elements; in particular, cat (PCN033p3_223) was inserted via an IS6 element (Fig. 5). This plasmid contained a conjugal transfer region of 41 genes (traM to finO), that spanned 33,784 bp. The F-like transfer region shares high similarity with that of pECOS88 [GenBank: NC_011747] which is proved to own conjugal transfer ability via conjugations experiments (Fig. 6) . Besides the conjugal transfer region, several gene blocks of PCN033p3 sequence are also highly homologous to pECOS88, such as the virulence region including colicin V operon, iron uptake systems (iro, sit and iuc loci), iss, etsABC, ompT and hlyF (Fig. 6). As shown in Fig. 6, between the locus tra and the virulence region in PCN033p3 lies a region containing mostly resistance-associated genes, such as sul2, strAB, mer operon, tetB, bla TEM1 and oqxAB, and genes involved in plasmid stability, tir, encoding transfer inhibition protein; pemIK, coding for stable plasmid inheritance proteins. This region was absent in pECOS88 (Fig. 6). Plasmid pECOS88, harbored by E. coli strain S88 that cause neonatal meningitis, is a key virulence determinant, as it is involved in the ability of S88 to induce high level of bacteremia . The other two PCN033 plasmids only have five unknown functional CDSs respectively. PCN061 possesses six plasmids (Table 2). Blast analysis showed that PCN061p5 belonged to Inc group IncI1, and PCN061p6 was a mosaic plasmid belonging to Inc types IncN, IncFIB and IncX1 . PCN061 also contained several resistance determinants distributed among its plasmids. Plasmid PCN061p3 harbored the resistance genes sul and strAB. Plasmid PCN061p5 contained the gene floR, which encodes resistance to florfenicol/chloramphenicol. Resistance determinants for silver and other heavy metals (silESRCFBAP), aminoglycosides (aph, aadA4 and aac3IIa), β-lactams (bla TEM1), and olaquindox (oqxAB) were located in PCN061p6. The sil gene cluster was similar to that found in pAPEC-O2-R [GenBank : NC_ 006671.1] . Additionally, PCN061p5 encodes the colicin-Ib. Compared with PCN061 plasmids, PCN033 plasmids apparently harbor a greater number of virulence and resistance determinants.
Other putative virulence factors
E. coli adhesins are important for determining the bacterium to bind to and colonize the host. Dozens of fimbrial adhesin genes are located in the PCN033 GIs; Yeh-like, Ecp, F9, K99 and type I fimbrial biosynthesis related gene clusters are located on P33GI1, 3, 8, 27 and 30, respectively. In contrast, P61GIs contain no fimbrial adhesin genes, suggesting that the fimbrial adhesins of PCN033 were acquired to adapt towards enhanced colonization and fitness. The pathways involved in assembling fimbriae comprise the chaperone-usher (CU) pathway, the type IV secretion pathway and the extracellular nucleation precipitation pathway . The CU fimbriae found in PCN033 and PCN061 are classified according to Wurpel et al.  and Nuccio and Baumler . In the genome of PCN033 and PCN061, gene clusters associated with fimbrial biosynthesis were listed in Table 4. Accordingly, fimbriae were observed in both PCN033 and PCN061 (Fig. 7a). The primary structure of type 3 fimbriae in PCN061p6 shares 93–100 % similarity with that in pMAS2027 [GenBank: NC_013503] and pOLA52 [GenBank: NC_010378]. The expression of type 3 fimbriae in pMAS2027 and pOLA52 determines their biofilm formation [47, 48]. The biofilm formation assay showed that both PCN033 and PCN061 strains were biofilm producers, and that PCN061 exhibited a higher degree of biofilm formation than PCN033 (Fig. 7b).
Lipopolysaccharides (LPS) biosynthesis
As an essential structural component of the Gram-negative bacterial outer membrane, LPSs are considered virulence determinants, and help bacteria resist environmental stress. The result of serotyping test presented that PCN033 and PCN061 have an O11 and an O9 type O-antigen, respectively . Previous epidemiological studies of 81 porcine ExPEC isolates in central China showed that O11 is one of the predominant serogroups of porcine ExPECs . PCN033 contains the O-antigen synthesis-related gene cluster, which resembles that for E. coli O11 G1207 [GenBank: HQ388393] (Fig. 3b) . In PCN061, part of its O-antigen cluster is identical with that in the O9 rbf gene cluster of E. coli F719 [GenBank: D43637] (Fig. 3b) [50, 51]. The genetic analyses match the result of serotyping test.
Capsular polysaccharide (CPS)
Surface-associated CPSs are involved in protecting microorganisms from desiccation, promoting adhesion, and resisting the host immune system [52, 53]. Capsules of E. coli were separated into four different groups . Sequence alignment showed PCN033 capsule belongs to group 2. However, the PCN033 capsular region also encodes four hypothetical proteins and a hydrolase that show no similarity with products in other known group 2 capsule regions. E. coli K1 which also belong to group 2 capsule had the ability to invade brain microvascular endothelial cells . In PCN061, a single gene (PCN061_1020) involved in CPS biosynthesis was found; this gene encodes a protein with a capsule biosynthesis GfcC domain (Pfam: PF06251) which plays a role in group 4 capsule biosynthesis .
Iron acquisition and utilization
Iron transport systems are known to play an important part in bacterial pathogenesis [57, 58]. For PCN033 and PCN061, gene clusters involved in hydroxamate, enterobactin and ferric enterobactin-mediated iron uptake systems have been revealed (Fig. 8). Additionally, genes for ferrochelatase (HemH), ferritin proteins, ferric uptake regulator (Fur), ferric reductase, ferrous iron transporter (EfeUOB) and TonB systems also exist in the two strains, suggesting they have the capacity to maintain iron homeostasis allowing them to inhabit a host. Compared with PCN061, PCN033 harbors four extra iron uptake systems including a chromosome-encoded heme transporter system (Chu) and three plasmid-associated siderophore iron uptake and ABC iron transport systems (aerobactin, salmochelin, and the sitABCD genes).
Secretion systems are necessary for the transport of proteins across the cell envelope, mediating interactions between bacteria, their hosts, and the surrounding microenvironment . Differences between PCN033 and PCN061 secretion systems are mainly focused upon the presence of T3SSs and T6SSs (Fig. 8).
T3SSs are essential components of two complicate bacterial apparatuses: the flagellum, the motility equipment, and the non-flagellar T3SSs (NF-T3SSs) which deliver effectors into the cytosol of host cells . According to previous reports, two flagella systems exist in E. coli . The Flag-1 flagellum cluster is associated with peritrichous (lateral) flagellum biosynthesis and identified in PCN033 and PCN061. In PCN061, flhA and flhD which encode a peritrichous flagellum biosynthesis protein and transcriptional activator of Flag-1 operons respectively, both contain frame-shift mutations . The frame-shift mutation in flhD of Yersinia pestis is associated with its loss of motility , suggesting that the Flag-1 system is not functional in PCN061. In contrast, genes involved in the Flag-1 system are all intact in PCN033. In addition to the conventional Flag-1 cluster, PCN033 contains a Flag-2 gene cluster, which has been found in around 20 % of E. coli strains . This gene cluster was first reported in E. coli O42; however, the lfgC gene that encodes a FlgC-like rod protein in the Flag-2 system contains a frame-shift mutation. Ren et al. suspected that a frame-shift mutation in lfgC probably inactivated the Flag-2 system in E. coli O42 . The lfgC gene in PCN033 does not contain any mutations. Additionally, PCN033 LafK protein in Flag-2 contains a full-length Sigma54 activating domain [Pfam: PF00158]. Furthermore, consensus σ54 sites (TGGCAC-N5-TTGC) are identified upstream of both lfgB and lafB translation start codons. These motifs suggest that the Flag-2 system is functional in PCN033, while only four genes (PCN061_0225–0228) involved in the Flag-2 system are found in PCN061. The results of transmission electron microscopy and swarming motility assays (Fig. 7a, c) showed that PCN033 was indeed able to produce flagella and motile, while PCN061 did not possess flagella. Two NF-T3SSs were found in E. coli strains: E. coli type III secretion system 1 (ETT1); and E. coli type III secretion system 2 (ETT2) . ETT1 was absent in both PCN033 and PCN061, while an ETT2 gene cluster (PCN033_3083–3099) was found in PCN033 and located in P33GI18. This ETT2 gene cluster is highly homologous with that in meningitis-causing E. coli K1 strain EC10 (O7:K1), but contained less deletions and no insertions. EC10 mutants lacking ETT2 and/or eivA exhibited significant defects in invasion and intracellular survival in human brain microvascular endothelial cells (HBMECs) compared with parental strains , suggesting that ETT2 in PCN033 contributes to its pathogenicity. ETT2 is absent from the PCN061 genome.
T6SSs are the most recently identified protein secretion systems of Gram-negative bacteria [67, 68]. Although the gene order is diverse among different organisms, products of conserved genes compose the core elements of T6SS, including effector proteins, hemolysin coregulated protein (Hcp), valine-glycine repeat protein G (VgrG), intracellular multiplication factor (IcmF) and the chaperone ClpB [68, 69]. These core components are all found in PCN033, with IcmF (PCN033_0227), ClpB (PCN033_0230) and VgrG (vgrG1, PCN033_0247) sharing 98.0, 97.8 and 95.8 % identity respectively with the corresponding proteins in E. coli Sakai. PCN033 Hcp1 (hcp1, PCN033_0225) is 100 % identical to Hcp-like proteins in Shigella sonnei Ss046 and E. coli EDL933 strains . PCN033 Hcp2 (hcp2, PCN033_0245) shares 94.2 % identity with the Hcp2 (EvfV) protein of the meningitis-causing E. coli K1 strain RS218. Previous research has shown that Hcp2 of E. coli RS218 contributes to its interaction with HBMECs . Outside of this T6SS gene cluster, we identified a third Hcp (hcp3, PCN033_4200) that shared 32.2 % identity with Hcp1 and another VgrG (vgrG2, PCN033_1587); this Hcp protein exhibited 99 % identity with VgrG1. In contrast, we were unable to identify T6SS-associated genes in PCN061.
Predicted metabolic pathways
Complete sets of genes encoding enzymes necessary for glycolysis and gluconeogenesis, the tricarboxylic acid cycle, and the pentose phosphate and Entner-Doudoroff pathway, were identified in the genomes of PCN033 and PCN061 (Fig. 8). Bacterial uptake of sugars mainly occurs through specific phosphotransferase systems (PTSs) and ATP-binding cassette (ABC) transport systems. In the genomes of PCN061 and PCN033, predicted sets of genes encoding PTSs and ABC transport systems were identified [71–74] (Fig. 8). The CAP (crp, PCN033_3662; PCN061_3439)-cAMP (cyaA, PCN033_4172; PCN061_3937) system regulates the transcription of multiple sugar utilization operons and was found in PCN033 and PCN061 . The Cra (fruR, PCN033_0087; PCN061_0079) transcription factor plays a key role in balancing enzyme levels for carbon metabolism , and was also seen in strains PCN033 and PCN061, suggesting their ability to metabolize those sugars as carbon source. These metabolic characteristics of the two strains were verified by an API 20E test (Additional file 6: Table S6) and other biochemical tests (Fig. 8).
Sucrose is the most abundant disaccharide in most environments. The ability to utilize sucrose as a sole carbon source is a highly variable phenotype among enteric bacteria . This characteristic is dependent upon the presence of the csc regulon comprising three genes for a sucrose symporter (CscB, PCN033_2534), a sucrose hydrolase (CscA, PCN033_2536) and fructokinase (CscK, PCN033_2535), and was first reported in E. coli EC3132. In EC3132, sucrose is transported into cells by a sucrose:H+ symporter encoded by cscB. Jahreis et al. reported that csc genes optimally adapt to new hosts . This csc regulon exists in PCN033 but not in PCN061, and the resulting metabolic difference in sucrose utilization was proved through API 20E tests (Additional file 6: Table S6). D-allose is an analog of D-ribose, a key component of DNA and RNA, that can be converted into fructose-6-phosphate . PCN033 is possibly unable to catabolize allose because of the absence of the alsBACEK regulon . The rbsDACBKR regulon (PCN033_4126–4131) contributes to the metabolism of ribose, and was identified in PCN033. Both regulons were seen in PCN061. The glc gene locus is associated with glycolate utilization in E. coli, and is known to contain glcB (PCN033_3294), which encodes malate synthase G; glcC (PCN033_3299), which encodes the glc regulator protein; glcA (PCN033_3293) encoding a glycolate transporter; and glcDEFG (PCN033_3298–3295), which are required for glycolate oxidase activity . The presence of the glc gene locus in PCN033 suggests that this strain can use glycolate as another source of carbon; PCN061 did not contain this gene locus.
Vieira et al. conducted an analysis of core- and pan-metabolism in 29 E. coli strains, and observed that most commensal strains were able to degrade phenylacetate and phenylethylamine courtesy of the paa transcription unit. This particular transcription unit was absent from IPEC and most ExPECs [27, 81]. However, paa (PCN033_1521–1534) was found in PCN033, which is inconsistent with previous observations. Besides the ability to degrade phenylacetate and phenylethylamine, PCN033 may also possess the ability to break down 3- and 4-hydroxyphenylacetic acids as it contains the hpaCBAXIHFDBGR (PCN033_4854–4863, 4865) gene cluster . PCN061 does not harbor these two gene clusters that are involved in aromatic compound degradation.
The first full genomic analysis of porcine ExPEC strain demonstrates that porcine ExPEC strain PCN033 shared many similarities with human ExPEC strains. Previous studies have shown that some APEC strains share similarities with human ExPEC strains . These indicated that a possible foodborne link between animal and human ExPEC strains exists, and that animal ExPEC isolates may be a reservoir for human ExPEC [7, 8]. Comparative analysis of the virulent porcine ExPEC strain PCN033 and non-pathogenic porcine E. coli strain PCN061 showed that most genomic differences were in the mobile genetic elements of PCN033. Several virulence-associated phenotypic differences in PCN033 and PCN061 strains were verified, and most of the phenotypic differences observed corresponded with genotypic differences. Compared with PCN061, PCN033 has flagella and significantly stronger swarming motility. E. coli motility and the presence of flagella impact biofilm architecture ; however, the gene loci affecting biofilm formation in PCN033 and PCN061 were diverse and numerous, and we could not easily infer biofilm formation ability of the two strains purely by analyzing genomic sequences. According to our experimental results, PCN061 exhibited a higher degree of biofilm formation than PCN033. During the pathogenesis, bacteria require a balance between adherence, colonization and pervasion. The strong motility of PCN033 results in a reduced ability to form biofilms. Strain PCN033 contained more carbon source utilization genes, and was able to metabolize more carbon substrates. Comparison of PCN033 genome with other nine characteristic E. coli genomes revealed 425 PCN033-special CDSs. Genes of interest in this subset including those encoding type I R-M system, T6SS and membrane-associated proteins. Analysis of the T6SS in PCN033 revealed that the core elements of T6SS were intact. These special characteristics of PCN033 are worth further studying. Our genetic and phenotypic analyses of PCN033 have improved our understanding of the pathogenic mechanisms of porcine ExPEC strains and will hopefully lead to the development of new strategies for the prevention and treatment of porcine ExPEC infections.
The infection experiment was performed based on the International Council for Laboratory Animal Science Ethical Guideline for Researchers (1956). The animal experiment was conducted in Wuhan keqian Animal Biological Products Co., Ltd which obtained the experimental animal use license set by Science and Technology Department of Hubei province. The use license No. of our animal experiment is 00132619.
Strains and culture
E. coli strain PCN033 was isolated from the brain of a diseased pig from Hunan Province, China . E. coli strain PCN061 was isolated from the lung of a diseased pig from Hunan Province, China. These two strains were routinely cultured in Luria–Bertani (LB) medium at 37 °C. According to Johnson et al.  and Ding et al. , ExPECs were defined as E. coli isolates containing two or more virulence markers: papA/papC, sfa/foc, afa/dra, kpsMTII and iutA. PCN033 and PCN061 were examined in PCR for the presence of the above virulence markers. Serotyping, phylogenetic grouping and virulence analysis in mice model of these two strains were performed in previous study . The serotypes of these strains were identified by serum agglutination assay using specific O-antigen antiserum in the China Institute of Veterinary Drugs Control, Beijing, China. The phylogenetic groups of PCN033 and PCN061 strains were determined based on PCR detection of the chuA and yjaA genes and DNA fragment TSPE4.C2 .
Pathogenicity test in pig
We used 15 high-health-status pigs (4–5 weeks of age) to investigate the virulence of ExPEC strain PCN033. Twelve piglets were evenly assigned to PCN033 and PCN061 group (six pigs per group) and the remaining three were allocated to the PBS negative control group. Prior to the infection, the serum of pigs were tested negative by ELISA for PCN033 strain. Piglets in the PCN033 and PCN061 groups were inoculated with 8 × 108 CFU (1 ml) of each strain by ear vein infection. Three piglets infected with 1 ml PBS were served as negative control. All piglets were observed for morbidity and mortality for a week after infection. The remaining surviving piglets were euthanized. Meanwhile, bacteria were recovered from blood samples collected from the jugular vein before piglets were moribund or euthanized, and used to identified as E. coli by plating onto MacConkey selection Agar and 16S rRNA amplification. The infection experiment was performed based on the International Guiding Principles for Biomedical Research Involving Animals (1985).
DNA extraction and sequencing
Total genomic DNA was isolated from 15 ml of overnight culture using a DNEasy Blood & Cell Culture DNA Mini Kit (Qiagen, Hilden, Germany). The genomes of PCN033 and PCN061 were sequenced using a Roche 454 (GS FLX Tianium) system (http://www.454.com/ ). Sequence reads were assembled with the Newbler de novo assembler package. The relationship between contigs was displayed using ContigScape , determined by sequence alignment analysis and verified by PCR. Gaps between neighboring contigs were closed by sequencing PCR products using an ABI 3730 DNA sequencer (Applied Biosystems, Foster City, CA). The Phred/Phrap/Consed software was used for primer design, genome assembly, edition and quality assessment (http://www.phrap.org/consed/consed.html), and low quality regions of the genome were resequenced.
Gene prediction, annotation, and comparative analysis
Putative open reading frames (ORFs) were predicted using Glimmer  and GeneMark . A protein database was constructed based on the 48 E. coli genomes available in GenBank (Additional file 1: Table S1). The predicted proteins were searched for against the protein database using BLASTP . Predicted CDSs that did not have any matches, or only weak matches (E-value ≥ 1E −10, or amino acid sequence identity <40 %, or coverage of the protein <60 %) in the local protein database were compared with GenBank’s non-redundant protein database (http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/genbank/). Predicted CDSs were also searched against COG (Clusters of Orthologous Groups of proteins; http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/COG/ ) and KEGG (Kyoto Encyclopedia of Genes and Genomes; http://www.genome.jp/kegg/ ) for functional annotation and further function assignment. All predicted CDSs were compared with those in E. coli strains K-12 MG1655 [GenBank : NC_000913], HS [GenBank: NC_009800] and APECO1 [GenBank: NC_008563] using the tBLASTn algorithm to determine pseudogenes. Transfer RNA (tRNA) and ribosomal RNA (rRNA) genes were predicted using tRNAscan-SE  and RNAmmer (http://www.cbs.dtu.dk/services/RNAmmer/ ), respectively. Predicted ORFs that overlap with rRNA or tRNA genes were removed. ISs were annotated using BLASTN against the IS finder database . GIs were predicted by Island Viewer , and genome circular maps were generated using GenomeViz . Comparative genomic analyses of PCN033 and PCN061 was conducted using Mauve 2.3.1 genome alignment software . ANI value was used to analyse the similarities between two whole-genome sequences . Genomic comparison of porcine ExPEC strain PCN033 with other 7 available ExPEC and 2 commensal E. coli strains were based on protein-coding sequences. All-against-All BLASTP for amino acids was used to assign orthologs (E-value ≤1E−10, identity ≥60 %, coverage ≥80 %).
To generate the phylogenetic tree of Escherichieae strains, we identified orthologous CDSs that were conserved in all 56 fully sequenced strains of E. coli, Shigella, and Escherichia fergusonii, using BLASTClust (http://0-www.ncbi.nlm.nih.gov.brum.beds.ac.uk/Web/Newsltr/Spring04/blastlab.html) (Additional file 1: Table S1; Additional file 2: Table S2) (>90 % identity with the same length). DNA sets of 325 CDSs from each of the 56 strains were concatenated and used to generate a phylogenetic tree by the neighbor-joining method with 100 bootstrap iterations, using MEGA5 software .
Swarming motility assays were conducted as previously described . The swarming motility plates were prepared with 0.5 % agar, 1.0 % tryptone, 0.5 % Yeast Extract, 0.5 % NaCl and 0.5 % D-(+)-glucose. The plates were photographed after 16-h incubation at 37 °C.
Biofilm formation assays
Bacterial biofilm formation was assessed using crystal violet as previously described, with some modifications . The wells of a sterile 96-well microplate were filled with 200 μl of LB medium; 2 μl of overnight culture was added to each well. Each strain was tested in triplicate. Wells containing sterile LB medium were treated as negative controls. The plate was covered and incubated at 37 °C for 24 h without shaking. Planktonic cells in wells were aspirated, and washed twice with 200 μl of sterile PBS. Attached cells were stained with 200 μl of 1 % (w/v) crystal violet (Biosharp, Hefei, China) for 15 min at room temperature. Wells were rinsed twice with 200 μl of sterile PBS, and plates dried at 37 °C for 30 min. Stained adherent biofilms were solubilized with 200 μl of 33 % (v/v) glacial acetic acid. To quantify, the optical density (OD) at 600 nm (OD600) of each well was determined using a Synergy™ HT Multi-Detection Reader (BioTed Instruments, Winooski, VT). All tests were independently performed three times and the results averaged. The cutoff value for determining a biofilm producer was defined as 2-fold greater than the negative control .
Distribution differences in the functional categories of CDSs assigned according to COG between PCN033 and PCN061 strains were analyzed using Pearson’s χ2 tests. Differences of biofilm formation ability were determined using Mann–Whitney-Wilcoxon tests. A P-value of 0.05 was considered statistically significant.
The annotated complete genome sequences were deposited in GenBank database with accession numbers CP006632 (E. coli PCN033), CP006633 (plasmid PCN033p1 of E. coli PCN033), CP006634 (plasmid PCN033p2 of E. coli PCN033), CP006635 (plasmid PCN033p3 of E. coli PCN033), CP006636 (E. coli PCN061), CP006637 (plasmid PCN061p1 of E. coli PCN061), CP006638 (plasmid PCN061p2 of E. coli PCN061), CP006639 (plasmid PCN061p3 of E. coli PCN061), CP006640 (plasmid PCN061p4 of E. coli PCN061), CP006641 (plasmid PCN061p5 of E. coli PCN061) and CP006642 (plasmid PCN061p5 of E. coli PCN061), respectively.
Extraintestinal pathogenic Escherichia coli
Type VI secretion system
Intestinal pathogenic E. coli
Uropathogenic E. coli
Newborn meningitis-causing E. coli
Avian pathogenic E. coli
Type II secretion system
Type III secretion system
E. coli type III secretion system 1
E. coli type III secretion system 2
Human brain microvascular endothelial cells
Open reading frames
Kaper JB, Nataro JP, Mobley HL. Pathogenic Escherichia coli. Nat Rev Microbiol. 2004;2:123–40.
Russo TA, Johnson JR. Proposal for a new inclusive designation for extraintestinal pathogenic isolates of Escherichia coli: ExPEC. J Infect Dis. 2000;181:1753–4.
Johnson JR, Russo TA. Extraintestinal pathogenic Escherichia coli: "the other bad E coli". J Lab Clin Med. 2002;139:155–62.
Russo TA, Johnson JR. Medical and economic impact of extraintestinal infections due to Escherichia coli: focus on an increasingly important endemic problem. Microbes Infect. 2003;5:449–56.
Johnson JR, Stell AL. Extended virulence genotypes of Escherichia coli strains from patients with urosepsis in relation to phylogeny and host compromise. J Infect Dis. 2000;181:261–72.
Johnson TJ, Nolan LK. Pathogenomics of the virulence plasmids of Escherichia coli. Microbiol Mol Biol Rev. 2009;73:750–74.
Girardeau JP, Lalioui L, Said AM, De Champs C, Le Bouguenec C. Extended virulence genotype of pathogenic Escherichia coli isolates carrying the afa-8 operon: evidence of similarities between isolates from humans and animals with extraintestinal infections. J Clin Microbiol. 2003;41:218–26.
Johnson TJ, Kariyawasam S, Wannemuehler Y, Mangiamele P, Johnson SJ, Doetkott C, et al. The genome sequence of avian pathogenic Escherichia coli strain O1:K1:H7 shares strong similarities with human extraintestinal pathogenic E. coli genomes. J Bacteriol. 2007;189:3228–36.
Tan C, Tang X, Zhang X, Ding Y, Zhao Z, Wu B, et al. Serotypes and virulence genes of extraintestinal pathogenic Escherichia coli isolates from diseased pigs in China. Vet J. 2012;192:483–8.
Tan C, Xu Z, Zheng H, Liu W, Tang X, Shou J, et al. Genome sequence of a porcine extraintestinal pathogenic Escherichia coli strain. J Bacteriol. 2011;193:5038.
Ding Y, Tang X, Lu P, Wu B, Xu Z, Liu W, et al. Clonal analysis and virulent traits of pathogenic extraintestinal Escherichia coli isolates from swine in China. BMC Vet Res. 2012;8:140.
Johnson JR, Murray AC, Gajewski A, Sullivan M, Snippes P, Kuskowski M, et al. Isolation and molecular characterization of nalidixic acid-resistant extraintestinal pathogenic Escherichia coli from retail chicken products. Antimicrob Agents Chemother. 2003;47:2161–8.
Liu C, Chen Z, Tan C, Liu W, Xu Z, Zhou R, et al. Immunogenic characterization of outer membrane porins OmpC and OmpF of porcine extraintestinal pathogenic Escherichia coli. FEMS Microbiol Lett. 2012;337:104–11.
Archer CT, Kim JF, Jeong H, Park JH, Vickers CE, Lee SY, et al. The genome sequence of E. coli W (ATCC 9637): comparative genome analysis and an improved genome-scale reconstruction of E. coli. BMC Genomics. 2011;12:9.
Chaudhuri RR, Henderson IR. The evolution of the Escherichia coli phylogeny. Infect Genet Evol. 2012;12:214–26.
Dobrindt U, Agerer F, Michaelis K, Janka A, Buchrieser C, Samuelson M, et al. Analysis of genome plasticity in pathogenic and commensal Escherichia coli isolates by use of DNA arrays. J Bacteriol. 2003;185:1831–40.
Zhang Y, Lin K. A phylogenomic analysis of Escherichia coli / Shigella group: implications of genomic features associated with pathogenicity and ecological adaptation. BMC Evol Biol. 2012;12:174.
Pupo GM, Lan R, Reeves PR. Multiple independent origins of Shigella clones of Escherichia coli and convergent evolution of many of their characteristics. Proc Natl Acad Sci U S A. 2000;97:10567–72.
Blattner FR, Plunkett 3rd G, Bloch CA, Perna NT, Burland V, Rilev M, et al. The complete genome sequence of Escherichia coli K-12. Science. 2012;277:1453–62.
Hayashi K, Morooka N, Yamamoto Y, Fujita K, Isono K, et al. Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110. Mol Syst Biol. 2006;2006(2):0007.
Welch RA, Burland V, Plunkett 3rd G, Redford P, Roesch P, et al. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc Natl Acad Sci U S A. 2002;99:17020–4.
Chen SL, Hung CS, Xu J, Reigstad CS, Magrini V, et al. Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli: a comparative genomics approach. Proc Natl Acad Sci U S A. 2006;103:5977–82.
Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, et al. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet. 2009;5:e1000344.
Lu ST, Zhang XB, Zhu YF, Kim KS, Yang J, et al. Complete genome sequence of the neonatal-meningitis-associated Escherichia coli strain CE10. J Bacteriol. 2011;193(24):7005.
Moriel DG, Bertoldi I, Spagnuolo A, Marchi S, Rosini R, et al. Identification of protective and broadly conserved vaccine antigens from the genome of extraintestinal pathogenic Escherichia coli. Proc Natl Acad Sci U S A. 2010;107:9072–7.
Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57:81–91.
Diaz E, Ferrandez A, Prieto MA, Garcia JL. Biodegradation of aromatic compounds by Escherichia coli. Microbiol Mol Biol Rev. 2001;65:523–69.
Hacker J, Blum-Oehler G, Muhldorfer I, Tschape H. Pathogenicity islands of virulent bacteria: structure, function and impact on microbial evolution. Mol Microbiol. 1997;23:1089–97.
Schmidt H, Hensel M. Pathogenicity islands in bacterial pathogenesis. Clin Microbiol Rev. 2004;17:14–56.
Johnson JR, Russo TA. Molecular epidemiology of extraintestinal pathogenic (uropathogenic) Escherichia coli. Int J Med Microbiol. 2005;295:383–404.
Dozois CM, Daigle F, Curtiss 3rd R. Identification of pathogen-specific and conserved genes expressed in vivo by an avian pathogenic Escherichia coli strain. Proc Natl Acad Sci U S A. 2003;100:247–52.
Tivendale KA, Allen JL, Browning GF. Plasmid-borne virulence-associated genes have a conserved organization in virulent strains of avian pathogenic Escherichia coli. J Clin Microbiol. 2009;47:2513–9.
Carattoli A, Zankari E, Garcia-Fernandez A, Volby Larsen M, Lund O, et al. PlasmidFinder and pMLST: in silico detection and typing of plasmids. Antimicrob Agents Chemother. 2014;58(7):3895–903.
Smith HW, Huggins MB. Further observations on the association of the colicine V plasmid of Escherichia coli with pathogenicity and with survival in the alimentary tract. J Gen Microbiol. 1976;92:335–50.
Waters VL, Crosa JH. Colicin V virulence plasmids. Microbiol Rev. 1991;55(3):437–50.
Gibbs MD, Spiers AJ, Bergquist PL. RepFIB: a basic replicon of large plasmids. Plasmid. 1993;29:165–79.
Johnson TJ, Siek KE, Johnson SJ, Nolan LK. DNA sequence of a ColV plasmid and prevalence of selected plasmid-encoded virulence genes among avian Escherichia coli strains. J Bacteriol. 2006;188:745–58.
Sabri M, Leveille S, Dozois CM. A SitABCD homologue from an avian pathogenic Escherichia coli strain mediates transport of iron and manganese and resistance to hydrogen peroxide. Microbiology. 2006;152:745–58.
Stumpe S, Schmid R, Stephens DL, Georgiou G, Bakker EP. Identification of OmpT as the protease that hydrolyzes the antimicrobial peptide protamine before it enters growing cells of Escherichia coli. J Bacteriol. 1998;180:4002–6.
Morales C, Lee MD, Hofacre C, Maurer JJ. Detection of a novel virulence gene and a Salmonella virulence homologue among Escherichia coli isolated from broiler chickens. Foodborne Pathog Dis. 2004;1:160–5.
Chuba PJ, Leon MA, Banerjee A, Palchaudhuri S. Cloningand DNA sequence of plasmid determinant iss, coding for increased serum survival and surface exclusion, which has homology with lambda DNA. Mol Gen Genet. 1898;216:287–92.
Hansen LH, Johannesen E, Burmolle M, Sorensen AH, Sorensen SJ. Plasmid-encoded multidrug efflux pump conferring resistance to olaquindox in Escherichia coli. Antimicrob Agents Chemother. 2004;48:3332–7.
Peigne C, Bidet P, Mahjoub-Messai F, Plainvert C, Barbe V, et al. The plasmid of Escherichia coli strain S88 (O45:K1:H7) that causes neonatal meningitis is closely related to avian pathogenic E. coli plasmids and is associated with high-level bacteremia in a neonatal rat meningitis model. Infect Immun. 2009;77:2272–84.
Johnson TJ, Siek KE, Johnson SJ, Nolan LK. DNA sequence and comparative genomics of pAPEC-O2-R, an avian pathogenic Escherichia coli transmissible R plasmid. Antimicrob Agents Chemother. 2005;49:4681–8.
Nuccio SP, Baumler AJ. Evolution of the chaperone/usher assembly pathway: fimbrial classification goes Greek. Microbiol Mol Biol Rev. 2007;71:551–75.
Wurpel DJ, Beatson SA, Totsika M, Petty NK, Schembri MA. Chaperone-Usher Fimbriae of Escherichia coli. PLoS One. 2013;8:e52835.
Ong CL, Beatson SA, McEwan AG, Schembri MA. Conjugative plasmid transfer and adhesion dynamics in an Escherichia coli biofilm. Appl Environ Microbiol. 2009;75:6783–91.
Burmolle M, Bahl MI, Jensen LB, Sorensen SJ, Hansen LH. Type 3 fimbriae, encoded by the conjugative plasmid pOLA52, enhance biofilm formation and transfer frequencies in Enterobacteriaceae strains. Microbiol. 2008;154:187–95.
Li Y, Perepelov AV, Guo D, Shevelev SD, Senchenkova SN, et al. Structural and genetic relationships of two pairs of closely related O-antigens of Escherichia coli and Salmonella enterica: E. coli O11/S. enterica O16 and E. coli O21/S. enterica O38. FEMS Immunol Med Microbiol. 2010;61:258–68.
Kido N, Torgov VI, Sugiyama T, Uchiya K, Sugihara H, et al. Expression of the O9 polysaccharide of Escherichia coli: sequencing of the E. coli O9 rfb gene cluster, characterization of mannosyl transferases, and evidence for an ATP-binding cassette transport system. J Bacteriol. 1995;177(8):2178–87.
Sugiyama T, Kodo N, Komatsu T, Ohta M, Jann K, et al. Genetic analysis of Escherichia coli O9 rfb: identification and DNA sequence of phosphomannomutase and GDP-mannose pyrophosphorylase genes. Microbiol. 1994;140(1):59–71.
Taylor CM, Roberts IS. Capsular polysaccharides and their role in virulence. Contrib Microbiol. 2005;12:55–66.
Ophir T, Gutnick DL. A role for exopolysaccharides in the protection of microorganisms from desiccation. Appl Environ Microbiol. 1994;60:740–5.
Whitfield C, Roberts IS. Structure, assembly and regulation of expression of capsules in Escherichia coli. Mol Microbiol. 1999;31:1307–19.
Badger JL, Kim KS. Enviromental growth conditions influence the ability of Escherichia coli K1 to invade brain microvascular endothelial cells and confer serum resistance. Infect Immun. 1998;66(12):5692–7.
Sathiyamoorthy K, Mills E, Franzmann TM, Rosenshine I, Saper MA. The crystal structure of Escherichia coli group 4 capsule protein GfcC reveals a domain organization resembling that of Wza. Biochem. 2011;50:5465–76.
Gao Q, Wang X, Xu H, Xu Y, Ling J, et al. Roles of iron acquisition systems in virulence of extraintestinal pathogenic Escherichia coli: salmochelin and aerobactin contribute more to virulence than heme in a chicken infection model. BMC Microbiol. 2012;12:143.
Braun V. Iron uptake mechanisms and their regulation in pathogenic bacteria. Int J Med Microbiol. 2001;291:67–79.
Tseng TT, Tyler BM, Setubal JC. Protein secretion systems in bacterial-host associations, and their description in the Gene Ontology. BMC Microbiol. 2009;9 Suppl 1:S2.
Arnold R, Jehl A, Rattei T. Targeting effectors: the molecular recognition of Type III secreted proteins. Microbes Infect. 2010;12:346–58.
Abby SS, Rocha EP. The non-flagellar type III secretion system evolved from the bacterial flagellum and diversified into host-cell adapted systems. PLoS Genet. 2012;8:e1002983.
Liu X, Matsumura P. The FlhD/FlhC complex, a transcriptional activator of the Escherichia coli flagellar class II operons. J Bacteriol. 1994;176(23):7345–51.
Parkhill J, Wren BW, Thomson NR, Titball RW, Holden MT, et al. Genome sequence of Yersinia pestis, the causative agent of plague. Nature. 2001;413:523–7.
Ren CP, Beatson SA, Parkhill J, Pallen MJ. The Flag-2 locus, an ancestral gene cluster, is potentially associated with a novel flagellar system from Escherichia coli. J Bacteriol. 2005;187:1430–40.
Ren CP, Chaudhuri RR, Fivian A, Bailey CM, Antonio M, et al. The ETT2 gene cluster, encoding a second type III secretion system from Escherichia coli, is present in the majority of strains but has undergone widespread mutational attrition. J Bacteriol. 2004;186:3547–60.
Yao Y, Xie Y, Perace D, Zhong Y, Lu J, et al. The type III secretion system is involved in the invasion and intracellular survival of Escherichia coli K1 in human brain microvascular endothelial cells. FEMS Microbiol Lett. 2009;300:18–24.
Coulthurst SJ. The Type VI secretion system - a widespread and versatile cell targeting system. Res Microbiol. 2013;164(6):640–54.
Shrivastava S, Mande SS. Identification and functional characterization of gene components of Type VI Secretion system in bacterial genomes. PLoS One. 2008;3:e2955.
Pukatzki S, McAuley SB, Miyata ST. The type VI secretion system: translocation of effectors and effector-domains. Curr Opin Microbiol. 2009;12:11–7.
Zhou Y, Tao J, Yu H, Ni J, Zeng L, et al. Hcp family proteins secreted via the type VI secretion system coordinately regulate Escherichia coli K1 interaction with human brain microvascular endothelial cells. Infect Immun. 2012;80:1243–51.
Erni B, Zanolari B, Kocher HP. The mannose permease of Escherichia coli consists of three different proteins. Amino acid sequence and function in sugar transport, sugar phosphorylation, and penetration of phage lambda DNA. J Biol Chem. 1987;262(11):5238–47.
Solomon E, Lin EC. Mutations affecting the dissimilation of mannitol by Escherichia coli K-12. J Bacteriol. 1972;111(2):566–74.
Postma PW, Lengeler JW, Jacobson GR. Phosphoenolpyruvate: carbohydrate phosphotransferase systems of bacteria. Microbiol Rev. 1993;57(3):543–94.
Dippel R, Boos W. The maltodextrin system of Escherichia coli: metabolism and transport. J Bacteriol. 2005;187(24):8322–31.
Kolb A, Busby S, Buc H, Garges S, Adhya S. Transcriptional regulation by cAMP and its receptor protein. Annu Rev Biochem. 1993;62:749–95.
Shimada T, Yamamoto K, Ishihama A. Novel members of the Cra regulon involved in carbon metabolism in Escherichia coli. J Bacteriol. 2011;193:649–59.
Jahreis K, Bentler L, Bockmann J, Hans S, Meyer A, et al. Adaptation of sucrose metabolism in the Escherichia coli wild-type strain EC3132. J Bacteriol. 2002;184:5307–16.
Gibbins LN, Simpson FJ. The Incorporation of D-Allose into the Glycolytic Pathway by Aerobacter Aerogenes. Can J Microbiol. 1964;10:829–36.
Kim C, Song S, Park C. The D-allose operon of Escherichia coli K-12. J Bacteriol. 1997;179(24):7631–7.
Pellicer MT, Badia J, Aguilar J, Baldomà L. glc locus of Escherichia coli: characterization of genes encoding the subunits of glycolate oxidase and the glc regulator protein. J Bacteriol. 1996;178(7):2051–9.
Vieira G, Sabarly V, Bourguignon PY, Durot M, Le Fevre F, et al. Core and panmetabolism in Escherichia coli. J Bacteriol. 2011;193:1461–72.
Wood TK, Gonzalez Barrios AF, Herzberg M, Lee J. Motility influences biofilm architecture in Escherichia coli. Appl Microbiol Biotechnol. 2006;72:361–7.
Clermont O, Bonacorsi S, Bingen E. Rapid and simple determination of the Escherichia coli phylogenetic group. Appl Environ Microbiol. 2000;66:4555–8.
Tang B, Wang Q, Yang M, Xie F, Zhu YQ, Zhuo Y, et al. ContigScape: a Cytoscape plugin facilitating microbial genome gap closing. BMC Genomics. 2013;14:289.
Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 1999;27:4636–41.
Lukashin AV, Borodovsky M. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 1998;26:1107–15.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
Tatusov RL, Koonin EV, Lipman DJ. A genomic perspective on protein families. Science. 1997;278:631–7.
Ogata H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 1999;27:29–34.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–64.
Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–8.
Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M. ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 2006;34:D32–36.
Langille MG, Brinkman FS. IslandViewer: an integrated interface for computational identification and visualization of genomic islands. Bioinformatics. 2009;25:664–5.
Ghai R, Hain T, Chakraborty T. GenomeViz: visualizing microbial genomes. BMC Bioinformatics. 2004;5:198.
Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14:1394–403.
Hall BG. Building phylogenetic trees from molecular data with MEGA. Mol Biol Evol. 2013;30:1229–35.
Gomez-Gomez JM, Manfredi C, Alonso JC, Blazquez J. A novel role for RecA under non-stress: promotion of swarming motility in Escherichia coli K-12. BMC Biol. 2007;5:14.
Jin H, Zhou R, Kang M, Luo R, Cai X, Chen H. Biofilm formation by field isolates and reference strains of Haemophilus parasuis. Vet Microbiol. 2006;118:117–23.
Konstantinidis KT, Tiedje JM. Genomic insights that advance the species definition for prokaryotes. Proc Natl Acad Sci U S A. 2005;102:2567.
This work was supported by grants from the National Natural Science Foundation of China (NSFC) (No.31472201, No. 31030065,No. 31421064), and the 863 Program (2012AA101601).
The authors declare that they have no competing interests financial or non-financial related to the content of this article.
CL, CT, ZX and HZ concepted and designed the experiements. CL, XW, FL, YZ, YD and XT performed the experiments. CL, MY and BT performed the analysis of data. CL and CT wrote the manuscript. CT, TJJ, HZ and MY modified the manuscript. All authors approve submission of this manuscript to BMC Genomics.
Genomes used for phylogenetic and Escherichia coli protein database construction (DOC 86 kb)
Core genome content used to construct phylogenetic tree (XLSX 225 kb)
Islands in PCN033 but absent from MG1655 or W3110 (XLS 39 kb)
425 special CDSs in PCN033 (XLSX 29 kb)
Overview of the genomic islands in PCN033 and PCN061 (XLSX 14 kb)
Biochemical characteristics of PCN033 and PCN061 strains with API 20E test strips (DOC 40 kb)
About this article
Cite this article
Liu, C., Zheng, H., Yang, M. et al. Genome analysis and in vivo virulence of porcine extraintestinal pathogenic Escherichia coli strain PCN033. BMC Genomics 16, 717 (2015). https://0-doi-org.brum.beds.ac.uk/10.1186/s12864-015-1890-9
- Porcine extraintestinal pathogenic Escherichia coli
- Comparative genomic analysis
- Pathogenicity analyses
- Virulence factors