Skip to main content

Table 2 Pan-genome statistics derived from analyses of several bacterial taxa displaying ‘open’ pan-genomes

From: Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts

Species

# strains

Mean # genome CDSs

Mean # core CDSs

% core CDSs/genome

Pan-genome (# CDS) compared strains

Estimated core genome (# CDS)

Unique CDSs per additional strain

Ref

Streptococcus pneumoniae *

44

-

-

-

3,221

1,666a

1a

[24]

Streptococcus agalactiae

8

2,029

2,245

90.4%

>2,700

1,806

33

[12]

Haemophilus influenzae *

13

1,793

1,461

81.5%

2,786

1,461

40

[25]

Erwinia amylovora

12

3,819

3,414

89.4%

5,751

-

52

[21]

Pantoea ananatis

8

4,342

4,033

92.9%

5,566

3,876

106

-

Escherichia coli

17

5,020

2,344

46.7%

>13,000

2,200

300

[22]

Bacillus thuringiensis

7

6,847

-

> 81%

>12,000

4,196

558

[26]

  1. The number of strains included in the comparison are indicated, as are the mean number of CDSs per compared genome, those CDSs which are core to all compared strains, the relative proportion of core CDSs per genome (%) and the pan-genome sizes (#CDSs) for the compared strains, where the data is available. The estimated core genomes and unique CDSs added to the pan-genome when additional sequenced strains would be added to the comparison are shown. *denotes those pan-genome analyses where the orthologous CDSs in a compared genome were first grouped ito orthologous clusters. aindicates the number of unique genes that are predicted to be added to the S. pneumoniae pan-genome if 100 genomes would be compared.