
Table 2 Chromosomal Coding Sequence (CDS) Counts From Different Annotation Providers

From: Comparative supragenomic analyses among the pathogens Staphylococcus aureus, Streptococcus pneumoniae, and Haemophilus influenzae Using a modification of the finite supragenome model

| Genome | PGAAP | RAST | RefSeq CDS | RefSeq Accession | GenBank CDS | GenBank Accession | CMR-P | CMR-T | IMG |
|---|---|---|---|---|---|---|---|---|---|
| CGSSa00 | 2,781 | 2,733 | n.a. | n.a. | n.a. | ABWS00000000 | n.a. | n.a. | n.a. |
| CGSSa01 | 2,971 | 2,769 | n.a. | n.a. | n.a. | ABWT00000000 | n.a. | n.a. | n.a. |
| CGSSa03 | 2,951 | 2,795 | n.a. | n.a. | n.a. | ABWY00000000 | n.a. | n.a. | n.a. |
| COL | 2,864 | 2,687 | 2,615 | NC_002951.2 | 2,673 | CP000046.1 | 2,712 | n.a. | 2,649 |
| JH1 | 2,992 | 2,828 | 2,747 | NC_009632.1 | 2,747 | CP000736.1 | n.a. | n.a. | 2,789 |
| JH9 | 2,997 | 2,828 | 2,697 | NC_009487.1 | 2,697 | CP000703.1 | n.a. | n.a. | 2,731 |
| MRSA252 | 2,901 | 2,823 | 2,656 | NC_002952.2 | 2,744 | BX571856.1 | 2,744 | 2,689 | 2,733 |
| MSSA476 | 2,829 | 2,679 | 2,579 | NC_002953.3 | 2,619 | BX571857.1 | 2,619 | 2,524 | 2,614 |
| Mu3 | 2,945 | 2,777 | 2,698 | NC_009782.1 | 2,699 | AP009324.1 | n.a. | n.a. | 2,698 |
| Mu50 | 2,949 | 2,785 | 2,697 | NC_002758.2 | 2,699 | BA000017.4 | 2,714 | 2,628 | 2,697 |
| MW2 | 2,860 | 2,695 | 2,632 | NC_003923.1 | 2,632 | BA000033.2 | 2,632 | 2,849 | 2,632 |
| N315 | 2,837 | 2,688 | 2,588 | NC_002745.2 | 2,593 | BA000018.3 | 2,592 | 2,762 | 2,588 |
| NCTC8325 | 2,924 | 2,747 | 2,892 | NC_007795.1 | 2,892 | CP000253.1 | 2,892 | 2,654 | 2,894 |
| Newman | 3,025 | 2,813 | 2,614 | NC_009641.1 | 2,614 | AP009351.1 | n.a. | n.a. | 2,614 |
| RF122 | 2,795 | 2,715 | 2,509 | NC_007622.1 | 2,589 | AJ938182.1 | 2,589 | 2,595 | 2,579 |
| USA300 | 2,957 | 2,778 | 2,560 | NC_007793.1 | 2,560 | CP000255.1 | 2,578 | n.a. | 2,646 |
| USA300TCH15 | 2,955 | 2,783 | 2,657 | NC_010079.1 | 2,657 | CP000730.1 | n.a. | n.a. | 2,710 |

Abbreviations: PGAAP, NCBI's "Prokaryotic Genome Automated Annotation Pipeline"; RAST, Argonne National Laboratory's "Rapid Annotation using Subsystem Technology" system; CMR, J. Craig Venter Institute's Comprehensive Microbial Resource (v. 21.0); CMR-P and CMR-T, primary annotations and JCVI's re-annotations, respectively; IMG, DOE-Joint Genome Institute's Integrated Microbial Genomes (v. 2.5); n.a., not available. A RefSeq record is derived from an underlying GenBank record, but the annotations in the two records may differ.
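
The remark that RefSeq and GenBank annotations may differ can be checked directly against the counts in Table 2. The following is a minimal sketch in Python, not part of the original article: the dictionary holds a hand-transcribed subset of rows, and the names `REFSEQ_VS_GENBANK` and `cds_differences` are hypothetical, chosen only for this illustration.

```python
# CDS counts transcribed by hand from Table 2: genome -> (RefSeq CDS, GenBank CDS).
# Only a subset of rows is included; rows with "n.a." in either column are omitted.
REFSEQ_VS_GENBANK = {
    "COL": (2615, 2673),
    "JH1": (2747, 2747),
    "MRSA252": (2656, 2744),
    "MSSA476": (2579, 2619),
    "Mu50": (2697, 2699),
    "N315": (2588, 2593),
    "RF122": (2509, 2589),
}


def cds_differences(counts):
    """Return {genome: GenBank CDS minus RefSeq CDS} for genomes where the two differ."""
    return {
        genome: genbank - refseq
        for genome, (refseq, genbank) in counts.items()
        if genbank != refseq
    }


if __name__ == "__main__":
    for genome, delta in cds_differences(REFSEQ_VS_GENBANK).items():
        print(f"{genome}: GenBank lists {delta:+d} CDS relative to RefSeq")
```

For the rows included here, JH1 is the only genome whose RefSeq and GenBank counts agree; in the others the GenBank record lists more CDS than the RefSeq record.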