Skip to main content

Table 1 Definition of Geneset provenance

From: Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction

Geneset

Provenance

GENCODE Comprehensive

All transcripts at protein-coding genes. Includes transcripts with NMD, retained_intron and processed_transcript biotypes.

GENCODE Basic

Only full-length, protein-coding transcripts at protein-coding genes.

RefSeq NXR

All RefSeq transcripts at protein-coding genes. Includes manually annotated NM, NR and automated XM transcripts.

RefSeq NR

Only manually-annotated transcripts at protein-coding genes. Includes NM and NR transcripts

  1. Transcript functional biotypes and source e.g. manual or automated annotation, for the four genesets used in this study.