
Table 1 Summary of the data sets.

From: Improving the specificity of exon prediction using comparative genomics

 

| | clearly orthologous exons (TP) | potential non-exons (FP) | potential non-genes (FP) | RefSeq exons (TP) | RefSeq genes (TP) |
|---|---|---|---|---|---|
| size | 76,229 (1.2 × 10^7 bp) | 1,518,082 (8.3 × 10^8 bp) | -- | 172,042 (2.9 × 10^7 bp) | 20,193 |
| GENSCAN | -- | -- | -- | 117,860 | 3,497 |
| TWINSCAN | -- | -- | -- | 118,650 | 5,131 |
| GENSCAN (w/mouse) | 53,217 | 54,360 | 4,856 | 115,551 | 3,284 |
| TWINSCAN (w/mouse) | 54,879 | 12,276 | 1,172 | 117,100 | 4,944 |
| GENSCAN (w/dog) | 52,712 | 49,899 | -- | -- | -- |
| TWINSCAN (w/dog) | 54,257 | 11,095 | -- | -- | -- |

  1. The first row lists the types of sequences in the data set. The second row (size) gives the number of sequences of each type and, in parentheses, the corresponding number of base pairs. The GENSCAN row gives the number of GENSCAN-predicted exons with both ends matching RefSeq exons and the number of GENSCAN-predicted genes that exactly match RefSeq genes. The GENSCAN (w/mouse) row covers only predictions that have full alignments with mouse: it gives the number of predicted exons with both ends matching clearly orthologous exons, the number of predicted exons with both ends within or matching potential non-exons, and the number of predicted genes whose exons all lie in potential non-exons. The GENSCAN (w/dog) row covers only predictions that have full alignments with dog: it gives the number of predicted exons with both ends matching clearly orthologous exons and the number of predicted exons with both ends within or matching potential non-exons. The TWINSCAN, TWINSCAN (w/mouse), and TWINSCAN (w/dog) rows give the corresponding exon and gene counts for TWINSCAN's predictions.
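
The two matching criteria described in the note ("both ends matching" and "both ends within or matching") can be illustrated with a minimal sketch. It assumes exons and regions are represented as (chromosome, start, end) intervals with inclusive coordinates and ignores strand; the names `Interval`, `count_exact_matches`, and `count_within_or_matching` are hypothetical and are not taken from the paper, GENSCAN, or TWINSCAN. The coordinates in the example are made-up toy values, not data from the table.

```python
from typing import Iterable, NamedTuple


class Interval(NamedTuple):
    """A hypothetical exon or region: inclusive coordinates, strand ignored."""
    chrom: str
    start: int
    end: int


def count_exact_matches(predicted: Iterable[Interval],
                        reference: Iterable[Interval]) -> int:
    """Count predicted exons whose two ends both match a reference exon."""
    ref = set(reference)
    return sum(1 for p in predicted if p in ref)


def is_within_or_matching(exon: Interval, region: Interval) -> bool:
    """True if both ends of the exon lie inside, or on the ends of, the region."""
    return (exon.chrom == region.chrom
            and region.start <= exon.start
            and exon.end <= region.end)


def count_within_or_matching(predicted: Iterable[Interval],
                             regions: Iterable[Interval]) -> int:
    """Count predicted exons contained in at least one region."""
    regions = list(regions)
    return sum(1 for p in predicted
               if any(is_within_or_matching(p, r) for r in regions))


# Toy example with made-up coordinates.
predicted = [
    Interval("chr1", 100, 200),  # both ends match a reference exon
    Interval("chr1", 300, 400),  # start matches, end does not
    Interval("chr1", 510, 590),  # lies inside a non-exon region
]
reference_exons = [Interval("chr1", 100, 200), Interval("chr1", 300, 450)]
non_exon_regions = [Interval("chr1", 500, 600)]

true_positives = count_exact_matches(predicted, reference_exons)
false_positives = count_within_or_matching(predicted, non_exon_regions)
```

The linear scan in `count_within_or_matching` is fine for a toy input; for data sets of the size in the table, an interval index or a sorted sweep would be the more practical choice.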