Skip to main content

Table 2 The predictive performance of the naïve Bayesian inference program, achieved when implementing a Gaussian likelihood function of a.) the observed structural characteristics alone, b.) when implementing the observed protein family frequencies alone as likelihoods and c.) when combining the observed protein family frequencies with the Gaussian likelihood functions of observed structural characteristics.

From: Bayesian prediction of bacterial growth temperature range based on genome sequences

Class

Test set

a. Structural features

MCC

% Correct predictions

Thermophiles

0.24

80.0

Mesophiles

0.36

50.0

Psychrophiles

0.47

25.0

b. Protein families

MCC

% Correct predictions

Thermophiles

0.60

92.9

Mesophiles

0.13

28.6

Psychrophiles

0.51

50.0

c. Combined

MCC

% Correct predictions

Thermophiles

0.67

92.0

Mesophiles

0.40

57.1

Psychrophiles

0.68

50.0

  1. (For the individual predictions, see Additional file 5, 6 and 7)