Table 1 The training data

From: Identification of combinatorial host-specific signatures with a potential to affect host adaptation in influenza A H1N1 and H3N2 subtypes

 

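Columns 2-5 give the number of sequences for each subtype; columns 7-8 give the features after MCFS.

| Protein | H1N1 avian | H1N1 human | H3N2 avian | H3N2 human | Total features | Features after MCFS (H1N1) | Features after MCFS (H3N2) |
|---|---|---|---|---|---|---|---|
| HA | 214 | 5205 | 164 | 3715 | 628 | 115 | 88 |
| NA | 205 | 3093 | 173 | 3412 | 517 | 93 | 79 |
| NS1 | 150 | 1258 | 150 | 1176 | 249 | 98 | 85 |
| NEP | 61 | 407 | 54 | 299 | 124 | 31 | 26 |
| NP | 125 | 839 | 93 | 773 | 506 | 61 | 69 |
| M1 | 45 | 467 | 42 | 355 | 275 | 18 | 15 |
| M2 | 65 | 461 | 64 | 503 | 98 | 25 | 23 |
| PA | 192 | 1677 | 143 | 1358 | 726 | 65 | 47 |
| PA-X | 57 | 164 | 45 | 244 | 252 | 28 | 24 |
| PB1 | 171 | 1654 | 132 | 1347 | 762 | 59 | 33 |
| PB2 | 184 | 1817 | 136 | 1297 | 776 | 52 | 42 |
| PB1-F2 | 151 | 224 | 112 | 737 | 101 | 64 | 54 |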

  1. Total features is the total number of amino acid (aa) positions investigated. Features after MCFS are the aa positions ranked as significant, i.e. those with power to discriminate avian from human sequences.
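
To make the relationship between the two feature columns concrete, the following minimal sketch computes, for each protein, the share of investigated aa positions that were ranked significant after MCFS. The numbers are copied from Table 1; the names `TABLE_1` and `retained_fraction` are hypothetical helpers introduced here for illustration, and the "retained fraction" is simple arithmetic on the table, not a statistic reported by the authors.

```python
# Values copied from Table 1.
# protein -> (total features, features after MCFS for H1N1, features after MCFS for H3N2)
TABLE_1 = {
    "HA": (628, 115, 88),
    "NA": (517, 93, 79),
    "NS1": (249, 98, 85),
    "NEP": (124, 31, 26),
    "NP": (506, 61, 69),
    "M1": (275, 18, 15),
    "M2": (98, 25, 23),
    "PA": (726, 65, 47),
    "PA-X": (252, 28, 24),
    "PB1": (762, 59, 33),
    "PB2": (776, 52, 42),
    "PB1-F2": (101, 64, 54),
}


def retained_fraction(protein: str, subtype: str) -> float:
    """Share of investigated aa positions ranked significant for one subtype.

    Illustrative helper (not from the paper): features after MCFS / total features.
    """
    total, h1n1, h3n2 = TABLE_1[protein]
    selected = {"H1N1": h1n1, "H3N2": h3n2}[subtype]
    return selected / total


if __name__ == "__main__":
    # Print one line per protein with both subtypes as percentages.
    for protein in TABLE_1:
        h1 = retained_fraction(protein, "H1N1")
        h3 = retained_fraction(protein, "H3N2")
        print(f"{protein:7s} H1N1 {h1:6.1%}   H3N2 {h3:6.1%}")
```

Read this way, the proportion of positions retained varies widely between proteins, so the raw counts in the last two columns are best compared alongside the total number of positions for the same protein.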