Skip to main content

Table 12 Results with different data reduction

From: PCVR: a pre-trained contextualized visual representation for DNA sequence classification

Pre-training dataset proportion

Fine-tuning dataset proportion

Dist. related

Final

  

Supk.

Phyl.

Supk.

Phyl.

Genus

100%

100%

94.59

75.08

98.97

96.29

74.65

50%

100%

88.59

68.35

98.92

96.07

73.75

50%

50%

89.31

68.90

98.52

95.21

71.37

50%

25%

89.10

66.27

98.10

94.23

68.35

25%

100%

6.67

0.11

8.08

0.05

0.35

25%

50%

6.67

0.11

8.08

0.05

0.35

25%

25%

6.67

0.11

8.08

0.05

0.35