Skip to main content

Table 4 Comparison of the misclassification error between the raw datasets, the adjusted data using the default HarmonizR adjustment and two blocking approaches using a block size of 2 and 4 respectively. The tests were done on the datasets from Petralia et al. and Krug et al., using the KNN classifier provided by the Python package ’sklearn’. No prior sorting was performed

From: HarmonizR: blocking and singular feature data adjustment improve runtime efficiency and data preservation

Approach

Raw data (%)

Default

adjustment (%)

block = 2 (%)

block = 4 (%)

Misclassification

error (Krug et al.)

57.2

27.4

27.0

26.5

Misclassification

error (Petralia et al.)

71.3

19.4

19.6

20.2