It does this by comparing the prediction errors of the two models over a given period of time. The test evaluates the null hypothesis that the two models have the same performance on average, against the alternative that they do not. When the test statistic exceeds a critical value, we reject the null hypothesis, indicating that