We consider the comparison of multiple (possibly all misspecified) models in terms of their out-of-sample predictive ability. Typically, the candidate models contain parameters estimated using recursive (or related rolling) estimation schemes. In some cases, predictive evaluation tests have a limiting distribution that is a functional of a Gaussian process, with a covariance kernel that...