Inverse theory teaches us that the residual, or misfit function, should be weighted by the inverse covariance matrix of the noise. Because the covariance operator is often difficult to estimate, we can approximate it with a diagonal weight that can be more easily computed. This paper investigates the possible choices of weighting functions for the data residual when prediction-error filters are...