In most low-level computer vision problems, very little information (if any) is known about the true underlying probability density function, such as its shape, number of mixture components, etc.. Due to this lack of knowledge, parametric approaches are less relevant, rather one has to rely on non-parametric methods. In this note we consider the construction and convergence proof of the non-par...