The relevance vector machine (RVM) (Tipping, 2001) encapsulates a sparse probabilistic model for machine learning tasks. Like support vector machines, use of the kernel trick allows modelling in high dimensional feature spaces to be achieved at low computational cost. However, sparsity is controlled not just by the automatic relevance determination (ARD) prior but also by the choice of basis fu...