In general, approximating classes of functions deened over high-dimensional input spaces by linear combinations of a xed set of basis functions or \features" is known to be hard. Typically, the worst-case error of the best basis set decays only as fast as ? n ?1=d , where n is the number of basis functions and d is the input dimension. However , there are many examples of high-dimensional patte...