Learning Decision Trees Using the Fourier Spectrum
Abstract
This work gives a polynomial-time algorithm for learning decision trees with respect to the uniform distribution. (The algorithm uses membership queries.) The decision tree model considered is an extension of the traditional boolean decision tree model that allows linear operations in each node (i.e., summation of a subset of the input variables over GF(2)). The paper shows how to learn in polynomial time any function that can be approximated (in the L2 norm) by a polynomially sparse function (i.e., a function with only polynomially many nonzero Fourier coefficients). The authors demonstrate that any function f whose L1-norm (i.e., the sum of the absolute values of the Fourier coefficients) is polynomial can be approximated by a polynomially sparse function, and prove that boolean decision trees with linear operations are a subset of this class of functions. Moreover, it is shown that the functions with polynomial L1-norm can be learned deterministically. The algorithm can also exactly identify a decision tree of depth d in time polynomial in 2^d and n. This result implies that trees of logarithmic depth can be identified in polynomial time.
Key words. machine learning, decision trees, Fourier transform
AMS subject classifications. 42A16, 68Q20, 68T05
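The Fourier quantities named in the abstract can be illustrated on a toy example. The following is a minimal sketch, not the paper's learning algorithm: it computes every Fourier coefficient by brute force over all 2^n inputs (so it is exponential in n and uses no membership-query search), then reports the L1-norm and a sparsified spectrum. The tree `tree_f`, the helper names, and the convention that outputs lie in {+1, -1} are assumptions made for illustration.

```python
from itertools import product


def tree_f(x):
    """Hypothetical depth-2 decision tree on 3 bits with outputs in {+1, -1}."""
    if x[0] == 0:
        return 1 if x[1] == 0 else -1
    return 1 if x[2] == 0 else -1


def chi(s, x):
    """Parity character: (-1) raised to the GF(2) sum of x_i over i with s_i = 1."""
    return -1 if sum(si & xi for si, xi in zip(s, x)) % 2 else 1


def fourier_coefficients(f, n):
    """Brute-force coefficient of each subset s: the average of f(x) * chi_s(x)."""
    points = list(product((0, 1), repeat=n))
    return {
        s: sum(f(x) * chi(s, x) for x in points) / len(points)
        for s in points
    }


def l1_norm(coeffs):
    """Sum of the absolute values of the Fourier coefficients."""
    return sum(abs(c) for c in coeffs.values())


def sparsify(coeffs, threshold):
    """Keep only coefficients of magnitude at least `threshold`."""
    return {s: c for s, c in coeffs.items() if abs(c) >= threshold}


coeffs = fourier_coefficients(tree_f, 3)
sparse = sparsify(coeffs, 0.25)
print("L1-norm:", l1_norm(coeffs))
print("nonzero coefficients:", sparse)
```

For this particular tree, four of the eight coefficients are nonzero, each of magnitude 1/2, so the L1-norm is 2 and the sparsified spectrum already represents the function exactly.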
Related resources
CS 880: Advanced Complexity Theory, 2/11/2008, Lecture 8: Active Learning
Last time we studied computational learning theory and saw how harmonic analysis could be used to design and analyze efficient learning algorithms with respect to the uniform distribution. We developed a generic passive learning algorithm for concepts whose Fourier spectrum is concentrated on a known set, and applied it to decision trees. We also started developing an approach for the case wher...
Mining Decision Trees from Data Streams in a Mobile Environment
This paper presents a novel Fourier analysis-based technique to aggregate, communicate, and visualize decision trees in a mobile environment. Fourier representation of a decision tree has several useful properties that are particularly useful for mining continuous data streams from small mobile computing devices. This paper presents algorithms to compute the Fourier spectrum of a decision tree ...
A Fourier Analysis Based Approach to Learning Decision Trees in a Distributed Environment
Spurred by advances in communication technologies, mobile computing and databases that are distributed have become widespread. Such a computing environment involves data that is stored at geographically dispersed locations, and the so-called “slim” computing devices such as palmtops and wearable computers. The decentralized nature of data storage and this new paradigm in computing give rise to ...
An inequality for the Fourier spectrum of parity decision trees
We give a new bound on the sum of the linear Fourier coefficients of a Boolean function in terms of its parity decision tree complexity. This result generalizes an inequality of O’Donnell and Servedio for regular decision trees [OS08]. We use this bound to obtain the first non-trivial lower bound on the parity decision tree complexity of the recursive majority function.
Steel Buildings Damage Classification by damage spectrum and Decision Tree Algorithm
Results of damage prediction in buildings can be used as a useful tool for managing and decreasing seismic risk of earthquakes. In this study, damage spectrum and C4.5 decision tree algorithm were utilized for damage prediction in steel buildings during earthquakes. In order to prepare the damage spectrum, steel buildings were modeled as a single-degree-of-freedom (SDOF) system and time-history...
Journal title:
- SIAM J. Comput.
Volume 22
Publication date: 1993