Function Discovery using Data Transformation

نویسندگان

  • Thong H. Phan
  • Ian H. Witten
چکیده

Function discovery is the problem of nding a symbolic formula for a function f : D1!D2 from a set of examples f(x; y) j y = f(x)g D1 D2. Acceptable solutions are restricted to formulas expressible in some given description language L. In most previous discovery systems|for example, Bacon [4], Abacus [1], Coper [2], Fahrenheit [3] and IDS [5]|the description language is restricted to contain only rational functions so that symbolic descriptions can easily be enumerated. This paper shows how the idea of data transformation, a technique pioneered in FFD [6], can be used as the basis of a far more comprehensive description language that includes all functions that can be transformed to rational functions by di erentiation and logarithm operations. The data transformation approach induces functions by searching through a space of transformations that can be expressed both symbolically and as operations on the example set. It expresses the solution implicitly, as a small system of simultaneous equations, some of which may be di erential equations. Solving these symbolically would (where possible) give a formula for the unknown function. In order to operationalize transformations while maintaining the ability of the example set to faithfully represent transformed functions, it is necessary to allow for new examples to be requested interactively. The main contribution of this paper is to de ne a transformation-based description language and characterize its representational power. We also brie y sketch a practical implementation of a function induction system that uses this approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data transformation:Asemantically-based approach to function discovery

This paper presents the method of data transformation for discovering numeric functions from their examples. Based on the idea of transformations between functions, this method can be viewed as a semantic counterpart to the more common approach of formula construction used in most previous discovery systems. Advantages of the new method include a exible implementation through the design of tran...

متن کامل

Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations

Most of the stream mining techniques presented so far have primary paid attention to discovering association rules by direct comparison between time-series data sets. However, their utility is very limited for heterogeneous systems, where time series of various types (discrete, continuous, oscillatory, noisy, etc.) act dynamically in a strongly correlated manner. In this paper, we introduce a n...

متن کامل

Inverse Sturm-Liouville problems with transmission and spectral parameter boundary conditions

This paper deals with the boundary value problem involving the differential equation ell y:=-y''+qy=lambda y, subject to the eigenparameter dependent boundary conditions along with the following discontinuity conditions y(d+0)=a y(d-0), y'(d+0)=ay'(d-0)+b y(d-0). In this problem q(x), d, a , b are real, qin L^2(0,pi), din(0,pi) and lambda is a parameter independent of x. By defining a new...

متن کامل

A New Group Data Envelopment Analysis Method for Ranking Design Requirements in Quality Function ‎Deployment

‎Data envelopment analysis (DEA) is an objective method for priority determination of decision making units (DMUs) with the same multiple inputs and outputs. DEA is an efficiency estimation technique, but it can be used for solving many problems of management such as rankig of DMUs. Many researchers have found similarity between DEA and MCDM techniques. One of the earliest techniques in MCDM is...

متن کامل

Detection of Mo geochemical anomaly in depth using a new scenario based on spectrum–area fractal analysis

Detection of deep and hidden mineralization using the surface geochemical data is a challenging subject in the mineral exploration. In this work, a novel scenario based on the spectrum–area fractal analysis (SAFA) and the principal component analysis (PCA) has been applied to distinguish and delineate the blind and deep Mo anomaly in the Dalli Cu–Au porphyry mineralization area. The Dalli miner...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994