Additive Groves of Regression Trees

نویسندگان

  • Daria Sorokina
  • Rich Caruana
  • Mirek Riedewald
چکیده

We present a new regression algorithm called Additive Groves and show empirically that it is superior in performance to a number of other established regression methods. A single Grove is an additive model containing a small number of large trees. Trees added to a Grove are trained on the residual error of other trees already in the model. We begin the training process with a single small tree and gradually increase both the number of trees in the Grove and their size. This procedure ensures that the resulting model captures the additive structure of the response. A single Grove may still overfit to the training set, so we further decrease the variance of the final predictions with bagging. We show that in addition to exhibiting superior performance on a suite of regression test problems, Additive Groves are very resistant to overfitting.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Additive Groves to the Learning to Rank Challenge

This is a description of the team AG submission to the Learning to Rank Challenge. This solution has scored 4th place in the main track. The primary algorithm used is Additive Groves of regression trees.

متن کامل

Additive Groves in LTRC Application of Additive Groves to the Learning to Rank Challenge

This paper describes a submission of team AG to the Yahoo! Learning to Rank Challenge held in 2010. This solution has scored 4th place in the main track. The primary algorithm used is Additive Groves of regression trees. 1. Competition and Data Yahoo! Labs organized the first Learning to Rank Challenge in spring 2010. The challenge ran from March 1 to May 31 and received 4, 736 submissions from...

متن کامل

Modeling Additive Structure and Detecting Interactions with Groves of Trees

Discovery of additive structure is an important step towards understanding a complex multi-dimensional function, because it allows for expressing this function as the sum of lower-dimensional or otherwise simpler components. Modeling additive structure also opens up opportunities for learning better regression models. The term statistical interaction is used to describe the presence of non-addi...

متن کامل

The role of trees as a natural index in post-disaster reconstruction (Case Study: Palm groves of Bam, Following the 2003 Bam earthquake)

Background & objective: Trees, as an influential element, have an important role in post disaster reconstruction in four aspects; they can be used as "temporary settlement materials", "reviving collective memories", "creating calm” and “motivation for reconstruction". In addition, as "living memorials”, they remind the disaster and indicate the necessity of preparedness and resilience of societ...

متن کامل

Evaluation of Palm Groves Technical Efficiency Using Bootstrap Data Envelopment Analysis: A Case Study of Roodkhanehbar Area, Iran

Roodkhnehbar area, having approximately 111 thousands of Keriteh palm trees, is one of the most important areas of date production in the Rudan County[1]and the source of peoples’ income in this area, directly or indirectly. As a result, its production efficiency has a critical importance to the orchardists in this region. This study aims to evaluate technical efficiency of palm groves in this ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007