Node harvest
Author
Abstract
When choosing a suitable technique for regression and classification with multivariate predictor variables, one is often faced with a tradeoff between interpretability and high predictive accuracy. To give a classical example, classification and regression trees are easy to understand and interpret. Tree ensembles like Random Forests usually provide more accurate predictions. Yet tree ensembles are also more difficult to analyze than single trees and are often criticized, perhaps unfairly, as ‘black box’ predictors. Node harvest tries to reconcile the two aims of interpretability and predictive accuracy by combining positive aspects of trees and tree ensembles. Results are very sparse and interpretable, and predictive accuracy is extremely competitive, especially for data with a low signal-to-noise ratio. The procedure is simple: an initial set of a few thousand nodes is generated randomly. If a new observation falls into just a single node, its prediction is the mean response of all training observations within this node, identical to a tree-like prediction. A new observation typically falls into several nodes, and its prediction is then the weighted average of the mean responses across all these nodes. The only role of node harvest is to ‘pick’ the right nodes from the initial large ensemble by choosing node weights, which in the proposed algorithm amounts to a quadratic programming problem with linear inequality constraints. The solution is sparse in the sense that only very few nodes are selected with a nonzero weight, even though this sparsity is not explicitly enforced. Perhaps surprisingly, it is not necessary to select a tuning parameter for optimal predictive accuracy. Node harvest can handle mixed data and missing values, and is shown to be simple to interpret and competitive in predictive accuracy on a variety of data sets.
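The prediction rule described in the abstract can be illustrated with a minimal Python sketch. It covers only the prediction step (a weighted average of node means over the nodes an observation falls into) and assumes the node weights have already been chosen; the quadratic program that selects them is not shown. The names `Node`, `contains`, `mean_response`, `weight`, and `predict`, as well as the toy nodes and numbers, are hypothetical and are not taken from the paper or from any node harvest software.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Node:
    """A hypothetical node: a membership rule plus its fitted summary."""
    contains: Callable[[Sequence[float]], bool]  # does observation x fall into the node?
    mean_response: float                         # mean response of training observations in the node
    weight: float                                # node weight (assumed given; zero means "not selected")


def predict(nodes: List[Node], x: Sequence[float]) -> float:
    """Weighted average of mean responses over the selected nodes containing x."""
    active = [n for n in nodes if n.weight > 0 and n.contains(x)]
    total_weight = sum(n.weight for n in active)
    if total_weight == 0:
        raise ValueError("observation falls into no node with nonzero weight")
    return sum(n.weight * n.mean_response for n in active) / total_weight


# Toy ensemble (illustrative values only): a root node covering every
# observation, one selected node, and one node with zero weight.
nodes = [
    Node(contains=lambda x: True, mean_response=2.0, weight=0.5),
    Node(contains=lambda x: x[0] > 1.0, mean_response=4.0, weight=0.5),
    Node(contains=lambda x: x[0] > 5.0, mean_response=9.0, weight=0.0),
]

single_node_prediction = predict(nodes, [0.0])  # falls into the root node only
multi_node_prediction = predict(nodes, [2.0])   # falls into two selected nodes
```

In this toy setup, an observation that falls into a single selected node receives that node's mean response, exactly as a tree would predict, while an observation covered by several selected nodes receives their weighted average. Nodes with zero weight never contribute, which is what makes a sparse solution easy to read.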