We study active feature selection, a novel selection setting in which unlabeled data is available, but the budget for labels limited, and examples to label can be actively selected by algorithm. focus on using classical mutual information criterion, selects k features with largest label. In setting, goal use significantly fewer than set size still find whose based entire large. explain experime...