Submodular Maximization using Test Scores
نویسندگان
چکیده
We consider the closely related problems of maximizing a submodular function subject to a cardinality constraint and that of maximizing the sum of submodular functions subject to a partition matroid contraint. Motivated by applications in team selection, we focus on settings where the function(s) can be expressed as the expected value of a symmetric submodular value function of a set of independent random variables with given prior distributions, and where, the algorithm designer may not be able to access the submodular function by means of value oracle queries. We consider a novel approach towards submodular maximization known as test score algorithms whose functioning is restricted to computing a test score for each element of the ground set and using these test scores to estimate the outputs of oracle calls. Our main contributions are test score algorithms that yield constant factor and logarithmic approximations respectively for the two problems along with new insights pertaining to function sketching. We also identify necessary and sufficient conditions under which test score algorithms provide constant-factor approximations for the submodular maximization problem. We evaluate the quality of our algorithms for team selection using data from a popular online labour platform for software development.
منابع مشابه
Multi-document Summarization via Budgeted Maximization of Submodular Functions
We treat the text summarization problem as maximizing a submodular function under a budget constraint. We show, both theoretically and empirically, a modified greedy algorithm can efficiently solve the budgeted submodular maximization problem near-optimally, and we derive new approximation bounds in doing so. Experiments on DUC’04 task show that our approach is superior to the bestperforming me...
متن کاملPhrase Table Pruning via Submodular Function Maximization
Phrase table pruning is the act of removing phrase pairs from a phrase table to make it smaller, ideally removing the least useful phrases first. We propose a phrase table pruning method that formulates the task as a submodular function maximization problem, and solves it by using a greedy heuristic algorithm. The proposed method can scale with input size and long phrases, and experiments show ...
متن کاملDifferentially Private Submodular Maximization: Data Summarization in Disguise (Full version)
How can we extract representative features from a dataset containing sensitive personal information, while providing individual-level privacy guarantees? Many data summarization applications are captured by the general framework of submodular maximization. As a consequence, a wide range of efficient approximation algorithms for submodular maximization have been developed. However, when such app...
متن کاملDistributed Submodular Maximization
Many large-scale machine learning problems – clustering, non-parametric learning, kernel machines, etc. – require selecting a small yet representative subset from a large dataset. Such problems can often be reduced to maximizing a submodular set function subject to various constraints. Classical approaches to submodular optimization require centralized access to the full dataset, which is impra...
متن کاملMirror-Descent-like Algorithms for Submodular Optimization
In this paper we develop a framework of submodular optimization algorithms in line with the mirror-descent style of algorithms for convex optimization. We use the fact that a submodular function has both a subdifferential and a superdifferential, which enables us to formulate algorithms for both submodular minimization and maximization. This reveals a unifying framework for a number of submodul...
متن کامل