Predicting Execution Time of Computer Programs Using Sparse Polynomial Regression
نویسندگان
چکیده
Predicting the execution time of computer programs is an important but challenging problem in the community of computer systems. Existing methods require experts to perform detailed analysis of program code in order to construct predictors or select important features. We recently developed a new system to automatically extract a large number of features from program execution on sample inputs, on which prediction models can be constructed without expert knowledge. In this paper we study the construction of predictive models for this problem. We propose the SPORE (Sparse POlynomial REgression) methodology to build accurate prediction models of program performance using feature data collected from program execution on sample inputs. Our two SPORE algorithms are able to build relationships between responses (e.g., the execution time of a computer program) and features, and select a few from hundreds of the retrieved features to construct an explicitly sparse and non-linear model to predict the response variable. The compact and explicitly polynomial form of the estimated model could reveal important insights into the computer program (e.g., features and their non-linear combinations that dominate the execution time), enabling a better understanding of the program’s behavior. Our evaluation on three widely used computer programs shows that SPORE methods can give accurate prediction with relative error less than 7% by using a moderate number of training data samples. In addition, we compare SPORE algorithms to state-of-the-art sparse regression algorithms, and show that SPORE methods, motivated by real applications, outperform the other methods in terms of both interpretability and prediction accuracy.
منابع مشابه
Predicting the Efficiency of Decision-Making Unit by Using Piecewise Polynomial Extrapolation in Different Times
In this article, we will estimate efficiency amountof decision-making unit by offering the continuous piecewise polynomialextrapolation and interpolation by CCR model input-oriented on the assumptionthat it is constant returns to scale in different times. And finally, we willestimate efficiency amount of decision-making unit indifferent times byoffering an example.
متن کاملCompile-time Detection of Uncaught Exceptions in Standard ML Programs
We present a static analysis that detects potential runtime exceptions that are raised and never handled inside Standard ML programs. This analysis enhances the software safety by predicting, prior to the program execution, the abnormal termination caused by unhandled exceptions. Our analysis prototype has been implemented by using a semantics-based analyzer generator and has been successfully ...
متن کاملBounded Parallelism in Computer Algebra
This thesis examines the performance improvements that can be made by exploiting parallel processing in symbolic mathematical computation. The study focuses on the use of high-level parallelism in the case where the number of processors is fixed and independent of the problem size, as in existing multiprocessors. Since seemingly small changes to the inputs can cause dramatic changes in the exec...
متن کاملPredicting the Resilience of Obfuscated Code Against Symbolic Execution Attacks via Machine Learning
Software obfuscation transforms code such that it is more difficult to reverse engineer. However, it is known that given enough resources, an attacker will successfully reverse engineer an obfuscated program. Therefore, an open challenge for software obfuscation is estimating the time an obfuscated program is able to withstand a given reverse engineering attack. This paper proposes a general fr...
متن کاملSparse Semidefinite Programs with Near-Linear Time Complexity
Some of the strongest polynomial-time relaxations to NP-hard combinatorial optimization problems are semidefinite programs (SDPs), but their solution complexity of up to O(nL) time and O(n) memory for L accurate digits limits their use in all but the smallest problems. Given that combinatorial SDP relaxations are often sparse, a technique known as chordal conversion can sometimes reduce complex...
متن کامل