Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
Authors
Abstract
We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an application in financial portfolio management where we can train a controller to directly optimize a Sharpe Ratio (or other risk-averse, non-additive) utility function. We illustrate the approach with experimental results using a kernel-based controller architecture that would not normally be considered in traditional reinforcement learning or approximate dynamic programming. We further show that using a non-additive criterion (incremental Sharpe Ratio) yields a noisy K-best-paths extraction problem that can give substantially improved performance.
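As a rough illustration of what a K-best-paths extraction involves, the sketch below finds the K highest-reward paths through a small layered graph by dynamic programming. It is a generic textbook-style sketch, not the algorithm from the paper: the function name `k_best_paths`, the `rewards[t][i][j]` layout, and the purely additive reward are all assumptions made for this example. Keeping only the K best partial paths per state is exact when rewards add up along a path; with a non-additive criterion such as the Sharpe Ratio, that pruning is no longer guaranteed to be exact.

```python
import heapq
from typing import List, Sequence, Tuple

Path = Tuple[float, List[int]]  # (total reward, visited states)


def k_best_paths(rewards: Sequence[Sequence[Sequence[float]]], k: int) -> List[Path]:
    """Return up to k highest-total-reward paths through a layered graph.

    rewards[t][i][j] is the (hypothetical) reward for moving from state i
    at step t to state j at step t + 1. Rewards are assumed additive.
    """
    n_states = len(rewards[0])
    # best[i] holds up to k partial paths that currently end in state i.
    best: List[List[Path]] = [[(0.0, [i])] for i in range(n_states)]
    for step in rewards:
        n_next = len(step[0])
        new_best: List[List[Path]] = []
        for j in range(n_next):
            # Extend every retained partial path by the transition i -> j.
            candidates = [
                (total + step[i][j], path + [j])
                for i in range(len(step))
                for total, path in best[i]
            ]
            # Keep only the k best partial paths ending in state j.
            new_best.append(heapq.nlargest(k, candidates, key=lambda c: c[0]))
        best = new_best
    # Merge the per-state lists and keep the k best complete paths.
    final = [c for per_state in best for c in per_state]
    return heapq.nlargest(k, final, key=lambda c: c[0])


# Toy example: 2 states, 2 transitions.
toy_rewards = [
    [[1.0, 0.0], [0.0, 2.0]],
    [[0.5, 0.0], [0.0, 3.0]],
]
top_paths = k_best_paths(toy_rewards, k=3)
```

In a supervised-learning setting of the kind the abstract describes, the extracted paths could serve as target decision sequences for a controller; how the paper actually builds its training set is not stated here.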
Similar resources
MULTIPERIOD CREDIBILITIC MEAN SEMI-ABSOLUTE DEVIATION PORTFOLIO SELECTION
In this paper, we discuss a multiperiod portfolio selection problem with fuzzy returns. We present a new credibilitic multiperiod mean semi-absolute deviation portfolio selection with some real factors including transaction costs, borrowing constraints, entropy constraints, threshold constraints and risk control. In the proposed model, we quantify the investment return and risk associated with...
Lexicographic goal programming approach for portfolio optimization
This paper investigates the optimum portfolio for an investor, taking into account 5 criteria. The mean-variance model of portfolio optimization introduced by Markowitz includes two objective functions; these two criteria, risk and return, do not encompass all of the information about an investment; information like annual dividends, S&P star ranking and return in later years which is ...
Primal and dual robust counterparts of uncertain linear programs: an application to portfolio selection
This paper proposes a family of robust counterparts for uncertain linear programs (LP), obtained for a general definition of the uncertainty region. The relationship between uncertainty sets using norm bodies and their corresponding robust counterparts defined by dual norms is presented. Those properties lead us to characterize primal and dual robust counterparts. The researchers show t...
Portfolio Optimization with Position Constraints: an Approximate Dynamic Programming Approach
We analyze dynamic portfolio choice problems using an approximate dynamic programming (ADP) algorithm. We extend the algorithm to the case of constraints on borrowing and implement a duality-based simulation procedure for estimating bounds on the true value function. We demonstrate that the ADP solution exhibits a high degree of accuracy in the considered examples, indicating that this is a pro...
Multiperiod portfolio management with bankruptcy control under a dynamic programming approach
Efficient portfolio management has long attracted financial researchers and been sought by investors. In this research, a multiperiod portfolio optimization problem for the asset liability management of an investor who intends to control the probability of bankruptcy is investigated. The proposed portfolio consists of a number of risky assets, a risk-free asset and a type of d...
Journal title:
- JCP
Volume 2, Issue
Pages -
Publication date 2007