Search results for: specifying allowable transitions from each state for each player
Number of results: 12554131
Drought is a transient, slow, and recurring phenomenon and an integral part of the climate of every region. It begins with a substantial drop in precipitation below the long-term average; over time, soil moisture and surface and groundwater resources decline. This phenomenon is especially significant in the Bakhtegan basin because of its importance in strategic ...
The story “al-obnosat al-bayza” is one of the short stories in a collection by Hanna Mineh. It portrays the lives of several people during a shared voyage, each of whom symbolizes an aspect of human nature (love, mind, kindness), and examines social, cultural, and political aspects of the governing system, and it...
About 60% of the insurance industry's total premium worldwide pertains to life policies, while in Iran life insurance accounted for less than 6% of the industry's total premium in 2008 (Sigma, No 3/2009). Among the factors that hold back the life insurance industry is the problem of adverse selection. Adverse selection theory describes a situation where the inf...
In his last years, Americans had almost forgotten Steinbeck, and those who had not occasionally criticized him for supporting and praising the U.S. military intervention in the Vietnam War. Although Steinbeck incorporates different themes into his works, each one bears many signs of his concern and sympathy for America, and for the American common people in particular. Steinbeck's To a God ...
The key principles behind current peer-to-peer research include fully distributing service functionality among all nodes participating in the system and routing individual requests based on a small amount of locally maintained state. The goals extend much further than just improving raw system performance: such systems must survive massive concurrent failures, denial of service attacks, etc. Th...
In this paper we consider the problem of learning the optimal dynamic policy for uncontrolled restless bandit problems. In an uncontrolled restless bandit problem, there is a finite set of arms, each of which when played yields a non-negative reward. There is a player who sequentially selects one of the arms at each time step. The goal of the player is to maximize its undiscounted reward over a...