نتایج جستجو برای: regret minimization
تعداد نتایج: 37822 فیلتر نتایج به سال:
introduction: regret after sterilization, not only requires an expensive surgical procedure for recanalization, but also can affect the couples’ inter and intra-relationships. whence, undergoing sterilization in sistan and baluchestan has the lowest rate compared to other provinces, the habitants’ regret after sterilization may further discourage other volunteers of the procedures. therefore, t...
We compare level-k utility maximization and level-k regret minimization in evolutionary competition based on averages of one-shot plays over many randomly generated strategic games. Under the assumption that Theory-of-Mind-reasoning of depth k incurs a cost monotonically increasing with k, our results show that mixed states with low levels of reasoning depth k can be evolutionary stable with su...
We study online learning with the general notion of transductive regret, that is regret with modification rules applying to expert sequences (as opposed to single experts) that are representable by weighted finite-state transducers. We show how transductive regret generalizes existing notions of regret, including: (1) external regret; (2) internal regret; (3) swap regret; and (4) conditional sw...
Game theoretic solution concepts, such as Nash equilibrium strategies that are optimal against worst case opponents, provide guidance in finding desirable autonomous agent behaviour. In particular, we wish to approximate solutions to complex, dynamic tasks, such as negotiation or bidding in auctions. Computational game theory investigates effective methods for computing such strategies. Solving...
We consider stochastic smoothing of spectral functions of matrices using perturbations commonly studied in random matrix theory. We show that a spectral function remains spectral when smoothed using a unitarily invariant perturbation distribution. We then derive state-of-the-art smoothing bounds for the maximum eigenvalue function using the Gaussian Orthogonal Ensemble (GOE). Smoothing the maxi...
We examine risk attitudes under regret theory and derive analytical expressions for two components—the resolution and regret premiums—of the risk premium under regret theory. We posit that regret-averse decision makers are risk seeking (resp., risk averse) for low (resp., high) probabilities of gains and that feedback concerning the forgone option reinforces risk attitudes. We test these hypoth...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید