Search results for: reward packages
Number of results: 46029
Copyright held by the owner/author(s). CHI’18 Extended Abstracts, April 21–26, 2018, Montreal, QC, Canada. ACM 978-1-4503-5621-3/18/04. https://doi.org/10.1145/3170427.3188563 Abstract: We present Codestrate Packages, a package-based system to create extensible software within Codestrates. Codestrate Packages turns content creation from an application-centric model into a document-centric model. C...
Horizontal intracortical projections for agonist and antagonist muscles exist in the primary motor cortex (M1), and reward may induce a reinforcement of transmission efficiency of intracortical circuits. We investigated reward-induced change in M1 excitability for agonist and antagonist muscles. Participants were 8 healthy volunteers. Probabilistic reward tasks comprised 3 conditions of 30 tria...
Previous reports have described that neural activities in midbrain dopamine areas are sensitive to unexpected reward delivery and omission. These activities are correlated with reward prediction error in reinforcement learning models, the difference between predicted reward values and the obtained reward outcome. These findings suggest that the reward prediction error signal in the brain update...
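The prediction-error definition in the entry above can be illustrated with a minimal sketch. It assumes a simple delta-rule update, which the excerpt does not specify; the function names, the learning rate `alpha`, and the reward sequence are all hypothetical.

```python
def prediction_error(predicted, obtained):
    """Reward prediction error: obtained outcome minus predicted value."""
    return obtained - predicted


def update_value(predicted, obtained, alpha=0.1):
    """Move the prediction a fraction alpha toward the obtained reward.

    The delta rule used here is an assumption for illustration only.
    """
    return predicted + alpha * prediction_error(predicted, obtained)


if __name__ == "__main__":
    value = 0.0
    for reward in [1.0, 1.0, 0.0, 1.0]:  # hypothetical outcomes
        delta = prediction_error(value, reward)
        value = update_value(value, reward, alpha=0.5)
        print(f"reward={reward:.1f} error={delta:+.3f} new value={value:.3f}")
```

An unexpected reward gives a positive error and an unexpected omission gives a negative one, matching the sensitivity to delivery and omission described in the excerpt.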
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact that, in an asynchronous environment, the state of the environment may change during computation per...
Zhang and Zafar proposed a video compression scheme based on the wavelet representation and multiresolution motion compensation (MRMC). In this letter, an additional masking module will be created to further enhance its efficiency. Specifically, between the modules of wavelet decomposition and MRMC, the masking module will be inserted which will construct binary images based on the difference o...
Reinforcement learning agents interacting with a complex environment like the real world are unlikely to behave optimally all the time. If such an agent is operating in real-time under human supervision, now and then it may be necessary for a human operator to press the big red button to prevent the agent from continuing a harmful sequence of actions—harmful either for the agent or for the envi...
The tourism industry has reported dramatic changes in its structure over the last few years, not least due to the emergence of the Internet. Among other things, tourists want to find products that are tailored to their personal needs in a minimum of time, without having to navigate through all the products offered by the tourism information system. Thus, from the tourism information supplier's point of...
In this demo, we present PACKAGEBUILDER, a system that extends database systems to support package queries. A package is a collection of tuples that individually satisfy base constraints and collectively satisfy global constraints. The need for package support arises in a variety of scenarios: For example, in the creation of meal plans, users are not only interested in the nutritional content o...
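The package definition in the entry above (tuples that individually satisfy base constraints and collectively satisfy global constraints) can be sketched as a brute-force search. This is not PACKAGEBUILDER's method or query language, which the excerpt does not show; the meal data, the gluten-free base constraint, the calorie range, and all function names are hypothetical.

```python
from itertools import combinations


def find_packages(tuples, base_ok, global_ok, size):
    """Return every combination of `size` tuples whose members each pass
    the base constraint and which together pass the global constraint."""
    candidates = [t for t in tuples if base_ok(t)]
    return [pkg for pkg in combinations(candidates, size) if global_ok(pkg)]


# Hypothetical meal data: (name, calories, gluten_free)
MEALS = [
    ("oatmeal", 300, False),
    ("salad", 250, True),
    ("rice bowl", 600, True),
    ("omelette", 400, True),
    ("smoothie", 200, True),
]


def is_gluten_free(meal):
    """Base constraint: checked on each tuple individually."""
    return meal[2]


def total_calories_in_range(package, low=1200, high=1300):
    """Global constraint: checked on the package as a whole."""
    return low <= sum(meal[1] for meal in package) <= high


if __name__ == "__main__":
    plans = find_packages(MEALS, is_gluten_free, total_calories_in_range, size=3)
    for package in plans:
        print([meal[0] for meal in package])
```

Enumerating all combinations grows quickly with the number of tuples, so this only illustrates the semantics of a package, not a practical evaluation strategy.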
Electronic systems with the increased functionality and speed required in today’s advanced applications are placing a performance burden on the interconnection and packaging technologies that standard electrical “wiring” approaches simply cannot support. Engineering staff at APL recognized this trend several years ago, and in a collaborative effort with The Johns Hopkins University Department of...