Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution

نویسندگان

چکیده

Reinforcement learning is an important machine method and has become a hot popular research direction topic at present in recent years. The combination of reinforcement recommendation system, very application scenario application, always received close attention from researchers all sectors society. In this paper, we first propose feature engineering based on label distribution learning, which analyzes historical behavior analyzed constructs, whereby vectors are constructed for users products via learning. Then, algorithm value proposed. We designed the stochastic process process, described user’s state interaction (by including information their explicit implicit state), dynamically generated product recommendations through user feedback. Next, by studying hybrid strategies, combined dynamic static to fully utilize achieve high-quality algorithms. Finally, was validated, various relevant baseline models were compared demonstrate effectiveness study. With study, actually tested remarkable advantages design nonlinear expectations other homogeneous individual models. use systems with considerably increased accuracy, data utilization, robustness, model convergence speed, stability systems. incorporated idea into implementation main practical improved that its performance more accurate than same level computing power level. Moreover, due higher amount enhanced contains, it provides theoretical support basis can be used services, many prospects.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

Web Content Recommendation Methods Based on Reinforcement Learning

Information overload is no longer news; the explosive growth of the Internet has made this issue increasingly serious for Web users. Recommender systems aim at directing users through this information space, toward the resources that best meet their needs and interests. In this chapter we introduce our novel machine learning perspective toward the web recommendation problem, based on reinforcem...

متن کامل

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Reinforcement Learning Estimation of Distribution Algorithm

This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions to generate a new population of solutions. We call it Reinforcement Learning Estimation of Distribution Algorithm (RELEDA). For the estimation of the joint probability distribution we consider each variable as univariate. Then ...

متن کامل

Web pages ranking algorithm based on reinforcement learning and user feedback

The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11132895