Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Is RL just trial-and-error learning, or does it include planning?

0
Posted

Is RL just trial-and-error learning, or does it include planning?

0

Modern reinforcement learning concerns both trial-and-error learning without a model of the environment, and deliberative planning with a model. By “a model” here we mean a model of the dynamics of the environment. In the simplest case, this means just an estimate of the state-transition probabilities and expected immediate rewards of the environment. In general it means any predictions about the environment’s future behavior conditional on the agent’s behavior.

Related Questions

What is your question?

*Sadly, we had to bring back ads too. Hopefully more targeted.

Experts123