ladybug2489 ladybug2489

30-05-2023
Business

contestada

you're trying to use reinforcement learning to build a path planning system for an indoor autonomous robot. You want it to enter a specific room the end-user specifies, so you define a reward function to give a huge positive reward when it enters that room. After training, you notice some strange behaviour… what do you notice?

a. nothing, everything works as intended.
b. the robot avoids the room
c. once the robot enters the room, it never leaves.
d. once it gets to the room, the robot enters and exits the room endlessly

which of the following is false about reinforcement learning?

a. find a model which yields the greatest average expected reward
b. reinforcement learning is a award based learning
c. reinforcement learning is a type of supervised learning
d. reinforcement learning is an online learning

Respuesta :

Otras preguntas

In robert campin’s triptych of the annunciation, what everyday object was turned into a religious symbol?

If t(n) equals 3+2n, what is the 5th term?

What does the president swear to preserve protect and defend?

What is the average mass of a single chlorine atom in grams?

Cueva de las manos in argentina is known for cave paintings of what?

Number three . Show work plz

Lisas store collects 5 percent sales tax on every item sold. if she collected $22.00 in sales tax, what was the cost of the items her store sold?

A box contains 3 yellow, 2 red, 4 green and 3 black marbles. Two marbles are taken one after the other at random from the box. What is the probability that bot

Sally has a few choices for lunch; a hamburger, hotdog, or BBQ, with french fries, or chips. How many different combinations are there to choose from?

What is 70/10 equivalent fraction?