bree7246
  • 19-08-2021
  • Computers and Technology
answered

In the gradient descent algorithm, we are more likely to reach the global minimum if the learning rate is set to a large value.

a. True
b. False

Answer:

oddkevin9
  • 19-08-2021
False
swan2414
  • 23-08-2021

Answer:

False, I think.

Explanation:

On a function with several minima, gradient descent is more likely to reach a local minimum than the global one. Which minimum it reaches depends on the starting point: different starting points can lead to different local minima (roughly, the lowest point of the basin the starting point lies in). A large learning rate does not fix this. If alpha (the learning rate) is too large, gradient descent can overshoot the minimum, fail to converge, and may even diverge.
