Given the example from a data set x1 = 4, x2 = 1, weights w1 = -0.02, w2 = -0.03, bias b = 0, observed value y = 2, predicted value y’ = -0.11 and learning rate = 0.05. What will be the next weight updating values if you are using a squared difference approach?

🎲 Try a Random Question  |  Total Questions in Quiz: 45  |  🧠 Study this quiz with Flashcards
This question is part of a full practice quiz:
Machine Learning 101 Practice Test: Stochastic Gradient Descent — practice the complete quiz, review flashcards, or try a random question.

Quiz on optimization algorithms, specifically focusing on Stochastic Gradient Descent (SGD), its variants, the standard Gradient Descent Algorithm, and Subgradient Descent. Stochastic Gradient Descent (SGD) is a gradient-based optimization algorithm that finds the optimal parameter configuration for a machine learning algorithm. It's a variant of the Gradient Descent algorithm.  SGD iteratively updates a model's parameters one sample or batch at a time. This makes SGD models computationally efficient and able to handle large datasets.  SGD is one of three types of gradient descent learning... Show more

Given the example from a data set x<sub>1</sub> = 4, x<sub>2</sub> = 1, weights w<sub>1</sub> = -0.02, w<sub>2</sub> = -0.03, bias b = 0, observed value y = 2, predicted value y’ = -0.11 and learning rate = 0.05. What will be the next weight updating values if you are using a squared difference approach?