CS 201 Reimagining Gradient Descent: Large Stepsize, Oscillation
![](https://www.cs.ucla.edu/wp-content/uploads/cs/CS201-JINGFENG-WU-PIC.png)
![](https://www.cs.ucla.edu/wp-content/uploads/cs/Copy-of-EFE-41-1080x675.png)
Uncategorized
![](https://www.mdpi.com/jmse/jmse-10-01376/article_deploy/html/images/jmse-10-01376-g003.png)
JMSE, Free Full-Text
![](https://d3i71xaburhd42.cloudfront.net/5db97973c1d3f0a8bd275d6640d7de927f37182a/11-Figure3-1.png)
Stochastic Gradient Descent with Large Learning Rate
![](https://ars.els-cdn.com/content/image/3-s2.0-B978012372536350003X-f03-12-9780123725363.jpg)
Gradient Descent Method - an overview
![](https://www.cs.ucla.edu/wp-content/uploads/cs/DSC_1179_Gulzar_GoogleFellow2017_600px.jpg)
Archives
![](https://i.stack.imgur.com/7Dwpb.png)
optimization - ADAM Gradient descent oscillates close to minimum - Cross Validated
![](https://www.cs.ucla.edu/wp-content/uploads/cs/CS201-JINGFENG-WU-PIC.png)
CS 201, Reimagining Gradient Descent: Large Stepsize, Oscillation, and Acceleration, JINGFENG WU, UC Berkeley
![](https://wiki.cloudfactory.com/media/pages/docs/mp-wiki/solvers-optimizers/sgd/7a794f56d5-1684142770/gradient-descent-optimized-1.webp)
SGD CloudFactory Computer Vision Wiki
![](https://www.cs.ucla.edu/wp-content/uploads/cs/nae.png)
announcements
![](https://www.cs.ucla.edu/wp-content/uploads/cs/icpc.jpg)
announcements
![](https://pub.mdpi-res.com/jmse/jmse-10-01376/article_deploy/html/images/jmse-10-01376-g012.png?1664194985)
JMSE, Free Full-Text
![](https://www.cs.cornell.edu/courses/cs4780/2018fa/lectures/images/gradient_descent/gradientvsnewton_after6gradientsteps.png)
Lecture 7: Gradient Descent (and Beyond)
![](https://www.cs.ucla.edu/wp-content/themes/Divi-child/images/default-image.jpg)
CS 201- Jon Postel Distinguished Lecture: Finding Very Damaging Needles in Very Large Haystacks, VERN PAXSON, UC Berkeley
![](https://www.mldawn.com/wp-content/uploads/2020/08/gradients_per_loss_surface.png)
Stochastic Approximation to Gradient Descent
![](https://miro.medium.com/v2/resize:fit:1090/1*Il8J3K0K9ZpIBzi3sLMVNg.png)
Gradient Descent Problems and Solutions in Neural Networks