In the age of artificial intelligence, handling huge amounts of data effectively is a tremendously motivating and hard problem. Among machine learning optimization methods, stochastic gradient descent (SGD) is not only simple but also very effective. This study provides a detailed analysis of contemporary state-of-the-art deep learning applications, such as natural language processing (NLP), visual data processing, and voice and audio processing. Following that, this study introduces several variants of SGD that are already implemented in the PyTorch optimizer package, including SGD with momentum, Adagrad, Adadelta, RMSprop, Adam, AdamW, and so on. Finally, we propose theoretical conditions under which these methods are applicable and discover that there is still a gap between the conditions under which the algorithms provably converge and the settings in which they are used in practice; how to bridge this gap is a question for future research.
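To make the optimizer list above concrete, here is a minimal sketch of how these variants are selected in PyTorch. The two-layer toy model, the random data, and the learning rates are illustrative assumptions, not details taken from the study; only the `torch.optim` classes themselves are as named in the abstract.

```python
import torch
import torch.nn as nn

# Toy model and mini-batch; illustrative assumptions, not from the study.
model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))
x, y = torch.randn(64, 10), torch.randn(64, 1)
loss_fn = nn.MSELoss()

# The SGD variants discussed above, as exposed by torch.optim.
optimizers = {
    "SGD":      torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9),
    "Adagrad":  torch.optim.Adagrad(model.parameters(), lr=0.01),
    "Adadelta": torch.optim.Adadelta(model.parameters()),
    "RMSprop":  torch.optim.RMSprop(model.parameters(), lr=0.001),
    "Adam":     torch.optim.Adam(model.parameters(), lr=0.001),
    "AdamW":    torch.optim.AdamW(model.parameters(), lr=0.001, weight_decay=0.01),
}

# One optimization step with a chosen variant: compute a stochastic
# gradient on the mini-batch, then let the optimizer apply its update rule.
opt = optimizers["Adam"]
opt.zero_grad()
loss = loss_fn(model(x), y)
loss.backward()
opt.step()
```

In an actual training run only one optimizer would be constructed; the dictionary is just to show that swapping between these SGD variants is a one-line change.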
The deep learning community has devised a diverse set of methods to make gradient optimization, usin...
Large-scale learning problems require algorithms that scale benignly with respect to the size of the...
In stochastic gradient descent (SGD) and its variants, the optimized gradient estimators may be as e...
Stochastic Gradient Descent (SGD) remains a popular optimizer for deep learning networks a...
Thesis (Ph.D.)--University of Washington, 2019. Tremendous advances in large scale machine learning an...
Optimization has been the workhorse of solving machine learning problems. However, the efficiency of...
The Stochastic Gradient Descent (SGD) algorithm, despite its simplicity, is considered an effective and ...
Stochastic Gradient Descent (SGD) is the workhorse for training large-scale machine learning applica...
The goal of this paper is to debunk and dispel the magic behind black-box optimizers and stochastic ...
This thesis reports on experiments aimed at explaining why machine learning algorithms using the gre...
While evolutionary algorithms (EAs) have long offered an alternative approach to optimization, in re...
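All of the snippets above orbit the same basic iteration. As a reference point (these are the standard textbook updates, not a formulation taken from any single work listed here), plain SGD on a mini-batch loss $f_i$ and its momentum variant update the parameters $\theta$ with step size $\eta$ as follows:

```latex
% Plain SGD step on the mini-batch objective f_i (standard definition):
\theta_{t+1} = \theta_t - \eta\,\nabla f_i(\theta_t)

% Momentum variant with coefficient \mu
% (the form implemented by torch.optim.SGD's `momentum` argument):
v_{t+1} = \mu\,v_t + \nabla f_i(\theta_t), \qquad
\theta_{t+1} = \theta_t - \eta\,v_{t+1}
```

The adaptive methods named earlier (Adagrad, Adadelta, RMSprop, Adam, AdamW) replace the single step size $\eta$ with a per-coordinate scale derived from running statistics of past gradients.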