
Statistical Analysis of Fixed Mini-Batch Gradient Descent Estimator

Pages 1348-1360 | Received 04 Jan 2022, Accepted 22 Mar 2023, Published online: 06 Jun 2023
