Analysis of gradient descent methods with non-diminishing, bounded errors

Ramaswamy, Arunselvan; Bhatnagar, Shalabh

Computer Science > Systems and Control

arXiv:1604.00151 (cs)

[Submitted on 1 Apr 2016 (v1), last revised 18 Sep 2017 (this version, v3)]

Title:Analysis of gradient descent methods with non-diminishing, bounded errors

Authors:Arunselvan Ramaswamy, Shalabh Bhatnagar

View PDF

Abstract:The main aim of this paper is to provide an analysis of gradient descent (GD) algorithms with gradient errors that do not necessarily vanish, asymptotically. In particular, sufficient conditions are presented for both stability (almost sure boundedness of the iterates) and convergence of GD with bounded, (possibly) non-diminishing gradient errors. In addition to ensuring stability, such an algorithm is shown to converge to a small neighborhood of the minimum set, which depends on the gradient errors. It is worth noting that the main result of this paper can be used to show that GD with asymptotically vanishing errors indeed converges to the minimum set. The results presented herein are not only more general when compared to previous results, but our analysis of GD with errors is new to the literature to the best of our knowledge. Our work extends the contributions of Mangasarian & Solodov, Bertsekas & Tsitsiklis and Tadic & Doucet. Using our framework, a simple yet effective implementation of GD using simultaneous perturbation stochastic approximations (SP SA), with constant sensitivity parameters, is presented. Another important improvement over many previous results is that there are no `additional' restrictions imposed on the step-sizes. In machine learning applications where step-sizes are related to learning rates, our assumptions, unlike those of other papers, do not affect these learning rates. Finally, we present experimental results to validate our theory.

Comments:	arXiv admin note: text overlap with arXiv:1502.01953, IEEE Transactions on Automatic Control, 2017
Subjects:	Systems and Control (eess.SY); Machine Learning (stat.ML)
MSC classes:	93E15, 93E35
Cite as:	arXiv:1604.00151 [cs.SY]
	(or arXiv:1604.00151v3 [cs.SY] for this version)
	https://doi.org/10.48550/arXiv.1604.00151

Submission history

From: Arunselvan Ramaswamy [view email]
[v1] Fri, 1 Apr 2016 07:03:46 UTC (17 KB)
[v2] Tue, 27 Sep 2016 14:36:07 UTC (27 KB)
[v3] Mon, 18 Sep 2017 08:56:56 UTC (30 KB)

Computer Science > Systems and Control

Title:Analysis of gradient descent methods with non-diminishing, bounded errors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Systems and Control

Title:Analysis of gradient descent methods with non-diminishing, bounded errors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators