Computer Science > Machine Learning

arXiv:2203.01400 (cs)

[Submitted on 2 Mar 2022 (v1), last revised 25 Jan 2023 (this version, v3)]

Title:Adaptive Gradient Methods with Local Guarantees

Authors:Zhou Lu, Wenhan Xia, Sanjeev Arora, Elad Hazan

View PDF

Abstract:Adaptive gradient methods are the method of choice for optimization in machine learning and used to train the largest deep models. In this paper we study the problem of learning a local preconditioner, that can change as the data is changing along the optimization trajectory. We propose an adaptive gradient method that has provable adaptive regret guarantees vs. the best local preconditioner. To derive this guarantee, we prove a new adaptive regret bound in online learning that improves upon previous adaptive online learning methods. We demonstrate the robustness of our method in automatically choosing the optimal learning rate schedule for popular benchmarking tasks in vision and language domains. Without the need to manually tune a learning rate schedule, our method can, in a single run, achieve comparable and stable task accuracy as a fine-tuned optimizer.

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2203.01400 [cs.LG]
	(or arXiv:2203.01400v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.01400

Submission history

From: Zhou Lu [view email]
[v1] Wed, 2 Mar 2022 20:45:14 UTC (1,927 KB)
[v2] Sat, 5 Mar 2022 02:22:42 UTC (1,933 KB)
[v3] Wed, 25 Jan 2023 21:06:46 UTC (1,345 KB)

Computer Science > Machine Learning

Title:Adaptive Gradient Methods with Local Guarantees

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adaptive Gradient Methods with Local Guarantees

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators