Computer Science > Machine Learning
[Submitted on 30 Sep 2022]
Title: Efficient LSTM Training with Eligibility Traces
Abstract: Training recurrent neural networks is predominantly achieved via backpropagation through time (BPTT). However, this algorithm is not optimal from either a biological or a computational perspective. A more efficient and biologically plausible alternative to BPTT is e-prop. We investigate the applicability of e-prop to long short-term memory networks (LSTMs) for both supervised and reinforcement learning (RL) tasks. We show that e-prop is a suitable optimization algorithm for LSTMs by comparing it to BPTT on two supervised-learning benchmarks, demonstrating that e-prop can achieve learning even on problems with long sequences of several hundred timesteps. We introduce extensions that improve the performance of e-prop and can partially be applied to other network architectures. With the help of these extensions we show that, under certain conditions, e-prop can outperform BPTT on one of the two supervised-learning benchmarks. Finally, we deliver a proof of concept for integrating e-prop into RL in the domain of deep recurrent Q-learning.
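The core idea behind e-prop is to replace backpropagation through time with a per-synapse eligibility trace that is updated strictly forward in time and combined online with a local learning signal. The sketch below is not the authors' code: it illustrates, under stated assumptions, how the general e-prop factorization of Bellec et al. (2020) could be adapted to one weight matrix of an LSTM (the input-gate input weights, here called W_i). The trace propagates only through the cell state, using dc_t/dc_{t-1} = f_t and dropping backpropagation through the hidden state h; the sizes (n_in, n_hid, T), the random inputs, and the random stand-in learning signal L are all illustrative assumptions.

```python
# Hedged sketch of e-prop-style eligibility traces for an LSTM, assuming the
# factorization grad = sum_t L_t * e_t from Bellec et al. (2020); this is NOT
# the paper's implementation, and all names/sizes are hypothetical.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
n_in, n_hid, T = 4, 8, 100          # illustrative dimensions and length

# LSTM parameters: input weights W_*, recurrent weights U_*, shared zero bias.
W_i, W_f, W_g, W_o = (rng.normal(0.0, 0.1, (n_hid, n_in)) for _ in range(4))
U_i, U_f, U_g, U_o = (rng.normal(0.0, 0.1, (n_hid, n_hid)) for _ in range(4))
b = np.zeros(n_hid)

h = np.zeros(n_hid)                 # hidden state
c = np.zeros(n_hid)                 # cell state
eps = np.zeros((n_hid, n_in))       # eligibility vector for W_i (cell pathway)
grad_W_i = np.zeros_like(W_i)       # accumulated e-prop gradient estimate

for t in range(T):
    x = rng.normal(size=n_in)       # stand-in for the input stream

    # Standard LSTM step (gates use h from the previous timestep).
    i = sigmoid(W_i @ x + U_i @ h + b)
    f = sigmoid(W_f @ x + U_f @ h + b)
    g = np.tanh(W_g @ x + U_g @ h + b)
    o = sigmoid(W_o @ x + U_o @ h + b)
    c = f * c + i * g
    h = o * np.tanh(c)

    # Forward update of the eligibility vector: keep only the cell-state
    # pathway dc_t/dc_{t-1} = f_t (the e-prop approximation drops the
    # recurrence through h), plus the direct term dc_t/dW_i.
    eps = f[:, None] * eps + np.outer(g * i * (1.0 - i), x)

    # Eligibility trace: project eps through dh_t/dc_t = o_t * (1 - tanh^2(c_t)).
    e = (o * (1.0 - np.tanh(c) ** 2))[:, None] * eps

    # Local online learning signal L_t ~ dE_t/dh_t; a real task would supply
    # the instantaneous output error here instead of random values.
    L = rng.normal(size=n_hid)
    grad_W_i += L[:, None] * e      # e-prop gradient accumulation

print("e-prop gradient estimate for W_i has shape", grad_W_i.shape)
```

Because the trace is updated forward in time, the memory cost is independent of the sequence length, in contrast to BPTT, which must store activations for the whole unrolled sequence; this is the efficiency argument the abstract makes.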