Computer Science > Machine Learning

arXiv:2306.10724 (cs)

[Submitted on 19 Jun 2023]

Title: Partial Hypernetworks for Continual Learning

Authors: Hamed Hemati, Vincenzo Lomonaco, Davide Bacciu, Damian Borth

Abstract: Hypernetworks mitigate forgetting in continual learning (CL) by generating task-dependent weights and penalizing weight changes at a meta-model level. However, generating all weights is computationally expensive for larger architectures, and it is not well understood whether generating all model weights is necessary. Inspired by latent replay methods in CL, we propose partial weight generation for the final layers of a model using hypernetworks while freezing the initial layers. With this objective, we first answer the question of how many layers can be frozen without compromising the final performance. Through several experiments, we empirically show that the number of layers that can be frozen is proportional to the distributional similarity in the CL stream. Then, to demonstrate the effectiveness of hypernetworks, we show that noisy streams can significantly impact the performance of latent replay methods, leading to increased forgetting when features from noisy experiences are replayed with old samples. In contrast, partial hypernetworks are more robust to noise, maintaining accuracy on previous experiences. Finally, we conduct experiments on the split CIFAR-100 and TinyImagenet benchmarks and compare different versions of partial hypernetworks to latent replay methods. We conclude that partial weight generation using hypernetworks is a promising solution to the problem of forgetting in neural networks, as it can provide an effective balance between computation and final test accuracy in CL streams.
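The idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes a single frozen linear+ReLU layer standing in for the frozen initial layers, a purely linear hypernetwork that maps a per-task embedding to the weights of one final classification layer, and a squared-difference penalty on the weights generated for earlier tasks as one plausible form of "penalizing weight changes at a meta-model level". All names and dimensions (`PartialHypernet`, `frozen_features`, `output_regularizer`, `IN_DIM`, and so on) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy dimensions, chosen only for illustration.
IN_DIM, FEAT_DIM, N_CLASSES, EMB_DIM = 8, 16, 3, 4

# Frozen initial layer: assumed fixed (e.g. after pretraining), never updated.
W_frozen = rng.normal(size=(IN_DIM, FEAT_DIM))


def frozen_features(x):
    """Initial layers shared by all tasks; their weights stay fixed."""
    return np.maximum(x @ W_frozen, 0.0)  # linear map followed by ReLU


class PartialHypernet:
    """Linear hypernetwork that generates only the final layer's weights."""

    def __init__(self, n_tasks):
        # One learnable embedding per task, plus the hypernetwork matrix H.
        self.task_emb = rng.normal(size=(n_tasks, EMB_DIM))
        n_out = FEAT_DIM * N_CLASSES + N_CLASSES  # head weights + bias
        self.H = rng.normal(scale=0.1, size=(EMB_DIM, n_out))

    def generate(self, task_id):
        """Map a task embedding to the (weights, bias) of the final layer."""
        flat = self.task_emb[task_id] @ self.H
        W = flat[: FEAT_DIM * N_CLASSES].reshape(FEAT_DIM, N_CLASSES)
        b = flat[FEAT_DIM * N_CLASSES:]
        return W, b

    def forward(self, x, task_id):
        """Frozen features followed by the task-specific generated head."""
        W, b = self.generate(task_id)
        return frozen_features(x) @ W + b


def output_regularizer(hnet, old_H, old_emb, prev_tasks):
    """Assumed penalty: squared change in weights generated for old tasks."""
    total = 0.0
    for t in prev_tasks:
        new = hnet.task_emb[t] @ hnet.H
        old = old_emb[t] @ old_H
        total += float(np.sum((new - old) ** 2))
    return total
```

In this sketch, only `H` and the task embeddings would be trained; before learning a new task one would snapshot `H` and `task_emb` and add `output_regularizer(...)` to the task loss so that the heads generated for earlier tasks drift as little as possible.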

Comments:	Accepted to the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2306.10724 [cs.LG]
	(or arXiv:2306.10724v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.10724

Submission history

From: Hamed Hemati
[v1] Mon, 19 Jun 2023 06:49:10 UTC (1,116 KB)
