Computer Science > Machine Learning

arXiv:2306.16993 (cs)

[Submitted on 29 Jun 2023]

Title:Weight Compander: A Simple Weight Reparameterization for Regularization

Authors:Rinor Cakaj, Jens Mehnert, Bin Yang

View PDF

Abstract:Regularization is a set of techniques that are used to improve the generalization ability of deep neural networks. In this paper, we introduce weight compander (WC), a novel effective method to improve generalization by reparameterizing each weight in deep neural networks using a nonlinear function. It is a general, intuitive, cheap and easy to implement method, which can be combined with various other regularization techniques. Large weights in deep neural networks are a sign of a more complex network that is overfitted to the training data. Moreover, regularized networks tend to have a greater range of weights around zero with fewer weights centered at zero. We introduce a weight reparameterization function which is applied to each weight and implicitly reduces overfitting by restricting the magnitude of the weights while forcing them away from zero at the same time. This leads to a more democratic decision-making in the network. Firstly, individual weights cannot have too much influence in the prediction process due to the restriction of their magnitude. Secondly, more weights are used in the prediction process, since they are forced away from zero during the training. This promotes the extraction of more features from the input data and increases the level of weight redundancy, which makes the network less sensitive to statistical differences between training and test data. We extend our method to learn the hyperparameters of the introduced weight reparameterization function. This avoids hyperparameter search and gives the network the opportunity to align the weight reparameterization with the training progress. We show experimentally that using weight compander in addition to standard regularization methods improves the performance of neural networks.

Comments:	Accepted by The International Joint Conference on Neural Network (IJCNN) 2023
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.16993 [cs.LG]
	(or arXiv:2306.16993v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.16993
Journal reference:	IJCNN 2023

Submission history

From: Rinor Cakaj [view email]
[v1] Thu, 29 Jun 2023 14:52:04 UTC (301 KB)

Computer Science > Machine Learning

Title:Weight Compander: A Simple Weight Reparameterization for Regularization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Weight Compander: A Simple Weight Reparameterization for Regularization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators