Computer Science > Machine Learning

arXiv:2306.06146 (cs)

[Submitted on 9 Jun 2023 (v1), last revised 18 Nov 2023 (this version, v2)]

Title: Hidden Classification Layers: Enhancing linear separability between classes in neural networks layers

Authors: Andrea Apicella, Francesco Isgrò, Roberto Prevete

Abstract: In the context of classification problems, Deep Learning (DL) approaches represent the state of the art. Many DL approaches are based on variations of standard multi-layer feed-forward neural networks, also referred to as deep networks. The basic idea is that each hidden neural layer performs a data transformation that is expected to make the data representation "somewhat more linearly separable" than the previous one, so that the final data representation is as linearly separable as possible. However, determining the neural network parameters that can perform these transformations is a critical problem. In this paper, we investigate how the performance of deep network classifiers is affected by a training approach that favours solutions in which the data representations at the hidden layers have a higher degree of linear separability between the classes than those obtained with standard methods. To this end, we propose a neural network architecture which induces an error function involving the outputs of all the network layers. Although similar approaches have already been partially discussed in the literature, here we propose a new architecture with a novel error function and an extensive experimental analysis. The experimental analysis was carried out on image classification tasks using four widely used datasets. The results show that our approach improves the accuracy on the test set in all the considered cases.
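The abstract does not spell out the architecture or the error function, so the following is only a generic sketch of the underlying idea: attach a linear classifier to each hidden layer and add its loss to the loss of the final layer. All names here (`layerwise_loss`, `heads`, `aux_weight`), the layer sizes, the random initialisation, and the plain weighted sum of cross-entropy terms are assumptions made for illustration and should not be read as the authors' method.

```python
import math
import random


def linear(x, weights, bias):
    """Affine map: one output per row of `weights`."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def relu(v):
    return [max(0.0, a) for a in v]


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example, computed stably."""
    m = max(logits)
    log_norm = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_norm - logits[label]


def init_linear(n_in, n_out, rng):
    """Small random weights and zero biases (hypothetical initialisation)."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def layerwise_loss(x, label, hidden, heads, output, aux_weight=1.0):
    """Return (total, final_loss, hidden_losses) for one example.

    hidden: list of (W, b) pairs for the hidden layers
    heads:  list of (W, b) pairs, one linear classifier per hidden layer
    output: (W, b) pair for the usual final classification layer
    """
    h = x
    hidden_losses = []
    for (w, b), (wh, bh) in zip(hidden, heads):
        h = relu(linear(h, w, b))
        # A purely linear head can only score well if the hidden
        # representation h is close to linearly separable by class.
        hidden_losses.append(cross_entropy(linear(h, wh, bh), label))
    final_loss = cross_entropy(linear(h, *output), label)
    # Assumed combination rule: final loss plus a weighted sum of head losses.
    total = final_loss + aux_weight * sum(hidden_losses)
    return total, final_loss, hidden_losses


rng = random.Random(0)
sizes = [4, 8, 6]   # input dimension, then two hidden widths (illustrative)
n_classes = 3
hidden = [init_linear(a, b, rng) for a, b in zip(sizes, sizes[1:])]
heads = [init_linear(b, n_classes, rng) for b in sizes[1:]]
output = init_linear(sizes[-1], n_classes, rng)

x = [0.2, -1.0, 0.5, 0.7]
total, final_loss, hidden_losses = layerwise_loss(x, 1, hidden, heads, output)
print(total, final_loss, hidden_losses)
```

Setting `aux_weight=0.0` recovers the standard loss on the final layer only, which makes it straightforward to compare this kind of layer-wise objective against ordinary training in a sketch like this one.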

Comments:	Paper accepted in the journal Pattern Recognition Letters (Open Access); the DOI is listed under Related DOI. Please refer to the published version.
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.06146 [cs.LG]
	(or arXiv:2306.06146v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.06146
Related DOI:	https://doi.org/10.1016/j.patrec.2023.11.016

Submission history

From: Andrea Apicella
[v1] Fri, 9 Jun 2023 10:52:49 UTC (183 KB)
[v2] Sat, 18 Nov 2023 10:13:30 UTC (183 KB)
