Computer Science > Machine Learning

arXiv:1904.00438 (cs)

[Submitted on 31 Mar 2019 (v1), last revised 21 Nov 2019 (this version, v2)]

Title:Understanding Neural Architecture Search Techniques

View PDF

Abstract:Automatic methods for generating state-of-the-art neural network architectures without human experts have generated significant attention recently. This is because of the potential to remove human experts from the design loop which can reduce costs and decrease time to model deployment. Neural architecture search (NAS) techniques have improved significantly in their computational efficiency since the original NAS was proposed. This reduction in computation is enabled via weight sharing such as in Efficient Neural Architecture Search (ENAS). However, recently a body of work confirms our discovery that ENAS does not do significantly better than random search with weight sharing, contradicting the initial claims of the authors. We provide an explanation for this phenomenon by investigating the interpretability of the ENAS controller's hidden state. We find models sampled from identical controller hidden states have no correlation with various graph similarity metrics, so no notion of structural similarity is learned. This failure mode implies the RNN controller does not condition on past architecture choices. Lastly, we propose a solution to this failure mode by forcing the controller's hidden state to encode pasts decisions by training it with a memory buffer of previously sampled architectures. Doing this improves hidden state interpretability by increasing the correlation between controller hidden states and graph similarity metrics.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1904.00438 [cs.LG]
	(or arXiv:1904.00438v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1904.00438

Submission history

From: George Adam [view email]
[v1] Sun, 31 Mar 2019 15:48:49 UTC (491 KB)
[v2] Thu, 21 Nov 2019 17:49:27 UTC (501 KB)

Computer Science > Machine Learning

Title:Understanding Neural Architecture Search Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding Neural Architecture Search Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators