Thread: Circuits

In the original narrative of deep learning, each neuron builds progressively more abstract, meaningful features by composing features in the preceding layer. In recent years, there’s been some skepticism of this view, but what happens if you take it really seriously?

InceptionV1 is a classic vision model with around 10,000 unique neurons — a large number, but still on a scale that a group effort could attack. What if you simply go through the model, neuron by neuron, trying to understand each one and the connections between them? The circuits collaboration aims to find out.

Articles & Comments

The natural unit of publication for investigating circuits seems to be short papers on individual circuits or small families of features. Compared to normal machine learning papers, this is a small and unusual topic for a paper.

To facilitate exploration of this direction, Distill is inviting a “thread” of short articles on circuits, interspersed with critical commentary by experts in adjacent fields. The thread will be a living document, with new articles added over time, organized through an open slack channel (#circuits in the Distill slack). Content in this thread should be seen as early stage exploratory research.

Zoom In: An Introduction to Circuits

Authors

Affiliations

Chris Olah, Nick Cammarata, Ludwig Schubert, Gabriel Goh, Michael Petrov, Shan Carter

OpenAI

Does it make sense to treat individual neurons and the connections between them as a serious object of study? This essay proposes three claims which, if true, might justify serious inquiry into them: the existence of meaningful features, the existence of meaningful circuits between features, and the universality of those features and circuits.

It also discuses historical successes of science “zooming in,” whether we should be concerned about this research being qualitative, and approaches to rigorous investigation.

Read Full Article

An Overview of Early Vision in InceptionV1

Authors

Affiliations

Chris Olah, Nick Cammarata, Ludwig Schubert, Gabriel Goh, Michael Petrov, Shan Carter

OpenAI

An overview of all the neurons in the first five layers of InceptionV1, organized into a taxonomy of “neuron groups.” This article sets the stage for future deep dives into particular aspects of early vision.

Read Full Article

Curve Detectors

Authors

Affiliations

Nick Cammarata, Gabriel Goh, Shan Carter, Ludwig Schubert, Michael Petrov, Chris Olah

OpenAI

Every vision model we’ve explored in detail contains neurons which detect curves. Curve detectors is the first in a series of three articles exploring this neuron family in detail.

Read Full Article

Naturally Occurring Equivariance in Neural Networks

Authors

Affiliations

Chris Olah, Nick Cammarata, Chelsea Voss, Ludwig Schubert, Gabriel Goh

OpenAI

Neural networks naturally learn many transformed copies of the same feature, connected by symmetric weights.

Read Full Article

High-Low Frequency Detectors

Authors

Affiliations

Ludwig Schubert, Chelsea Voss, Nick Cammarata, Gabriel Goh, Chris Olah

OpenAI

A family of early-vision neurons reacting to directional transitions from high to low spatial frequency.

Read Full Article

Curve Circuits

Authors

Affiliations

Nick Cammarata, Gabriel Goh, Shan Carter, Chelsea Voss, Ludwig Schubert, Chris Olah

OpenAI

We reverse engineer a non-trivial learned algorithm from the weights of a neural network and use its core ideas to craft an artificial artificial neural network from scratch that reimplements it.

Read Full Article

Visualizing Weights

Authors

Affiliations

Chelsea Voss, Nick Cammarata, Gabriel Goh, Michael Petrov, Ludwig Schubert, Ben Egan, Swee Kiat Lim, Chris Olah

OpenAI, Mount Royal University, Stanford University

We present techniques for visualizing, contextualizing, and understanding neural network weights.

Read Full Article

Branch Specialization

Authors

Affiliations

Chelsea Voss, Gabriel Goh, Nick Cammarata, Michael Petrov, Ludwig Schubert, Chris Olah

OpenAI

When a neural network layer is divided into multiple branches, neurons self-organize into coherent groupings.

Read Full Article

Weight Banding

Authors

Affiliations

Michael Petrov, Chelsea Voss, Ludwig Schubert, Nick Cammarata, Gabriel Goh, Chris Olah

OpenAI

Weights in the final layer of common visual models appear as horizontal bands. We investigate how and why.

Read Full Article

Get Involved

The Circuits thread is open to articles exploring individual features, circuits, and their organization within neural networks. Critical commentary and discussion of existing articles is also welcome. The thread is organized through the open #circuits channel on the Distill slack. Articles can be suggested there, and will be included at the discretion of previous authors in the thread, or in the case of disagreement by an uninvolved editor.

If you would like get involved but don’t know where to start, small projects may be available if you ask in the channel.

About the Thread Format

Part of Distill’s mandate is to experiment with new forms of scientific publishing. We believe that that reconciling faster and more continuous approaches to publication with review and discussion is an important open problem in scientific publishing.

Threads are collections of short articles, experiments, and critical commentary around a narrow or unusual research topic, along with a slack channel for real time discussion and collaboration. They are intended to be earlier stage than a full Distill paper, and allow for more fluid publishing, feedback and discussion. We also hope they’ll allow for wider participation. Think of a cross between a Twitter thread, an academic workshop, and a book of collected essays.

Threads are very much an experiment. We think it’s possible they’re a great format, and also possible they’re terrible. We plan to trial two such threads and then re-evaluate our thought on the format.

Thread: Circuits

Authors

Affiliations

Published

DOI

Articles & Comments

Zoom In: An Introduction to Circuits

Authors

Affiliations

An Overview of Early Vision in InceptionV1

Authors

Affiliations

Curve Detectors

Authors

Affiliations

Naturally Occurring Equivariance in Neural Networks

Authors

Affiliations

High-Low Frequency Detectors

Authors

Affiliations

Curve Circuits

Authors

Affiliations

Visualizing Weights

Authors

Affiliations

Branch Specialization

Authors

Affiliations

Weight Banding

Authors

Affiliations

This is a living document

Get Involved

About the Thread Format

Citation Information

Updates and Corrections

Reuse

Citation