User profiles for Adam Santoro

Adam Santoro

Google DeepMind
Verified email at google.com
Cited by 16199

A simple neural network module for relational reasoning

A Santoro, D Raposo, DG Barrett… - Advances in neural …, 2017 - proceedings.neurips.cc
Relational reasoning is a central component of generally intelligent behavior, but has
proven difficult for neural networks to learn. In this paper we describe how to use Relation …

Meta-learning with memory-augmented neural networks

A Santoro, S Bartunov, M Botvinick… - International …, 2016 - proceedings.mlr.press
Despite recent breakthroughs in the applications of deep neural networks, one setting that
presents a persistent challenge is that of" one-shot learning." Traditional gradient-based …

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

…, A Abid, A Fisch, AR Brown, A Santoro… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

Relational inductive biases, deep learning, and graph networks

…, A Tacchetti, D Raposo, A Santoro… - arXiv preprint arXiv …, 2018 - arxiv.org
Artificial intelligence (AI) has undergone a renaissance recently, making major progress in
key domains such as vision, language, control, and decision-making. This has been due, in …

Backpropagation and the brain

TP Lillicrap, A Santoro, L Marris, CJ Akerman… - Nature Reviews …, 2020 - nature.com
During learning, the brain modifies synapses to improve behaviour. In the cortex, synapses
are embedded within multilayered networks, making it difficult to determine the effect of an …

Data distributional properties drive emergent in-context learning in transformers

S Chan, A Santoro, A Lampinen… - Advances in …, 2022 - proceedings.neurips.cc
Large transformer-based models are able to perform in-context few-shot learning, without
being explicitly trained for it. This observation raises the question: what aspects of the training …

Relational recurrent neural networks

A Santoro, R Faulkner, D Raposo… - Advances in neural …, 2018 - proceedings.neurips.cc
Memory-based neural networks model temporal data by leveraging an ability to remember
information for long periods. It is unclear, however, whether they also have an ability to …

Assessing the scalability of biologically-motivated deep learning algorithms and architectures

S Bartunov, A Santoro, B Richards… - Advances in neural …, 2018 - proceedings.neurips.cc
The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The
recent success of deep networks in machine learning and AI, however, has inspired …

Relational deep reinforcement learning

V Zambaldi, D Raposo, A Santoro, V Bapst, Y Li… - arXiv preprint arXiv …, 2018 - arxiv.org
We introduce an approach for deep reinforcement learning (RL) that improves upon the
efficiency, generalization capacity, and interpretability of conventional approaches through …

Hyperbolic attention networks

…, P Battaglia, V Bapst, D Raposo, A Santoro… - arXiv preprint arXiv …, 2018 - arxiv.org
We introduce hyperbolic attention networks to endow neural networks with enough capacity
to match the complexity of data with hierarchical and power-law structure. A few recent …