More Web Proxy on the site http://driver.im/

default search action

combined dblp search
author search
venue search
publication search

ask others

Paul F. Christiano

Paul Francis Christiano

> Home > Persons

Person information

affiliation: OpenAI, USA
affiliation (PhD 2017): University of California, Berkeley, CA, USA

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-05566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-05566
Evan Hubinger, Carson Denison, Jesse Mu, Mike Lambert, Meg Tong, Monte MacDiarmid, Tamera Lanham, Daniel M. Ziegler, Tim Maxwell, Newton Cheng, Adam S. Jermyn, Amanda Askell, Ansh Radhakrishnan, Cem Anil, David Duvenaud, Deep Ganguli, Fazl Barez, Jack Clark, Kamal Ndousse, Kshitij Sachan, Michael Sellitto, Mrinank Sharma, Nova DasSarma, Roger Grosse, Shauna Kravec, Yuntao Bai, Zachary Witten, Marina Favaro, Jan Brauner, Holden Karnofsky, Paul F. Christiano, Samuel R. Bowman, Logan Graham, Jared Kaplan, Sören Mindermann, Ryan Greenblatt, Buck Shlegeris, Nicholas Schiefer, Ethan Perez:
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training. CoRR abs/2401.05566 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-03077
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-03077
Paul F. Christiano, Jacob Hilton, Victor Lecomte, Mark Xu:
Backdoor defense, learnability and obfuscation. CoRR abs/2409.03077 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01290
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01290
Paul F. Christiano, Jacob Hilton, Andrea Lincoln, Eric Neyman, Mark Xu:
Towards a Law of Iterated Expectations for Heuristic Estimators. CoRR abs/2410.01290 (2024)
2023
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15324
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15324
Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul F. Christiano, Allan Dafoe:
Model evaluation for extreme risks. CoRR abs/2305.15324 (2023)
2022
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Ouyang0JAWMZASR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Ouyang0JAWMZASR22
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe:
Training language models to follow instructions with human feedback. NeurIPS 2022
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02155
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02155
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe:
Training language models to follow instructions with human feedback. CoRR abs/2203.02155 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-06738
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06738
Paul F. Christiano, Eric Neyman, Mark Xu:
Formalizing the presumption of independence. CoRR abs/2211.06738 (2022)
2021
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jacm/BrakerskiCMVV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jacm/BrakerskiCMVV21
Zvika Brakerski, Paul F. Christiano, Urmila Mahadev, Umesh V. Vazirani, Thomas Vidick:
A Cryptographic Test of Quantumness and Certifiable Randomness from a Single Quantum Device. J. ACM 68(5): 31:1-31:47 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-10862
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-10862
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, Paul F. Christiano:
Recursively Summarizing Books with Human Feedback. CoRR abs/2109.10862 (2021)
2020
[c10]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/StiennonO0ZLVRA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/StiennonO0ZLVRA20
Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul F. Christiano:
Learning to summarize with human feedback. NeurIPS 2020
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-01325
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-01325
Nisan Stiennon, Long Ouyang, Jeff Wu, Daniel M. Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul F. Christiano:
Learning to summarize from human feedback. CoRR abs/2009.01325 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-08593
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-08593
Daniel M. Ziegler, Nisan Stiennon, Jeffrey Wu, Tom B. Brown, Alec Radford, Dario Amodei, Paul F. Christiano, Geoffrey Irving:
Fine-Tuning Language Models from Human Preferences. CoRR abs/1909.08593 (2019)
2018
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/focs/BrakerskiCMVV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/focs/BrakerskiCMVV18
Zvika Brakerski, Paul F. Christiano, Urmila Mahadev, Umesh V. Vazirani, Thomas Vidick:
A Cryptographic Test of Quantumness and Certifiable Randomness from a Single Quantum Device. FOCS 2018: 320-331
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00640
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00640
Zvika Brakerski, Paul F. Christiano, Urmila Mahadev, Umesh V. Vazirani, Thomas Vidick:
Certifiable Randomness from a Single Quantum Device. CoRR abs/1804.00640 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-00899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-00899
Geoffrey Irving, Paul F. Christiano, Dario Amodei:
AI safety via debate. CoRR abs/1805.00899 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-08352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-08352
Tom B. Brown, Nicholas Carlini, Chiyuan Zhang, Catherine Olsson, Paul F. Christiano, Ian J. Goodfellow:
Unrestricted Adversarial Examples. CoRR abs/1809.08352 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-08575
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-08575
Paul F. Christiano, Buck Shlegeris, Dario Amodei:
Supervising strong learners by amplifying weak experts. CoRR abs/1810.08575 (2018)
2017
[b1]
- view
  - electronic edition @ escholarship.org
  - no references & citations available
  authority control:
- export record
  dblp key:
  - phd/basesearch/Christiano17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/Christiano17
Paul Francis Christiano:
Manipulation-resistant online learning. University of California, Berkeley, USA, 2017
[c8]
- view
- export record
  dblp key:
  - conf/nips/ChristianoLBMLA17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChristianoLBMLA17
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep Reinforcement Learning from Human Preferences. NIPS 2017: 4299-4307
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1706-03741
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1706-03741
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep reinforcement learning from human preferences. CoRR abs/1706.03741 (2017)
2016
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/Christiano16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/Christiano16
Paul F. Christiano:
Provably manipulation-resistant reputation systems. COLT 2016: 670-697
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/Christiano16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Christiano16
Paul F. Christiano:
Robust Collaborative Online Learning. CoRR abs/1603.06265 (2016)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/Al-RfouAAa16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Al-RfouAAa16
Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermüller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul F. Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron C. Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Melanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian J. Goodfellow, Matthew Graham, Çaglar Gülçehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrançois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Joseph Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph P. Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang:
Theano: A Python framework for fast computation of mathematical expressions. CoRR abs/1605.02688 (2016)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/AmodeiOSCSM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AmodeiOSCSM16
Dario Amodei, Chris Olah, Jacob Steinhardt, Paul F. Christiano, John Schulman, Dan Mané:
Concrete Problems in AI Safety. CoRR abs/1606.06565 (2016)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/ChristianoSMSBT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ChristianoSMSBT16
Paul F. Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba:
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model. CoRR abs/1610.03518 (2016)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/FinnCAL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FinnCAL16
Chelsea Finn, Paul F. Christiano, Pieter Abbeel, Sergey Levine:
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models. CoRR abs/1611.03852 (2016)
2015
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/lori/FallensteinTC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lori/FallensteinTC15
Benja Fallenstein, Jessica Taylor, Paul F. Christiano:
Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence. LORI 2015: 411-415
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/FallensteinTC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/FallensteinTC15
Benja Fallenstein, Jessica Taylor, Paul F. Christiano:
Reflective Oracles: A Foundation for Classical Game Theory. CoRR abs/1508.04145 (2015)
2014
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/Christiano14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/Christiano14
Paul F. Christiano:
Open Problem: Online Local Learning. COLT 2014: 1290-1294
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/stoc/Christiano14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/stoc/Christiano14
Paul F. Christiano:
Online local learning via semidefinite programming. STOC 2014: 468-474
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/BaraszCFHLY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BaraszCFHLY14
Mihály Bárász, Paul F. Christiano, Benja Fallenstein, Marcello Herreshoff, Patrick LaVictoire, Eliezer Yudkowsky:
Robust Cooperation in the Prisoner's Dilemma: Program Equilibrium via Provability Logic. CoRR abs/1401.5577 (2014)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/Christiano14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Christiano14
Paul F. Christiano:
Online Local Learning via Semidefinite Programming. CoRR abs/1403.5287 (2014)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/Christiano14a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/Christiano14a
Paul F. Christiano:
Provably Manipulation-Resistant Reputation Systems. CoRR abs/1411.1127 (2014)
2013
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/toc/AaronsonC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/toc/AaronsonC13
Scott Aaronson, Paul F. Christiano:
Quantum Money from Hidden Subspaces. Theory Comput. 9: 349-401 (2013)
2012
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/stoc/AaronsonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/stoc/AaronsonC12
Scott Aaronson, Paul F. Christiano:
Quantum money from hidden subspaces. STOC 2012: 41-60
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1203-4740
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1203-4740
Scott Aaronson, Paul F. Christiano:
Quantum Money from Hidden Subspaces. CoRR abs/1203.4740 (2012)
[i3]
- view
  - electronic edition @ weizmann.ac.il (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/eccc/AaronsonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eccc/AaronsonC12
Scott Aaronson, Paul F. Christiano:
Quantum Money from Hidden Subspaces. Electron. Colloquium Comput. Complex. TR12 (2012)
[i2]
- view
  - electronic edition @ iacr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/iacr/AaronsonC12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/iacr/AaronsonC12
Scott Aaronson, Paul F. Christiano:
Quantum Money from Hidden Subspaces. IACR Cryptol. ePrint Arch. 2012: 171 (2012)
2011
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/stoc/ChristianoKMST11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/stoc/ChristianoKMST11
Paul F. Christiano, Jonathan A. Kelner, Aleksander Madry, Daniel A. Spielman, Shang-Hua Teng:
Electrical flows, laplacian systems, and faster approximation of maximum flow in undirected graphs. STOC 2011: 273-282
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/wads/ChristianoDK11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wads/ChristianoDK11
Paul F. Christiano, Erik D. Demaine, Shaunak Kishore:
Lossless Fault-Tolerant Data Structures with Additive Overhead. WADS 2011: 243-254
2010
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1010-2921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1010-2921
Paul F. Christiano, Jonathan A. Kelner, Aleksander Madry, Daniel A. Spielman, Shang-Hua Teng:
Electrical Flows, Laplacian Systems, and Faster Approximation of Maximum Flow in Undirected Graphs. CoRR abs/1010.2921 (2010)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.