Marlos C. Machado

Marlos Cholodovskis Machado


Person information

affiliation: Department of Computing Science, University of Alberta, Canada
affiliation (former): Federal University of Minas Gerais, Belo Horizonte, Brazil

2020 – today

2024
[j11]
  persistent URL:
  - https://dblp.org/rec/conf/rlc/Meyer0M24
Edan Meyer, Adam White, Marlos C. Machado:
Harnessing Discrete Representations for Continual Reinforcement Learning. RLJ 2: 606-628 (2024)
[j10]
  persistent URL:
  - https://dblp.org/rec/conf/rlc/DaleyMW24
Brett Daley, Marlos C. Machado, Martha White:
Demystifying the Recency Heuristic in Temporal-Difference Learning. RLJ 3: 1019-1036 (2024)
[j9]
  persistent URL:
  - https://dblp.org/rec/journals/ai/WangMWMAKLW24
Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White:
Investigating the properties of neural network representations in reinforcement learning. Artif. Intell. 330: 104100 (2024)
[j8]
  persistent URL:
  - https://dblp.org/rec/journals/ml/JanjuaSWMMW24
Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White:
GVFs in the real world: making predictions online for water treatment. Mach. Learn. 113(8): 5151-5181 (2024)
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/SuttonMHSTT024
Richard S. Sutton, Marlos C. Machado, G. Zacharias Holland, David Szepesvari, Finbarr Timbers, Brian Tanner, Adam White:
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint). AAAI 2024: 22713
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/GomezBM24
Diego Gomez, Michael Bowling, Marlos C. Machado:
Proper Laplacian Representation Learning. ICLR 2024
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/icml/DaleyWM24
Brett Daley, Martha White, Marlos C. Machado:
Averaging n-step Returns Reduces Variance in Reinforcement Learning. ICML 2024
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03903
Brett Daley, Martha White, Marlos C. Machado:
Compound Returns Reduce Variance in Reinforcement Learning. CoRR abs/2402.03903 (2024)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-06811
Alex Lewandowski, Saurabh Kumar, Dale Schuurmans, András György, Marlos C. Machado:
Learning Continually by Spectral Regularization. CoRR abs/2406.06811 (2024)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12284
Brett Daley, Marlos C. Machado, Martha White:
Demystifying the Recency Heuristic in Temporal-Difference Learning. CoRR abs/2406.12284 (2024)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-20634
Alex Lewandowski, Dale Schuurmans, Marlos C. Machado:
Plastic Learning with Deep Fourier Features. CoRR abs/2410.20634 (2024)
2023
[j7]
  persistent URL:
  - https://dblp.org/rec/journals/ai/SuttonMHSTTW23
Richard S. Sutton, Marlos C. Machado, G. Zacharias Holland, David Szepesvari, Finbarr Timbers, Brian Tanner, Adam White:
Reward-respecting subtasks for model-based reinforcement learning. Artif. Intell. 324: 104001 (2023)
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/Machado0PB23
Marlos C. Machado, André Barreto, Doina Precup, Michael Bowling:
Temporal Abstraction in Reinforcement Learning with the Successor Representation. J. Mach. Learn. Res. 24: 80:1-80:69 (2023)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/Tao0M23
Ruo Yu Tao, Adam White, Marlos C. Machado:
Agent-State Construction with Auxiliary Inputs. Trans. Mach. Learn. Res. 2023 (2023)
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/collas/AbbasZM0M23
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado:
Loss of Plasticity in Continual Deep Reinforcement Learning. CoLLAs 2023: 620-636
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/icml/DaleyWAM23
Brett Daley, Martha White, Christopher Amato, Marlos C. Machado:
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. ICML 2023: 6818-6835
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/icml/KlissarovM23
Martin Klissarov, Marlos C. Machado:
Deep Laplacian-based Options for Temporally-Extended Exploration. ICML 2023: 17198-17217
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-11181
Martin Klissarov, Marlos C. Machado:
Deep Laplacian-based Options for Temporally-Extended Exploration. CoRR abs/2301.11181 (2023)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-11321
Brett Daley, Martha White, Christopher Amato, Marlos C. Machado:
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning. CoRR abs/2301.11321 (2023)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-07507
Zaheer Abbas, Rosie Zhao, Joseph Modayil, Adam White, Marlos C. Machado:
Loss of Plasticity in Continual Deep Reinforcement Learning. CoRR abs/2303.07507 (2023)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-10833
Diego Gomez, Michael Bowling, Marlos C. Machado:
Proper Laplacian Representation Learning. CoRR abs/2310.10833 (2023)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-15719
Subhojeet Pramanik, Esraa Elelimy, Marlos C. Machado, Adam White:
Recurrent Linear Transformers. CoRR abs/2310.15719 (2023)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-00246
Alex Lewandowski, Haruto Tanaka, Dale Schuurmans, Marlos C. Machado:
Curvature Explains Loss of Plasticity. CoRR abs/2312.00246 (2023)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01203
Edan Meyer, Adam White, Marlos C. Machado:
Harnessing Discrete Representations For Continual Reinforcement Learning. CoRR abs/2312.01203 (2023)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01624
Muhammad Kamran Janjua, Haseeb Shah, Martha White, Erfan Miahi, Marlos C. Machado, Adam White:
GVFs in the Real World: Making Predictions Online for Water Treatment. CoRR abs/2312.01624 (2023)
2022
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/VaswaniBTMGGMCR22
Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Müller, Shivam Garg, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux:
A general class of surrogate functions for stable and efficient reinforcement learning. AISTATS 2022: 8619-8649
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/uai/ErraqabiMZSLDB22
Akram Erraqabi, Marlos C. Machado, Mingde Zhao, Sainbayar Sukhbaatar, Alessandro Lazaric, Ludovic Denoyer, Yoshua Bengio:
Temporal abstractions-augmented temporally contrastive learning: An alternative to the Laplacian in RL. UAI 2022: 641-651
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03466
Richard S. Sutton, Marlos C. Machado, G. Zacharias Holland, David Szepesvari, Finbarr Timbers, Brian Tanner, Adam White:
Reward-Respecting Subtasks for Model-Based Reinforcement Learning. CoRR abs/2202.03466 (2022)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-11369
Akram Erraqabi, Marlos C. Machado, Mingde Zhao, Sainbayar Sukhbaatar, Alessandro Lazaric, Ludovic Denoyer, Yoshua Bengio:
Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL. CoRR abs/2203.11369 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15955
Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White:
Investigating the Properties of Neural Network Representations in Reinforcement Learning. CoRR abs/2203.15955 (2022)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-07805
Ruo Yu Tao, Adam White, Marlos C. Machado:
Agent-State Construction with Auxiliary Inputs. CoRR abs/2211.07805 (2022)
2021
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AgarwalMCB21
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. ICLR 2021
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChungTMR21
Wesley Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux:
Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization. ICML 2021: 1999-2009
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2101-05265
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare:
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning. CoRR abs/2101.05265 (2021)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-05828
Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Mueller, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux:
A functional mirror ascent view of policy gradient methods with function approximation. CoRR abs/2108.05828 (2021)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-11052
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
On Bonus-Based Exploration Methods in the Arcade Learning Environment. CoRR abs/2109.11052 (2021)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05740
Marlos C. Machado, André Barreto, Doina Precup:
Temporal Abstraction in Reinforcement Learning with the Successor Representation. CoRR abs/2110.05740 (2021)
2020
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/nature/BellemareCCGMMP20
Marc G. Bellemare, Salvatore Candido, Pablo Samuel Castro, Jun Gong, Marlos C. Machado, Subhodeep Moitra, Sameera S. Ponda, Ziyu Wang:
Autonomous navigation of stratospheric balloons using reinforcement learning. Nat. 588(7836): 77-82 (2020)
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MachadoBB20
Marlos C. Machado, Marc G. Bellemare, Michael Bowling:
Count-Based Exploration with the Successor Representation. AAAI 2020: 5125-5133
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/JinnaiPMK20
Yuu Jinnai, Jee Won Park, Marlos C. Machado, George Dimitri Konidaris:
Exploration in Reinforcement Learning with Deep Covering Options. ICLR 2020
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TaigaFMCB20
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
On Bonus Based Exploration Methods In The Arcade Learning Environment. ICLR 2020
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GhoshMR20
Dibya Ghosh, Marlos C. Machado, Nicolas Le Roux:
An operator view of policy gradient methods. NeurIPS 2020
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-11266
Dibya Ghosh, Marlos C. Machado, Nicolas Le Roux:
An operator view of policy gradient methods. CoRR abs/2006.11266 (2020)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-13773
Wesley Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux:
Beyond variance reduction: Understanding the true impact of baselines on policy optimization. CoRR abs/2008.13773 (2020)

2010 – 2019

2019
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-02388
Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron C. Courville, Marc G. Bellemare:
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment. CoRR abs/1908.02388 (2019)
2018
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/jair/MachadoBTVHB18
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents. J. Artif. Intell. Res. 61: 523-562 (2018)
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MachadoRGLTC18
Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell:
Eigenoption Discovery through the Deep Successor Representation. ICLR (Poster) 2018
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MachadoBTVHB18
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract). IJCAI 2018: 5573-5577
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/iros/SherstanMP18
Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski:
Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation. IROS 2018: 2997-3003
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-09001
Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski:
Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation. CoRR abs/1803.09001 (2018)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-11622
Marlos C. Machado, Marc G. Bellemare, Michael Bowling:
Count-Based Exploration with the Successor Representation. CoRR abs/1807.11622 (2018)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-00123
Jesse Farebrother, Marlos C. Machado, Michael Bowling:
Generalization and Regularization in DQN. CoRR abs/1810.00123 (2018)
2017
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icml/MachadoBB17
Marlos C. Machado, Marc G. Bellemare, Michael H. Bowling:
A Laplacian Framework for Option Discovery in Reinforcement Learning. ICML 2017: 2295-2304
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MachadoBB17
Marlos C. Machado, Marc G. Bellemare, Michael H. Bowling:
A Laplacian Framework for Option Discovery in Reinforcement Learning. CoRR abs/1703.00956 (2017)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-06009
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling:
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents. CoRR abs/1709.06009 (2017)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-11089
Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell:
Eigenoption Discovery through the Deep Successor Representation. CoRR abs/1710.11089 (2017)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-04065
Miao Liu, Marlos C. Machado, Gerald Tesauro, Murray Campbell:
The Eigenoption-Critic Framework. CoRR abs/1712.04065 (2017)
2016
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/SeijenMPMS16
Harm van Seijen, Ashique Rupam Mahmood, Patrick M. Pilarski, Marlos C. Machado, Richard S. Sutton:
True Online Temporal-Difference Learning. J. Mach. Learn. Res. 17: 145:1-145:40 (2016)
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/agi/SherstanWMP16
Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski:
Introspective Agents: Confidence Measures for General Value Functions. AGI 2016: 258-261
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/atal/LiangMTB16
Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael H. Bowling:
State of the Art Control of Atari Games Using Shallow Reinforcement Learning. AAMAS 2016: 485-493
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MachadoB16
Marlos C. Machado, Michael H. Bowling:
Learning Purposeful Behaviour in the Absence of Rewards. CoRR abs/1605.07700 (2016)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SherstanWMP16
Craig Sherstan, Adam White, Marlos C. Machado, Patrick M. Pilarski:
Introspective Agents: Confidence Measures for General Value Functions. CoRR abs/1606.05593 (2016)
2015
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/MachadoSB15
Marlos C. Machado, Sriram Srinivasan, Michael H. Bowling:
Domain-Independent Optimistic Initialization for Reinforcement Learning. AAAI Workshop: Learning for General Competency in Video Games 2015
[e1]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/2015games
Michael Bowling, Marc G. Bellemare, Erik Talvitie, Joel Veness, Marlos C. Machado:
Learning for General Competency in Video Games, Papers from the 2015 AAAI Workshop, Austin, Texas, USA, January 26, 2015. AAAI Technical Report WS-15-10, AAAI Press 2015, ISBN 978-1-57735-721-6
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/aim/AlbrechtLBBCCDD15
Stefano V. Albrecht, J. Christopher Beck, David L. Buckeridge, Adi Botea, Cornelia Caragea, Chi-Hung Chi, Theodoros Damoulas, Bistra Dilkina, Eric Eaton, Pooyan Fazli, Sam Ganzfried, Marius Lindauer, Marlos C. Machado, Yuri Malitsky, Gary Marcus, Sebastiaan A. Meijer, Francesca Rossi, Arash Shaban-Nejad, Sylvie Thiébaux, Manuela M. Veloso, Toby Walsh, Can Wang, Jie Zhang, Yu Zheng:
Reports from the 2015 AAAI Workshop Program. AI Mag. 36(2): 90-101 (2015)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/LiangMTB15
Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael H. Bowling:
State of the Art Control of Atari Games Using Shallow Reinforcement Learning. CoRR abs/1512.01563 (2015)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SeijenMPMS15
Harm van Seijen, Ashique Rupam Mahmood, Patrick M. Pilarski, Marlos C. Machado, Richard S. Sutton:
True Online Temporal-Difference Learning. CoRR abs/1512.04087 (2015)
2014
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/cie/CunhaMC14
Renato Luiz de Freitas Cunha, Marlos C. Machado, Luiz Chaimowicz:
RTSMate: Towards an Advice System for RTS Games. Comput. Entertain. 12(1): 1:1-1:20 (2014)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/MachadoSB14
Marlos C. Machado, Sriram Srinivasan, Michael Bowling:
Domain-Independent Optimistic Initialization for Reinforcement Learning. CoRR abs/1410.4604 (2014)
2013
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/Machado13
Marlos C. Machado:
A Methodology for Player Modeling based on Machine Learning. CoRR abs/1312.3903 (2013)
2012
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/cig/MachadoPC12
Marlos C. Machado, Gisele L. Pappa, Luiz Chaimowicz:
A binary classification approach for automatic preference modeling of virtual agents in Civilization IV. CIG 2012: 155-162
2011
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/cgames/MachadoFC11
Marlos C. Machado, Eduardo P. C. Fantini, Luiz Chaimowicz:
Player modeling: Towards a common taxonomy. CGAMES 2011: 50-57
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/sbgames/MachadoRC11
Marlos C. Machado, Bruno S. L. Rocha, Luiz Chaimowicz:
Agents Behavior and Preferences Characterization in Civilization IV. SBGames 2011: 43-52
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/sbgames/MachadoC11
Marlos C. Machado, Luiz Chaimowicz:
Combining Metaheuristics and CSP Algorithms to Solve Sudoku. SBGames 2011: 124-131
