Agent57: Outperforming the Atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020 | 609 | 2020 |
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020 | 313 | 2020 |
A generalist neural algorithmic learner B Ibarz, V Kurin, G Papamakarios, K Nikiforou, M Bennani, R Csordás, ... Learning on graphs conference, 2:1-2:23, 2022 | 47 | 2022 |
Beyond fine-tuning: Transferring behavior in reinforcement learning V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ... arXiv preprint arXiv:2102.13515, 2021 | 20 | 2021 |
Coverage as a principle for discovering transferable behavior in reinforcement learning V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ... | 9 | 2020 |
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020 | 8 | 2020 |
Agent57: Outperforming the Atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ... arXiv preprint arXiv:2003.13350, 2020 | 7 | 2020 |
Never give up: Learning directed exploration strategies A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ... arXiv e-prints, arXiv:2002.06038, 2020 | 6 | 2020 |
Agent57: Outperforming the Atari Human Benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ... arXiv preprint arXiv:2003.13350, 2020 | 5 | 2020 |
Jointly learning exploratory and non-exploratory action selection policies AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ... US Patent App. 18/334,112, 2024 | | 2024 |
Jointly learning exploratory and non-exploratory action selection policies AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ... US Patent 11,714,990, 2023 | | 2023 |
Reinforcement learning with adaptive return computation schemes AP Badia, B Piot, P Sprechmann, SJ Kapturowski, A Vitvitskyi, Z Guo, ... US Patent App. 17/797,878, 2023 | | 2023 |