Pierre-Luc Bacon

Cited by

	All	Since 2019
Citations	2646	2344
h-index	18	18
i10-index	26	23

540

270

135

405

20162017201820192020202120222023202432 74 184 261 352 349 423 536 422

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
Jean HarbOpenAIVerified email at openai.com
Emmanuel BengioMcGill UniversityVerified email at mail.mcgill.ca
Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaVerified email at cs.mcgill.ca
Martin KlissarovMcGill University, MilaVerified email at mail.mcgill.ca
Ahmed TouatiMeta AIVerified email at umontreal.ca
Pascal VincentFacebook AI Research; U. Montreal (Professor, Computer Sc. & Op. Res.); MILA; CIFARVerified email at iro.umontreal.ca
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Yao LiuAmazonVerified email at stanford.edu
Timothy A MannMetaVerified email at fb.com
Daniel J. MankowitzGoogle DeepmindVerified email at google.com
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ NvidiaVerified email at technion.ac.il
Anna HarutyunyanDeepMindVerified email at google.com
Borja BalleDeepMindVerified email at google.com
Anima AnandkumarCalifornia Institute of Technology and NVIDIAVerified email at caltech.edu
David MegerAssociate Professor at McGill UniversityVerified email at cim.mcgill.ca

Pierre-Luc Bacon

University of Montreal

Verified email at mila.quebec - Homepage

reinforcement learning artificial intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1243	2017
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	339	2015
When waiting is not an option: Learning options with a deliberation cost J Harb, PL Bacon, M Klissarov, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	162	2018
The primacy bias in deep reinforcement learning E Nikishin, M Schwarzer, P D’Oro, PL Bacon, A Courville International conference on machine learning, 16828-16847, 2022	133	2022
Sample-efficient reinforcement learning by breaking the replay ratio barrier P D'Oro, M Schwarzer, E Nikishin, PL Bacon, MG Bellemare, A Courville Deep Reinforcement Learning Workshop NeurIPS 2022, 2022	63	2022
Learnings options end-to-end for continuous action tasks M Klissarov, PL Bacon, J Harb, D Precup arXiv preprint arXiv:1712.00004, 2017	62	2017
Convergent tree backup and retrace with function approximation A Touati, PL Bacon, D Precup, P Vincent International Conference on Machine Learning, 4955-4964, 2018	49	2018
Options of interest: Temporal abstraction with interest functions K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020	48	2020
Learning robust options D Mankowitz, T Mann, PL Bacon, D Precup, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	47	2018
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling Y Liu, PL Bacon, E Brunskill International Conference on Machine Learning, 6184-6193, 2020	41	2020
Policy evaluation networks J Harb, T Schaul, D Precup, PL Bacon arXiv preprint arXiv:2002.11833, 2020	38	2020
Control-oriented model-based reinforcement learning with implicit differentiation E Nikishin, R Abachi, R Agarwal, PL Bacon Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7886-7894, 2022	34	2022
Direct behavior specification via constrained reinforcement learning J Roy, R Girgis, J Romoff, PL Bacon, C Pal arXiv preprint arXiv:2112.12228, 2021	32	2021
Temporal Representation Learning PL Bacon McGill University (Canada), 2018	30	2018
Learning with options that terminate off-policy A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	26	2018
Motif: Intrinsic motivation from artificial intelligence feedback M Klissarov, P D'Oro, S Sodhani, R Raileanu, PL Bacon, P Vincent, ... arXiv preprint arXiv:2310.00166, 2023	25	2023
An information-theoretic perspective on credit assignment in reinforcement learning D Arumugam, P Henderson, PL Bacon arXiv preprint arXiv:2103.06224, 2021	22	2021
Continuous-time meta-learning with forward mode differentiation T Deleu, D Kanaa, L Feng, G Kerg, Y Bengio, G Lajoie, PL Bacon arXiv preprint arXiv:2203.01443, 2022	20	2022
Neural algorithmic reasoners are implicit planners AI Deac, P Veličković, O Milinkovic, PL Bacon, J Tang, M Nikolic Advances in Neural Information Processing Systems 34, 15529-15542, 2021	18	2021
Xlvin: executed latent value iteration nets A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić arXiv preprint arXiv:2010.13146, 2020	18	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors