Romain Laroche
Microsoft Research
Verified email at polytechnique.org
Title
Cited by
Year
Hybrid reward architecture for reinforcement learning
H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang
Advances in Neural Information Processing Systems 30, 2017
Cited by: 259, Year: 2017
Safe policy improvement with baseline bootstrapping
R Laroche, P Trichelair, RT Des Combes
International Conference on Machine Learning, 3652-3661, 2019
Cited by: 193, Year: 2019
Learning dynamic belief graphs to generalize on text-based games
A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ...
Advances in Neural Information Processing Systems 33, 3045-3057, 2020
Cited by: 93, Year: 2020
Contextual bandit for active learning: Active thompson sampling
D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo
Neural Information Processing: 21st International Conference, ICONIP 2014 …, 2014
Cited by: 91, Year: 2014
Transfer reinforcement learning with shared dynamics
R Laroche, M Barlier
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by: 59, Year: 2017
Counting to explore and generalize in text-based games
X Yuan, MA Côté, A Sordoni, R Laroche, RT Combes, M Hausknecht, ...
arXiv preprint arXiv:1806.11525, 2018
Cited by: 57, Year: 2018
Score-based inverse reinforcement learning
L El Asri, B Piot, M Geist, R Laroche, O Pietquin
Cited by: 43, Year: 2016
Reinforcement learning algorithm selection
R Laroche, R Féraud
arXiv preprint arXiv:1701.08810, 2017
Cited by: 36, Year: 2017
When does return-conditioned supervised learning work for offline reinforcement learning?
D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna
Advances in Neural Information Processing Systems 35, 1542-1553, 2022
Cited by: 35, Year: 2022
Safe policy improvement with soft baseline bootstrapping
K Nadjahi, R Laroche, R Tachet des Combes
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2020
Cited by: 35, Year: 2020
Transfer Learning for User Adaptation in Spoken Dialogue Systems.
A Genevay, R Laroche
AAMAS, 975-983, 2016
Cited by: 35, Year: 2016
Human-machine dialogue as a stochastic game
M Barlier, J Perolat, R Laroche, O Pietquin
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015
Cited by: 32, Year: 2015
NASTIA: Negotiating Appointment Setting Interface.
L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi
LREC, 266-271, 2014
Cited by: 31, Year: 2014
Reward function learning for dialogue management
L El Asri, R Laroche, O Pietquin
STAIRS 2012, 95-106, 2012
Cited by: 30, Year: 2012
Reward shaping for statistical optimisation of dialogue management
L El Asri, R Laroche, O Pietquin
Statistical Language and Speech Processing: First International Conference …, 2013
Cited by: 28, Year: 2013
Optimising turn-taking strategies with reinforcement learning
H Khouzaimi, R Laroche, F Lefevre
Proceedings of the 16th Annual Meeting of the Special Interest Group on …, 2015
Cited by: 27, Year: 2015
Decentralized exploration in multi-armed bandits
R Féraud, R Alami, R Laroche
International Conference on Machine Learning, 1901-1909, 2019
Cited by: 26, Year: 2019
Hybridisation of expertise and reinforcement learning in dialogue systems
R Laroche, G Putois, P Bretier, B Bouchon-Meunier
Tenth Annual Conference of the International Speech Communication Association, 2009
Cited by: 26, Year: 2009
Multi-advisor reinforcement learning
R Laroche, M Fatemi, J Romoff, H van Seijen
arXiv preprint arXiv:1704.00756, 2017
Cited by: 24, Year: 2017
Safe policy improvement with an estimated baseline policy
TD Simão, R Laroche, RT Combes
International Foundation for Autonomous Agents and Multi-Agent Systems, 2019
Cited by: 22, Year: 2019