Robert Kirk

Cited by

	All	Since 2019
Citations	427	427
h-index	6	6
i10-index	6	6

Citations per year: 2021 (8), 2022 (124), 2023 (192), 2024 (100)

Co-authors

Edward Grefenstette - Director of Research, Google DeepMind | Honorary Professor, UCL - Verified email at google.com
Tim Rocktäschel - Professor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMind - Verified email at cs.ucl.ac.uk
Amy Zhang - Assistant Professor of Electrical and Computer Engineering at University of Texas at Austin - Verified email at austin.utexas.edu
Eric Hambro - Anthropic - Verified email at anthropic.com
David Scott Krueger - University Assistant Professor, University of Cambridge - Verified email at cam.ac.uk
Minqi Jiang - Research Scientist at Google DeepMind - Verified email at ucl.ac.uk
Roberta Raileanu - Research Scientist, Meta - Verified email at fb.com
Vitaly Kurin - Research Scientist at Isomorphic Labs - Verified email at isomorphiclabs.com
Mikayel Samvelyan - Meta AI & UCL - Verified email at meta.com
Fabio Petroni - Samaya AI - Verified email at samaya.ai
Heinrich Küttler - Inflection AI - Verified email at math.lmu.de
Jack Parker-Holder - Google DeepMind, UCL - Verified email at google.com
Ekdeep Singh Lubana - University of Michigan - Verified email at umich.edu
Usman Anwar - University of Cambridge - Verified email at cam.ac.uk
Hidenori Tanaka - Group Leader, NTT Research at Harvard University - Verified email at fas.harvard.edu
Robert Dick - University of Michigan, Stryd - Verified email at rpdmail.dyndns.org
Samyak Jain - Undergrad at Indian Institute of Technology (BHU), Varanasi - Verified email at itbhu.ac.in
Christoforos Nalmpantis - Postdoctoral Researcher, Fundamental AI Research at Meta - Verified email at fb.com
Jelena Luketina - Oxford University - Verified email at cs.ox.ac.uk
Ishita Mediratta - Meta - Verified email at meta.com

Robert Kirk

PhD Student, University College London

Verified email at ucl.ac.uk

AI Alignment, AI Safety, Language Models, Fine-tuning, Generalisation


Title	Cited by	Year
A survey of zero-shot generalisation in deep reinforcement learning. R Kirk, A Zhang, E Grefenstette, T Rocktäschel. Journal of Artificial Intelligence Research 76, 201-264, 2023	272*	2023
MiniHack the planet: A sandbox for open-ended reinforcement learning research. M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... arXiv preprint arXiv:2109.13202, 2021	70	2021
Understanding the effects of RLHF on LLM generalisation and diversity. R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... arXiv preprint arXiv:2310.06452, 2023	20	2023
Reward model ensembles help mitigate overoptimization. T Coste, U Anwar, R Kirk, D Krueger. arXiv preprint arXiv:2310.02743, 2023	19	2023
Insights from the NeurIPS 2021 NetHack challenge. E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	17	2022
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks. S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ... arXiv preprint arXiv:2311.12786, 2023	15	2023
Generalization to new sequential decision making tasks with in-context learning. SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu. arXiv preprint arXiv:2312.03801, 2023	4	2023
Graph backup: Data efficient backup exploiting Markovian transitions. Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette. arXiv preprint arXiv:2205.15824, 2022	4*	2022
A study of off-policy learning in environments with procedural content generation. A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel. ICLR Workshop on Agent Learning in Open-Endedness, 2022	4	2022
Leading the Pack: N-player Opponent Shaping. A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel. arXiv preprint arXiv:2312.12564, 2023	1	2023
Domain Generalization for Robust Model-Based Offline Reinforcement Learning. A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger. arXiv preprint arXiv:2211.14827, 2022	1	2022
What Mechanisms Does Knowledge Distillation Distill? C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger. UniReps: the First Workshop on Unifying Representations in Neural Models, 2023		2023

Articles 1–12
