Follow
Dylan Hadfield-Menell
Dylan Hadfield-Menell
Verified email at csail.mit.edu - Homepage
Title
Cited by
Cited by
Year
Cooperative inverse reinforcement learning
D Hadfield-Menell, SJ Russell, P Abbeel, A Dragan
Advances in neural information processing systems 29, 2016
6932016
Inverse reward design
D Hadfield-Menell, S Milli, P Abbeel, SJ Russell, A Dragan
Advances in neural information processing systems 30, 2017
4032017
The off-switch game
D Hadfield-Menell, A Dragan, P Abbeel, S Russell
Workshops at the Thirty-First AAAI Conference on Artificial Intelligence, 2017
1442017
Open problems and fundamental limitations of reinforcement learning from human feedback
S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ...
arXiv preprint arXiv:2307.15217, 2023
1152023
Pragmatic-pedagogic value alignment
JF Fisac, MA Gates, JB Hamrick, C Liu, D Hadfield-Menell, ...
Robotics Research: The 18th International Symposium ISRR, 49-57, 2020
882020
On the geometry of adversarial examples
M Khoury, D Hadfield-Menell
arXiv preprint arXiv:1811.00525, 2018
782018
Guided search for task and motion plans using learned heuristics
R Chitnis, D Hadfield-Menell, A Gupta, S Srivastava, E Groshev, C Lin, ...
2016 IEEE International Conference on Robotics and Automation (ICRA), 447-454, 2016
772016
Toward transparent ai: A survey on interpreting the inner structures of deep neural networks
T Räuker, A Ho, S Casper, D Hadfield-Menell
2023 IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 464-483, 2023
742023
Should robots be obedient?
S Milli, D Hadfield-Menell, A Dragan, S Russell
arXiv preprint arXiv:1705.09990, 2017
712017
Incomplete contracting and AI alignment
D Hadfield-Menell, GK Hadfield
Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 417-422, 2019
642019
What are you optimizing for? aligning recommender systems with human values
J Stray, I Vendrov, J Nixon, S Adler, D Hadfield-Menell
arXiv preprint arXiv:2107.10939, 2021
592021
Conservative agency via attainable utility preservation
AM Turner, D Hadfield-Menell, P Tadepalli
Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 385-391, 2020
562020
Expressive robot motion timing
A Zhou, D Hadfield-Menell, A Nagabandi, AD Dragan
Proceedings of the 2017 ACM/IEEE international conference on human-robot …, 2017
552017
Modular task and motion planning in belief space
D Hadfield-Menell, E Groshev, R Chitnis, P Abbeel
2015 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2015
532015
On the utility of model learning in hri
R Choudhury, G Swamy, D Hadfield-Menell, AD Dragan
2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2019
512019
Consequences of misaligned AI
S Zhuang, D Hadfield-Menell
Advances in Neural Information Processing Systems 33, 15763-15773, 2020
502020
The assistive multi-armed bandit
L Chan, D Hadfield-Menell, S Srinivasa, A Dragan
2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI …, 2019
492019
Unifying scene registration and trajectory optimization for learning from demonstrations with application to manipulation of deformable objects
AX Lee, SH Huang, D Hadfield-Menell, E Tzeng, P Abbeel
2014 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2014
412014
Simplifying reward design through divide-and-conquer
E Ratner, D Hadfield-Menell, AD Dragan
arXiv preprint arXiv:1806.02501, 2018
372018
Building human values into recommender systems: An interdisciplinary synthesis
J Stray, A Halevy, P Assar, D Hadfield-Menell, C Boutilier, A Ashar, ...
ACM Transactions on Recommender Systems, 2022
352022
The system can't perform the operation now. Try again later.
Articles 1–20