Követés
Yasin Abbasi Yadkori
Yasin Abbasi Yadkori
Google DeepMind
E-mail megerősítve itt: google.com - Kezdőlap
Cím
Hivatkozott rá
Hivatkozott rá
Év
Improved algorithms for linear stochastic bandits
Y Abbasi-Yadkori, C Szepesvári, D Pal
Advances in Neural Information Processing Systems, 2312-2320, 2011
20502011
Regret Bounds for the Adaptive Control of Linear Quadratic Systems.
Y Abbasi-Yadkori, C Szepesvári
COLT, 1-26, 2011
4412011
Fast approximate nearest-neighbor search with k-nearest neighbor graph
K Hajebi, Y Abbasi-Yadkori, H Shahbazi, H Zhang
Twenty-Second International Joint Conference on Artificial Intelligence, 2011
2842011
Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits.
Y Abbasi-Yadkori, D Pal, C Szepesvari
AISTATS 22, 1-9, 2012
1912012
Sharp Convergence Rates for Langevin Dynamics in the Nonconvex Setting
X Cheng, NS Chatterji, Y Abbasi-Yadkori, PL Bartlett, MI Jordan
arXiv preprint arXiv:1805.01648, 2018
1862018
POLITEX: Regret bounds for policy iteration using expert prediction
Y Abbasi-Yadkori, P Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz
Proceedings of the 36th International Conference on Machine Learning 97 …, 2019
1482019
POLITEX: Regret Bounds for Policy Iteration Using Expert Prediction
Y Abbasi-Yadkori, PL Bartlett, K Bhatia, N Lazic, C Szepesvári, G Weisz
1482019
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Y Abbasi-Yadkori, N Lazic, C Szepesvari
The 22nd International Conference on Artificial Intelligence and Statistics, 2019
120*2019
Conservative contextual linear bandits
A Kazerouni, M Ghavamzadeh, YA Yadkori, B Van Roy
Advances in Neural Information Processing Systems, 3910-3919, 2017
1182017
Model selection in contextual stochastic bandit problems
A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ...
Advances in Neural Information Processing Systems 33, 10328-10337, 2020
1052020
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
Y Abbasi-Yadkori, P Bartlett, V Kanade, Y Seldin, C Szepesvari
Neural Information Processing Systems, 2013
1042013
Online Learning for Linearly Parametrized Control Problems
Y Abbasi-Yadkori
University of Alberta, 2012
902012
Online least squares estimation with self-normalized processes: An application to bandit problems
Y Abbasi-Yadkori, D Pál, C Szepesvári
arXiv preprint arXiv:1102.2670, 2011
772011
Offline Evaluation of Ranking Policies with Click Models
S Li, Y Abbasi-Yadkori, B Kveton, S Muthukrishnan, V Vinay, Z Wen
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018
752018
Prediction with limited advice and multiarmed bandits with paid observations
Y Seldin, P Bartlett, K Crammer, Y Abbasi-Yadkori
International Conference on Machine Learning, 280-287, 2014
752014
Bayesian Optimal Control of Smoothly Parameterized Systems
Y Abbasi-Yadkori, C Szepesvári
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2015
72*2015
Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic Environments.
Y Seldin, C Szepesvári, P Auer, Y Abbasi-Yadkori
EWRL, 103-116, 2012
672012
Bootstrapping upper confidence bound
B Hao, YA Yadkori, Z Wen, G Cheng
Advances in Neural Information Processing Systems, 12123-12133, 2019
622019
Bootstrapping upper confidence bound
B Hao, YA Yadkori, Z Wen, G Cheng
Advances in Neural Information Processing Systems, 12123-12133, 2019
622019
Linear Programming for Large-Scale Markov Decision Problems
Y Abbasi-Yadkori, P Bartlett, A Malek
Proceedings of the 31st International Conference on Machine Learning (ICML …, 2014
58*2014
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20