Mohammad Ghavamzadeh

Cited by

	All	Since 2019
Citations	13065	9654
h-index	57	44
i10-index	118	110

Citations per year

Year	Citations
2005	36
2006	50
2007	58
2008	69
2009	107
2010	104
2011	181
2012	232
2013	200
2014	263
2015	297
2016	355
2017	412
2018	578
2019	916
2020	1186
2021	1676
2022	2093
2023	2805
2024	970

Public access

14 articles available
0 articles not available

Based on funding mandates

Co-authors

Yinlam Chow - Research Scientist, Google Research - Verified email at google.com
Alessandro Lazaric - Research Scientist, Facebook Artificial Intelligence Research - Verified email at inria.fr
Shie Mannor - Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research - Verified email at technion.ac.il
Branislav Kveton - Amazon - Verified email at amazon.com
Sridhar Mahadevan - Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst - Verified email at cs.umass.edu
Rémi Munos - DeepMind - Verified email at inria.fr
Csaba Szepesvari - DeepMind & University of Alberta - Verified email at cs.ualberta.ca
Georgios Theocharous - Adobe Research - Verified email at adobe.com
Amir-massoud Farahmand - University of Toronto - Verified email at cs.toronto.edu
Craig Boutilier - Principal Scientist, Google - Verified email at google.com
Marek Petrik - University of New Hampshire - Verified email at cs.unh.edu
Ofir Nachum - OpenAI - Verified email at openai.com
Philip Thomas - University of Massachusetts Amherst - Verified email at cs.umass.edu
Shalabh Bhatnagar - Professor in the Department of Computer Science and Automation, Indian Institute of Science - Verified email at iisc.ac.in
Richard S. Sutton - Keen, Amii, and University of Alberta - Verified email at richsutton.com
Hung Bui - Research Scientist, Google DeepMind - Verified email at google.com
Zheng Wen - Google DeepMind - Verified email at google.com
Aviv Tamar - Technion - Verified email at technion.ac.il
Bo Liu - AAAI SM, IEEE SM - Verified email at cs.umass.edu
Matthieu Geist - Cohere (ex Google, on leave of Professor, Université de Lorraine) - Verified email at univ-lorraine.fr

Mohammad Ghavamzadeh

Amazon

Verified email at amazon.com

Reinforcement Learning, Online Learning, Machine Learning, Control, AI


Title	Authors	Venue	Cited by	Year
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges	M Abdar, F Pourpanah, S Hussain, D Rezazadegan, L Liu, ...	Information Fusion, 2021	1678	2021
Natural Actor–critic Algorithms	S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee	Automatica 45 (11), 2471-2482, 2009	1087*	2009
A Lyapunov-based Approach to Safe Reinforcement Learning	Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh	Neural Information Processing Systems, 8103-8112, 2018	528	2018
Bayesian Reinforcement Learning: A Survey	M Ghavamzadeh, S Mannor, J Pineau, A Tamar	Foundations and Trends in Machine Learning 8 (5-6), 359-483, 2015	524	2015
Risk-constrained Reinforcement Learning with Percentile Risk Criteria	Y Chow, M Ghavamzadeh, L Janson, M Pavone	Journal of Machine Learning Research (JMLR) 18, 6070-6120, 2017	497	2017
Algorithms for CVaR Optimization in MDPs	Y Chow, M Ghavamzadeh	Advances in Neural Information Processing Systems, 3509-3517, 2014	397	2014
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence	V Gabillon, M Ghavamzadeh, A Lazaric	Neural Information Processing Systems, 3221-3229, 2012	339	2012
Actor-Critic Algorithms for Risk-sensitive MDPs	LA Prashanth, M Ghavamzadeh	Neural Information Processing Systems, 252-260, 2013	336*	2013
High-confidence Off-policy Evaluation	P Thomas, G Theocharous, M Ghavamzadeh	AAAI, 3000-3006, 2015	303	2015
More Robust Doubly Robust Off-policy Evaluation	M Farajtabar, Y Chow, M Ghavamzadeh	ICML, 1447-1456, 2018	251	2018
Safe Policy Learning for Continuous Control	Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh	Conference on Robot Learning (CoRL), 2020	246*	2020
High Confidence Policy Improvement	P Thomas, G Theocharous, M Ghavamzadeh	ICML, 2380-2388, 2015	213	2015
Speedy Q-learning	M Ghavamzadeh, H Kappen, M Azar, R Munos	Neural Information Processing Systems 24, 2411-2419, 2011	202*	2011
Supervised actor-critic reinforcement learning	MT Rosenstein, AG Barto, J Si, A Barto, W Powell, D Wunsch	Learning and approximate dynamic programming: scaling up to the real world …, 2004	197	2004
Hierarchical Multi-agent Reinforcement Learning	R Makar, S Mahadevan, M Ghavamzadeh	International Conference on Autonomous Agents, 246-253, 2001	194	2001
Personalized Ad Recommendation Systems for Life-time Value Optimization with Guarantees	G Theocharous, PS Thomas, M Ghavamzadeh	IJCAI, 1806-1812, 2015	190*	2015
Benchmarking Batch Deep Reinforcement Learning Algorithms	S Fujimoto, E Conti, M Ghavamzadeh, J Pineau	arXiv preprint arXiv:1910.01708, 2019	189	2019
Finite-Sample Analysis of Proximal Gradient TD Algorithms	B Liu, J Liu, M Ghavamzadeh, S Mahadevan, M Petrik	UAI, 504-513, 2015	172*	2015
Hierarchical Multi-agent Reinforcement Learning	M Ghavamzadeh, S Mahadevan, R Makar	Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) 13 (2), 197-229, 2006	172	2006
Regularized Policy Iteration	AM Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor	Neural Information Processing Systems, 441-448, 2008	162	2008
