Lihong Li (李力鸿)

Cited by

	All	Since 2019
Citations	26113	18113
h-index	66	57
i10-index	101	86

3700

1850

925

2775

2008200920102011201220132014201520162017201820192020202120222023202495 184 214 340 437 546 594 835 990 1228 1762 2349 3040 3561 3424 3694 2044

Public access

View all

14 articles

0 articles

available

not available

Based on funding mandates

Co-authors

John LangfordMicrosoft Research New YorkVerified email at hunch.net
Michael LittmanBrown UniversityVerified email at brown.edu
Jianfeng GaoMicrosoft Research, RedmondVerified email at microsoft.com
Wei Chu（褚崴）InfVerified email at gatsby.ucl.ac.uk
Li DengChief AI Officer, Citadel (former)Verified email at ieee.org
Robert SchapireMicrosoft ResearchVerified email at microsoft.com
Bo DaiGoogle Brain & Georgia TechVerified email at google.com
Denny ZhouResearch Scientist, Google DeepMindVerified email at google.com
Jianshu ChenPrincipal Scientist, AmazonVerified email at ucla.edu
Asli CelikyilmazResearcher @ FAIR at Meta AIVerified email at ieee.org
Dale SchuurmansUniversity of Alberta, Google DeepMindVerified email at cs.ualberta.ca
Zachary C. LiptonRaj Reddy Associate Professor of Machine Learning @ Carnegie Mellon University; CTO + CSO @ AbridgeVerified email at cmu.edu
Yun-Nung (Vivian) ChenNational Taiwan UniversityVerified email at ieee.org
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Faisal Ahmed, PhDMicrosoftVerified email at microsoft.com
Thomas J. WalshSony AIVerified email at sony.com
Miroslav DudikMicrosoft ResearchVerified email at microsoft.com
Xiujun LiUniversity of Washington / AppleVerified email at cs.washington.edu
Chong WangAppleVerified email at cs.princeton.edu
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca

Lihong Li (李力鸿)

Amazon

Verified email at amazon.com - Homepage

Reinforcement Learning Machine Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A contextual-bandit approach to personalized news article recommendation L Li, W Chu, J Langford, RE Schapire Proceedings of the 19th international conference on World wide web, 661-670, 2010	3380	2010
An empirical evaluation of thompson sampling O Chapelle, L Li Advances in neural information processing systems 24, 2011	1806	2011
Parallelized stochastic gradient descent M Zinkevich, M Weimer, L Li, A Smola Advances in neural information processing systems 23, 2010	1750	2010
Contextual bandits with linear payoff functions W Chu, L Li, L Reyzin, R Schapire Proceedings of the Fourteenth International Conference on Artificial …, 2011	1224	2011
Neural approaches to conversational AI J Gao, M Galley, L Li The 41st international ACM SIGIR conference on research & development in …, 2018	908	2018
Doubly robust policy evaluation and learning M Dudík, J Langford, L Li arXiv preprint arXiv:1103.4601, 2011	872	2011
Doubly Robust Policy Evaluation and Learning M Dudık, J Langford, L Li	872*
Doubly robust off-policy value evaluation for reinforcement learning N Jiang, L Li International conference on machine learning, 652-661, 2016	815	2016
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms L Li, W Chu, J Langford, X Wang Proceedings of the fourth ACM international conference on Web search and …, 2011	672	2011
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006	634	2006
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006	604	2006
Sparse Online Learning via Truncated Gradient. J Langford, L Li, T Zhang Journal of Machine Learning Research 10 (3), 2009	593	2009
Taming the monster: A fast and simple algorithm for contextual bandits A Agarwal, D Hsu, S Kale, J Langford, L Li, R Schapire International conference on machine learning, 1638-1646, 2014	568	2014
Towards end-to-end reinforcement learning of dialogue agents for information access B Dhingra, L Li, X Li, J Gao, YN Chen, F Ahmed, L Deng arXiv preprint arXiv:1609.00777, 2016	530*	2016
Doubly robust policy evaluation and optimization M Dudík, D Erhan, J Langford, L Li	475	2014
End-to-end task-completion neural dialogue systems X Li, YN Chen, L Li, J Gao, A Celikyilmaz arXiv preprint arXiv:1703.01008, 2017	457	2017
Neuro-symbolic program synthesis E Parisotto, A Mohamed, R Singh, L Li, D Zhou, P Kohli arXiv preprint arXiv:1611.01855, 2016	407	2016
Reinforcement Learning in Finite MDPs: PAC Analysis. AL Strehl, L Li, ML Littman Journal of Machine Learning Research 10 (11), 2009	377	2009
Breaking the curse of horizon: Infinite-horizon off-policy estimation Q Liu, L Li, Z Tang, D Zhou Advances in neural information processing systems 31, 2018	376	2018
Contextual bandit algorithms with supervised learning guarantees A Beygelzimer, J Langford, L Li, L Reyzin, RE Schapire Arxiv preprint arXiv:1002.4058, 2010	361	2010

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors