Zhong Zheng's Website
Publications
Talks
Teaching
Awards

University of Pennsylvania
Email
Google Scholar
ORCID

Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition

Published in ICLR, 2025

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Sitemap

Follow:
Feed

© 2025 Zhong Zheng, Powered by Jekyll & AcademicPages, a fork of Minimal Mistakes.
Site last updated 2025-08-27