Zhong Zheng's Website
Publications
Talks
Teaching
Awards

University of Pennsylvania
Email
Google Scholar
ORCID

Federated Q-Learning with Reference-Advantage Decomposition: Almost Optimal Regret and Logarithmic Communication Cost

Published in ICLR, 2025

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Sitemap

Follow:
Feed

© 2025 Zhong Zheng, Powered by Jekyll & AcademicPages, a fork of Minimal Mistakes.
Site last updated 2025-08-27