Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation.

AllBooks Videos Images Maps News Shopping

Scholarly articles for Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation.

scholar.google.com › citations

… in reinforcement learning via quasi-median operation
Wei · Cited by 11

Controlling Underestimation Bias in Reinforcement Learning via Quasi ...

Jun 28, 2022 · In this paper, we propose the Quasi-Median Operation, a novel way to mitigate the underestimation bias by selecting the quasi-median from ...

[PDF] Controlling Underestimation Bias in Reinforcement Learning via Quasi ...

ojs.aaai.org › article › download

In this paper, we propose the Quasi-Median. Operation, a novel way to mitigate the underestimation bias by selecting the quasi-median from multiple state-action ...

Controlling Underestimation Bias in Reinforcement Learning via Quasi ...

www.semanticscholar.org › paper › Cont...

Theoretically, the underestimation bias of the method is improved while the estimation variance is significantly reduced compared to Maxmin Q-learning, ...

Controlling Underestimation Bias in Reinforcement Learning via Quasi ...

www.researchgate.net › publication › 36...

Based on the quasi-median operation, we propose Quasi-Median Q-learning (QMQ) for the discrete action tasks and Quasi-Median Delayed Deep Deterministic Policy ...

‪Lin Li‬ - ‪Google Scholar‬

scholar.google.com › citations

Controlling underestimation bias in reinforcement learning via quasi-median operation. W Wei, Y Zhang, J Liang, L Li, Y Li. Proceedings of the AAAI Conference ...

Controlling Estimation Error in Reinforcement Learning via ...

www.researchgate.net › publication › 38...

May 23, 2024 · Controlling Underestimation Bias in Reinforcement Learning via Quasi-median Operation ... underestimation bias by selecting the quasi-median from ...

Papers - AAAI2022

aaai-2022.virtualchair.net › papers

Controlling Underestimation Bias in Reinforcement Learning via Quasi-Median Operation. Wei Wei, Yujia Zhang, Jiye Liang, Lin Li, Yuze Li. [AAAI-22] Main Track.

[PDF] On the Estimation Bias in Double Q-Learning - NeurIPS

proceedings.neurips.cc › paper › file

Double Q-learning is a classical method for reducing overestimation bias, which is caused by taking maximum estimated values in the Bellman operation.

[PDF] Promoting Exploration of Ensemble Policies in Continuous Control - arXiv

arxiv.org › pdf

Oct 17, 2023 · Controlling underestimation bias in reinforcement learning via quasi-median operation. Proceedings of the AAAI Conference on Artificial ...

Images

View all

PDF] Controlling Underestimation Bias in Reinforcement Learning ...