PEGASUS: A policy search method for large MDPs and POMDPs

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP). Our approach is based on the following observation: any (PO)MDP can be transformed into an "equivalent" POMDP in which all state transitions (given the current state and action) are deterministic. Our method applies to arbitrary POMDPs, including ones with infinite state and action spaces, and we also present empirical results for the approach.
Ng, A.Y., Jordan, M.: PEGASUS: A policy search method for large MDPs and POMDPs. In: Proc. of Uncertainty in Artificial Intelligence (2000).
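To make the transformation concrete, here is a minimal sketch (not the authors' code) of the idea in Python: the simulator's randomness is pre-sampled as a fixed set of "scenarios", which makes each rollout, and hence the value estimate, a deterministic function of the policy parameters, so policy search reduces to ordinary deterministic optimization of that estimate. The toy 1-D problem, the parameter grid, and names like `value_estimate` are illustrative assumptions, not from the paper.

```python
import numpy as np

GAMMA = 0.95        # discount factor
HORIZON = 50        # truncated horizon for the value estimate
N_SCENARIOS = 100   # number of pre-sampled scenarios

def step(state, action, u):
    # Deterministic transition: the disturbance is built from the fixed
    # uniform numbers u supplied by the scenario, not drawn inside the step.
    noise = np.sqrt(-2.0 * np.log(1.0 - u[0])) * np.cos(2.0 * np.pi * u[1])  # Box-Muller
    next_state = state + action + 0.1 * noise
    reward = -abs(next_state)  # reward for staying near the origin
    return next_state, reward

def policy(state, theta):
    # Simple parametric policy: a clipped linear controller.
    return float(np.clip(-theta * state, -1.0, 1.0))

# Pre-sample every random number once; each scenario fixes all the
# randomness one rollout will ever need.
rng = np.random.default_rng(0)
scenarios = rng.uniform(size=(N_SCENARIOS, HORIZON, 2))

def value_estimate(theta, start_state=1.0):
    # Average discounted return over the fixed scenarios. Because the same
    # scenarios are reused for every theta, this is a deterministic function
    # of the policy parameters, and any off-the-shelf optimizer can search it.
    total = 0.0
    for scenario in scenarios:
        state, ret, discount = start_state, 0.0, 1.0
        for u in scenario:
            state, reward = step(state, policy(state, theta), u)
            ret += discount * reward
            discount *= GAMMA
        total += ret
    return total / N_SCENARIOS

# Policy search: pick the parameter with the highest deterministic estimate.
best_theta = max(np.linspace(0.0, 2.0, 21), key=value_estimate)
print(f"best theta: {best_theta:.2f}, estimated value: {value_estimate(best_theta):.3f}")
```

Reusing the same scenarios across candidate policies is the design point: it removes the simulation noise that would otherwise make comparisons between nearby policies unreliable.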