Aug 1, 2016 · Convex synthesis of optimal policies for Markov Decision Processes with sequentially-observed transitions.
Abstract: This paper extends finite state and action space Markov Decision Process (MDP) models by introducing a new type of measurement for the outcomes of actions.
The new measurement allows the next-state transition of an action to be observed sequentially, i.e., the actions are ordered and the next action outcome in ...
This new MDP model with sequential measurements is referred to as sequentially-observed MDP (SO-MDP). We show that the SO-MDP shares some similar properties ...
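One possible reading of the sequential-measurement protocol described above can be sketched in code. This is an illustrative interpretation only: the function name, the accept-or-pass decision rule, and the convention that the last ordered action must be taken are assumptions, not the paper's formal SO-MDP definition.

```python
import random

def so_mdp_step(state, actions, P, accept):
    """Reveal each ordered action's sampled next state in turn; `accept`
    decides whether to commit to the revealed transition or move on.
    By assumption here, the last action is taken if all earlier ones
    are declined."""
    for i, a in enumerate(actions):
        # Sample and reveal the next-state outcome of action `a`.
        states = range(len(P[a][state]))
        next_state = random.choices(states, weights=P[a][state])[0]
        if i == len(actions) - 1 or accept(state, a, next_state):
            return a, next_state

# Toy 2-state, 2-action example: commit to a revealed transition only
# if it leads to state 1, otherwise observe the next action in order.
P = {0: [[0.8, 0.2], [0.1, 0.9]],
     1: [[0.5, 0.5], [0.3, 0.7]]}
accept_if_good = lambda s, a, sp: sp == 1
action, nxt = so_mdp_step(0, [0, 1], P, accept_if_good)
```

Under this reading, the extra information (seeing an outcome before committing) is what distinguishes an SO-MDP policy from a standard MDP policy.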
The proposed Markov model is applicable to decision-making for both single- and multi-agent systems in stochastic environments. Our particular interest is ...
Dec 13, 2023 · First, as in MDPs, the convex formulation can provide crucial insights into the structure of optimal policies and value functions.
“Convex synthesis of optimal policies for Markov decision processes with sequentially-observed transitions,” in American Control Conference (ACC), pp. 3862 ...
Oct 19, 2015 · An efficient algorithm based on Linear Programming (LP) and duality theory is proposed, which gives the convex set of feasible policies and ...
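To make the LP-and-duality approach concrete, here is a minimal sketch of the standard dual (occupancy-measure) LP for an ordinary discounted MDP, solved with `scipy.optimize.linprog`. The 2-state, 2-action instance and all numbers are hypothetical; the paper's SO-MDP formulation adds structure beyond this baseline.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical 2-state, 2-action discounted MDP (illustrative only).
S, A = 2, 2
gamma = 0.9
P = np.array([[[0.8, 0.2], [0.1, 0.9]],   # P[a, s, s']
              [[0.5, 0.5], [0.3, 0.7]]])
R = np.array([[1.0, 0.0],                 # R[s, a]
              [0.0, 2.0]])
mu0 = np.array([0.5, 0.5])                # initial state distribution

# Variable: occupancy measure x[s, a], flattened row-major.
c = -R.flatten()                          # linprog minimizes, so negate
# Flow constraints: sum_a x[s',a] - gamma * sum_{s,a} P(s'|s,a) x[s,a] = mu0(s')
A_eq = np.zeros((S, S * A))
for sp in range(S):
    for s in range(S):
        for a in range(A):
            A_eq[sp, s * A + a] = (sp == s) - gamma * P[a, s, sp]
res = linprog(c, A_eq=A_eq, b_eq=mu0, bounds=[(0, None)] * (S * A))

x = res.x.reshape(S, A)
# Recover a stationary policy from the occupancy measure.
policy = x / x.sum(axis=1, keepdims=True)
```

The feasible set of occupancy measures is a polytope, which is what makes policy synthesis a convex (here linear) program; an optimal stationary policy falls out of the optimal vertex.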
We study the problem of policy synthesis for uncertain partially observable Markov decision processes (uPOMDPs). The transition probability function of ...