Algorithms for sequential decision-making
Publisher:
  • Brown University
  • Department of Computer Science, Box 1910, Providence, RI
  • United States
ISBN: 978-0-591-16350-6
Order Number: AAI9709069
Pages: 263
Abstract

Sequential decision making is a fundamental task faced by any intelligent agent in an extended interaction with its environment; it is the act of answering the question "What should I do now?" In this thesis, I show how to answer this question when "now" is one of a finite set of states, "do" is one of a finite set of actions, "should" is maximize a long-run measure of reward, and "I" is an automated planning or learning system (agent). In particular, I collect basic results concerning methods for finding optimal (or near-optimal) behavior in several different kinds of model environments: Markov decision processes, in which the agent always knows its state; partially observable Markov decision processes (POMDPs), in which the agent must piece together its state on the basis of observations it makes; and Markov games, in which the agent is in direct competition with an opponent. The thesis is written from a computer-science perspective, meaning that many mathematical details are not discussed, and descriptions of algorithms and the complexity of problems are emphasized. New results include an improved algorithm for solving POMDPs exactly over finite horizons, a method for learning minimax-optimal policies for Markov games, a pseudopolynomial bound for policy iteration, and a complete complexity theory for finding zero-reward POMDP policies.
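
To make the setting concrete, the following is a minimal value-iteration sketch for a finite MDP, the simplest of the three model classes the thesis covers. The two-state, two-action model and every name below are illustrative assumptions, not code or data from the thesis itself.

import numpy as np

def value_iteration(P, R, gamma=0.9, tol=1e-8):
    # P[a][s, s'] : transition probabilities for action a
    # R[a][s]     : expected immediate reward for action a in state s
    n_states = P[0].shape[0]
    V = np.zeros(n_states)
    while True:
        # Bellman optimality backup:
        # V(s) <- max_a [ R(s, a) + gamma * sum_{s'} P(s'|s, a) V(s') ]
        Q = np.array([R[a] + gamma * P[a] @ V for a in range(len(P))])
        V_new = Q.max(axis=0)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=0)  # optimal values and a greedy policy
        V = V_new

# Hypothetical two-state, two-action MDP, for illustration only.
P = [np.array([[0.8, 0.2], [0.1, 0.9]]),   # transitions under action 0
     np.array([[0.5, 0.5], [0.6, 0.4]])]   # transitions under action 1
R = [np.array([1.0, 0.0]),                 # rewards under action 0
     np.array([0.0, 2.0])]                 # rewards under action 1

V_star, policy = value_iteration(P, R)
print(V_star, policy)

Policy iteration, whose pseudopolynomial bound is among the new results listed above, replaces the repeated Bellman sweeps with alternating exact policy evaluation and greedy improvement; the POMDP and Markov-game algorithms generalize these backups to belief states and to minimax values, respectively.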

Cited By

  1. Zhang J, Zheng Y, Zhang C, Zhao L, Song L, Zhou Y and Bian J Robust situational reinforcement learning in face of context disturbances Proceedings of the 40th International Conference on Machine Learning, (41973-41989)
  2. Huang D, Xu D, Zhu Y, Garg A, Savarese S, Fei-Fei L and Niebles J Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (2635-2642)
  3. Sridharan M, Gelfond M, Zhang S and Wyatt J (2019). REBA, Journal of Artificial Intelligence Research, 65:1, (87-180), Online publication date: 1-May-2019.
  4. Chatterjee K, Elgyütt A, Novotný P and Rouillé O Expectation optimization with probabilistic guarantees in POMDPs with discounted-sum objectives Proceedings of the 27th International Joint Conference on Artificial Intelligence, (4692-4699)
  5. Horák K, Bošanský B and Chatterjee K Goal-HSVI Proceedings of the 27th International Joint Conference on Artificial Intelligence, (4764-4770)
  6. Dezhabad N and Sharifian S (2018). Learning-based dynamic scalable load-balanced firewall as a service in network function-virtualized cloud computing environments, The Journal of Supercomputing, 74:7, (3329-3358), Online publication date: 1-Jul-2018.
  7. Chen Y, Kochenderfer M and Spaan M Improving Offline Value-Function Approximations for POMDPs by Reducing Discount Factors 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (3531-3536)
  8. Boros E, Elbassioni K, Fouz M, Gurvich V, Makino K and Manthey B (2018). Approximation Schemes for Stochastic Mean Payoff Games with Perfect Information and Few Random Positions, Algorithmica, 80:11, (3132-3157), Online publication date: 1-Nov-2018.
  9. Zhang T, Xie S and Rose O Real-time job shop scheduling based on simulation and Markov decision processes Proceedings of the 2017 Winter Simulation Conference, (1-9)
  10. Nachum O, Norouzi M, Xu K and Schuurmans D Bridging the gap between value and policy based reinforcement learning Proceedings of the 31st International Conference on Neural Information Processing Systems, (2772-2782)
  11. Walraven E and Spaan M Accelerated vector pruning for optimal POMDP solvers Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, (3672-3678)
  12. Chatterjee K, Novotný P, Perez G, Raskin J and Zikelic D Optimizing expectation with guarantees in POMDPs Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, (3725-3732)
  13. Asadi K and Littman M An alternative softmax operator for reinforcement learning Proceedings of the 34th International Conference on Machine Learning - Volume 70, (243-252)
  14. Bharadwaj S, Le Roux S, Pérez G and Topcu U Reduction techniques for model checking and learning in MDPs Proceedings of the 26th International Joint Conference on Artificial Intelligence, (4273-4279)
  15. Whitney D, Rosen E, MacGlashan J, Wong L and Tellex S Reducing errors in object-fetching interactions through social feedback 2017 IEEE International Conference on Robotics and Automation (ICRA), (1006-1013)
  16. Sun J (2016). Marginal quality-based long-term incentive mechanisms for crowd sensing, International Journal of Communication Systems, 29:5, (942-958), Online publication date: 25-Mar-2016.
  17. Grzes M and Poupart P POMDP planning and execution in an augmented space Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems, (757-764)
  18. Hansen T, Miltersen P and Zwick U (2013). Strategy Iteration Is Strongly Polynomial for 2-Player Turn-Based Stochastic Games with a Constant Discount Factor, Journal of the ACM, 60:1, (1-16), Online publication date: 1-Feb-2013.
  19. Simari G, Dickerson J, Sliva A and Subrahmanian V (2013). Parallel Abductive Query Answering in Probabilistic Logic Programs, ACM Transactions on Computational Logic, 14:2, (1-39), Online publication date: 1-Jun-2013.
  20. Baier C, Grösser M and Bertrand N (2012). Probabilistic ω-automata, Journal of the ACM, 59:1, (1-52), Online publication date: 1-Feb-2012.
  21. Reddi S and Brunskill E Incentive decision processes Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, (418-427)
  22. Vassev E, Hinchey M, Gaudin B and Nixon P Requirements and initial model for KnowLang Proceedings of The Fourth International C* Conference on Computer Science and Software Engineering, (35-42)
  23. Bigus J, Campbell M, Carmeli B, Cefkin M, Chang H, Chen-Ritzo C, Cody W, Ebadollahi S, Evfimievski A, Farkash A, Glissmann S, Gotz D, Grandison T, Gruhl D, Haas P, Hsiao M, Hsueh P, Hu J, Jasinski J, Kaufman J, Kieliszewski C, Kohn M, Knoop S, Maglio P, Mak R, Nelken H, Neti C, Neuvirth H, Pan Y, Peres Y, Ramakrishnan S, Rosen-Zvi M, Renly S, Selinger P, Shabo A, Sorrentino R, Sun J, Syeda-Mahmood T, Tan W, Tao Y, Yaesoubi R and Zhu X (2011). Information technology for healthcare transformation, IBM Journal of Research and Development, 55:5, (492-505), Online publication date: 1-Sep-2011.
  24. Madani O, Thorup M and Zwick U (2010). Discounted deterministic Markov decision processes and discounted all-pairs shortest paths, ACM Transactions on Algorithms (TALG), 6:2, (1-25), Online publication date: 1-Mar-2010.
  25. Besse C and Chaib-draa B Quasi deterministic POMDPs and DecPOMDPs Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems - Volume 1, (1393-1394)
  26. Fern A and Tadepalli P A computational decision theory for interactive assistants Proceedings of the 23rd International Conference on Neural Information Processing Systems - Volume 1, (577-585)
  27. Boros E, Elbassioni K, Gurvich V and Makino K A pumping algorithm for ergodic stochastic mean payoff games with perfect information Proceedings of the 14th international conference on Integer Programming and Combinatorial Optimization, (341-354)
  28. Sharma R and Gopal M (2010). Review article, Applied Soft Computing, 10:3, (675-688), Online publication date: 1-Jun-2010.
  29. Chatterjee K, Doyen L and Henzinger T Qualitative analysis of partially-observable Markov decision processes Proceedings of the 35th international conference on Mathematical foundations of computer science, (258-269)
  30. Simari G, Dickerson J and Subrahmanian V Cost-based query answering in action probabilistic logic programs Proceedings of the 4th international conference on Scalable uncertainty management, (319-332)
  31. Baier C On model checking techniques for randomized distributed systems Proceedings of the 8th international conference on Integrated formal methods, (1-11)
  32. Walsh T, Nouri A, Li L and Littman M (2009). Learning and planning in environments with delayed feedback, Autonomous Agents and Multi-Agent Systems, 18:1, (83-105), Online publication date: 1-Feb-2009.
  33. Madani O, Thorup M and Zwick U Discounted deterministic Markov decision processes and discounted all-pairs shortest paths Proceedings of the twentieth annual ACM-SIAM symposium on Discrete algorithms, (958-967)
  34. Bonet B Deterministic POMDPs revisited Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, (59-66)
  35. Andersson D and Miltersen P The Complexity of Solving Stochastic Games on Graphs Proceedings of the 20th International Symposium on Algorithms and Computation, (112-121)
  36. Hajishirzi H, Shirazi A, Choi J and Amir E Greedy algorithms for sequential sensing decisions Proceedings of the 21st International Joint Conference on Artificial Intelligence, (1908-1915)
  37. Adam A, Rabinovich Z and Rosenschein J Dynamics based control with PSRs Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1, (387-394)
  38. An X, Xiang Y and Cercone N (2008). Dynamic multiagent probabilistic inference, International Journal of Approximate Reasoning, 48:1, (185-213), Online publication date: 1-Apr-2008.
  39. Ross S, Pineau J, Paquet S and Chaib-draa B (2008). Online planning algorithms for POMDPs, Journal of Artificial Intelligence Research, 32:1, (663-704), Online publication date: 1-May-2008.
  40. Ross S and Chaib-Draa B AEMS Proceedings of the 20th international joint conference on Artifical intelligence, (2592-2598)
  41. Walsh T, Nouri A, Li L and Littman M Planning and Learning in Environments with Delayed Feedback Proceedings of the 18th European conference on Machine Learning, (442-453)
  42. Shahaf D and Amir E Learning partially observable action schemas Proceedings of the 21st national conference on Artificial intelligence - Volume 1, (913-919)
  43. Chatterjee K, Doyen L, Henzinger T and Raskin J Algorithms for omega-regular games with imperfect information Proceedings of the 20th international conference on Computer Science Logic, (287-302)
  44. Pineau J, Gordon G and Thrun S (2006). Anytime point-based approximations for large POMDPs, Journal of Artificial Intelligence Research, 27:1, (335-380), Online publication date: 1-Sep-2006.
  45. Paquet S, Tobin L and Chaib-draa B An online POMDP algorithm for complex multiagent environments Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, (970-977)
  46. Amir E Learning partially observable deterministic action models Proceedings of the 19th international joint conference on Artificial intelligence, (1433-1439)
  47. Gosavi A (2004). A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward, Machine Learning, 55:1, (5-29), Online publication date: 1-Apr-2004.
  48. Izadi M and Precup D A planning algorithm for predictive state representations Proceedings of the 18th international joint conference on Artificial intelligence, (1520-1521)
  49. Madani O, Hanks S and Condon A (2003). On the undecidability of probabilistic planning and related stochastic optimization problems, Artificial Intelligence, 147:1-2, (5-34), Online publication date: 1-Jul-2003.
  50. Lőrincz A, Pólik I and Szita I (2003). Event-learning and robust policy heuristics, Cognitive Systems Research, 4:4, (319-337), Online publication date: 1-Dec-2003.
  51. Chades I, Scherrer B and Charpillet F A heuristic approach for solving decentralized-POMDP Proceedings of the 2002 ACM symposium on Applied computing, (57-62)
  52. Madani O On policy iteration as a Newton's method and polynomial policy iteration algorithms Eighteenth national conference on Artificial intelligence, (273-278)
  53. Madani O Polynomial value iteration algorithms for deterministic MDPs Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence, (311-318)
  54. Shelton C Reinforcement learning with partially known world dynamics Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence, (461-468)
  55. Mansour Y Reinforcement learning and mistake bounded algorithms Proceedings of the twelfth annual conference on Computational learning theory, (183-192)
  56. Mansour Y and Singh S On the complexity of policy iteration Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, (401-408)
  57. McAllester D and Singh S Approximate planning for factored POMDPs using belief state simplification Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, (409-416)
  58. Szepesvári C and Littman M (1999). A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms, Neural Computation, 11:8, (2017-2060), Online publication date: 1-Nov-1999.
  59. Pollack J and Blair A (1998). Co-Evolution in the Successful Learning of Backgammon Strategy, Machine Learning, 32:3, (225-240), Online publication date: 1-Sep-1998.
  60. Kalmár Z, Szepesvári C and Lőrincz A (1998). Module-Based Reinforcement Learning, Machine Learning, 31:1-3, (55-85), Online publication date: 1-Apr-1998.
  61. Crites R and Barto A (1998). Elevator Group Control Using Multiple Reinforcement Learning Agents, Machine Learning, 33:2-3, (235-262), Online publication date: 1-Dec-1998.
  62. Sheppard J (1998). Colearning in Differential Games, Machine Learning, 33:2-3, (201-233), Online publication date: 1-Dec-1998.
  63. Kalmár Z, Szepesvári C and Lőrincz A (1998). Module-Based Reinforcement Learning, Autonomous Robots, 5:3-4, (273-295), Online publication date: 1-Jul-1998.
Contributors
  • Brown University
