Reinforcement Learning for POMDP: Rollout and Policy Iteration with Application to Sequential Repair

Citation:

Dimitri P. Bertsekas, Stephanie Gil, Sushmita Bhattacharya, and Thomas Wheeler. 5/1/2019. “Reinforcement Learning for POMDP: Rollout and Policy Iteration with Application to Sequential Repair”.

Abstract:

We study rollout algorithms that combine limited lookahead with terminal cost function approximation for POMDPs. We demonstrate their effectiveness on a sequential pipeline repair problem, a type of problem that also arises in search and rescue. We provide performance bounds and empirical validation of the methodology, both for a single rollout iteration and for multiple iterations with intermediate policy space approximations.
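
To make the idea of rollout concrete, here is a minimal sketch in Python. It is not the paper's pipeline model or algorithm. It assumes a hypothetical toy problem in which sites lie on a line, the belief is a per-site probability of damage, visiting a site repairs it, and each unit of travel time costs the sum of the remaining damage probabilities. Under these assumptions the belief evolves deterministically, so no sampling is needed, and the sketch simulates the base policy to termination rather than truncating with a terminal cost approximation. All names (`step`, `base_policy`, `rollout_policy`, and so on) are illustrative.

```python
from typing import Callable, List, Tuple

# Hypothetical toy state: (agent position, per-site probability of damage).
State = Tuple[int, Tuple[float, ...]]
Policy = Callable[[State], int]


def actions(state: State) -> List[int]:
    """Sites that may still be damaged; an empty list means the task is done."""
    _, belief = state
    return [j for j, p in enumerate(belief) if p > 0.0]


def step(state: State, site: int) -> Tuple[float, State]:
    """Travel to `site` and repair it.

    Assumed cost model: each unit of travel time costs the expected number
    of sites still damaged, i.e. the sum of the current belief.
    """
    pos, belief = state
    cost = abs(pos - site) * sum(belief)
    new_belief = tuple(0.0 if j == site else p for j, p in enumerate(belief))
    return cost, (site, new_belief)


def base_policy(state: State) -> int:
    """Greedy heuristic: nearest possibly damaged site, lowest index on ties."""
    pos, _ = state
    return min(actions(state), key=lambda j: (abs(pos - j), j))


def simulate(state: State, policy: Policy) -> float:
    """Total cost of following `policy` from `state` until no site remains."""
    total = 0.0
    while actions(state):
        cost, state = step(state, policy(state))
        total += cost
    return total


def rollout_policy(state: State) -> int:
    """One-step lookahead: try each first action, then follow the base policy."""
    def q_value(site: int) -> float:
        cost, nxt = step(state, site)
        return cost + simulate(nxt, base_policy)
    return min(actions(state), key=q_value)


start: State = (2, (0.1, 0.0, 0.0, 0.0, 0.9))
base_cost = simulate(start, base_policy)
rollout_cost = simulate(start, rollout_policy)
print(f"base policy cost: {base_cost:.2f}, rollout cost: {rollout_cost:.2f}")
```

On this instance the greedy base policy breaks the distance tie toward the unlikely site and incurs a cost of 5.6, while the rollout policy heads for the likely damaged site first and incurs 2.4. In a genuinely partially observed problem, the simulation step would instead average over sampled observations and could be truncated with a terminal cost estimate.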

Last updated on 07/22/2020

Selected as a 2020 Sloan Research Fellow

The full 2020 fellows list

Recipient of the NSF CAREER Award 2019!

NSF CAREER: “Multi-Agent Decision Making and Optimization using Communication as a Sensor” (official ASU announcement)

Recent Invited Talks

Information Theory Forum at Stanford University, July 2019
Control and Robotics Seminar Series at UC Berkeley, July 2019
Robotics Lunch Colloquium at Stanford University, June 2019
Learning for Decision and Control (poster) at MIT, May 2019
Robotics Colloquium at the University of Washington, April 2019
Blockchain for Robotics at the MIT Media Lab, December 2018
Presentation of our new L-CSS paper “Resilient Multi-Agent Consensus using Wi-Fi Signals” at CDC, December 2018
CCIS Colloquium at Northeastern University, November 2018

Security for multi-robot systems

Read about our research on security for multi-robot systems in MIT News.

ASU Drone Studio Unveiled

As one of the vision leads for this new testbed, I am proud to say that ASU now has one of the largest drone research testbeds in academia! See the official announcement for more information.

Program Chair SWRS 2019

I served as the Program Chair for the Southwest Robotics Symposium 2019, which featured over 600 registrants, invited speakers from 13 universities, 5 topic areas in robotics, and plenary talks by Oussama Khatib and Ruzena Bajcsy!

FURI Research

Congratulations to our REACT Lab members Paul Vohs, Thomas Wheeler, and Maxwell Flanagan whose research proposals were selected for FURI 2018/2019!
