
Sensor information from wireless signals used for multi-robot PGO.

Our paper with Dimitri Bertsekas on distributed learning for POMDPs in a sequential repair setting has been accepted for publication in RAL 2020!

February 11, 2020

Reinforcement Learning for POMDP: Partitioned Rollout and Policy Iteration with Application to Autonomous Sequential Repair Problems

Sushmita Bhattacharya, Sahil Badyal, Thomas Wheeler, Stephanie Gil, Dimitri Bertsekas 


In this paper we consider infinite horizon discounted dynamic programming...
