Institut de Robòtica i Informàtica Industrial

DiscretizeActionModel

PURPOSE ^

Discretizes the action space CS_DO_CA_POMDP.

SYNOPSIS ^

function DA_POMDP=DiscretizeActionModel(CA_POMDP,optimalAction,iteration)

DESCRIPTION ^

 Discretizes the action space CS_DO_CA_POMDP.

   Discretizes the action space of a CS_DO_CA_POMDP and returns a 
   CS_DO_DA_POMDP.
   The discretization affects the action space, the action model,
   and reward model.
   This function is used in Perseus (and iPerseus) in order to define a
   POMDP with a discre action set so that the normal backup can be executed
   without any issue. The discretization is 'belief' based (different for
   each belief and it is executed just before the backup). 
   The parameter 'optimalAction' is the optimal action for the belief in the
   previous Perseus iteration. In the current implementation, this action
   is added to the discrete set of actions.
   Parameter 'iteration' is the current Perseus (or iPerseus) iteration. 
   Right now the discretization is done sampling uniformly, but could also
   progressively focus toward the optimal action as 'iteration' increases.

   Parameters:
     CA_POMDP: The CS_DO_CA input POMDP.
     optimalAction: The previous optimal action .
     iteration: Current iteration (not used in this implementation).

   See also Perseus, iPerseus.

CROSS-REFERENCE INFORMATION ^

This function calls: This function is called by:

SOURCE CODE ^

0001 function DA_POMDP=DiscretizeActionModel(CA_POMDP,optimalAction,iteration)
0002 % Discretizes the action space CS_DO_CA_POMDP.
0003 %
0004 %   Discretizes the action space of a CS_DO_CA_POMDP and returns a
0005 %   CS_DO_DA_POMDP.
0006 %   The discretization affects the action space, the action model,
0007 %   and reward model.
0008 %   This function is used in Perseus (and iPerseus) in order to define a
0009 %   POMDP with a discre action set so that the normal backup can be executed
0010 %   without any issue. The discretization is 'belief' based (different for
0011 %   each belief and it is executed just before the backup).
0012 %   The parameter 'optimalAction' is the optimal action for the belief in the
0013 %   previous Perseus iteration. In the current implementation, this action
0014 %   is added to the discrete set of actions.
0015 %   Parameter 'iteration' is the current Perseus (or iPerseus) iteration.
0016 %   Right now the discretization is done sampling uniformly, but could also
0017 %   progressively focus toward the optimal action as 'iteration' increases.
0018 %
0019 %   Parameters:
0020 %     CA_POMDP: The CS_DO_CA input POMDP.
0021 %     optimalAction: The previous optimal action .
0022 %     iteration: Current iteration (not used in this implementation).
0023 %
0024 %   See also Perseus, iPerseus.
0025 
0026   S=get(CA_POMDP,'StateSpace');
0027   A=Discretize(CA_POMDP.A,CA_POMDP.nSampledActions-1,{optimalAction});
0028   O=get(CA_POMDP,'ObsSpace');
0029   
0030   AM=DiscretizeActionModel(CA_POMDP.CS_CA_ActionModel,A);
0031   OM=get(CA_POMDP,'ObsModel');
0032   RM=DiscretizeRewardModel(CA_POMDP.CS_CA_RewardModel,A);
0033   
0034   name=get(CA_POMDP,'name');
0035   gamma=get(CA_POMDP,'gamma');
0036   ncAlpha=get(CA_POMDP,'maxAlphaC');
0037   
0038   DA_POMDP=CS_DO_DA_POMDP(name,S,A,O,AM,OM,RM,gamma,ncAlpha);
0039


Institut de Robòtica i Informàtica Industrial

Generated on Wed 05-Aug-2009 15:05:21 by m2html © 2003