ComputeAlpha_a
PURPOSE 
Compute the alpha_i_n-element for the given action and belief.
SYNOPSIS 
function Element_a=ComputeAlpha_a(P,V,b,a,Alphas_j_a_o)
DESCRIPTION 
CROSS-REFERENCE INFORMATION 
This function calls:
- size Returns the size of a policy.
- Expectation Expectation between a belief and a alpha-element.
- Expectation Expectation between a belief and a alpha-element.
- get Get for GBeliefs.
- Expectation Expectation between a belief and a alpha-element.
- GMixture Gaussian mixture constructor.
- get Get function for the GMixture object.
- get Gaussian object get function.
- get Get function for CS_CO_CA_POMDPs.
- get Get function for CS_CO_DA_POMDPs.
- get Get function for CS_CO_POMDPs.
- get Get function for CS_DO_CA_POMDPs.
- get Get function for CS_DO_DA_POMDPs.
- ComputeAlpha_j_a_o Computes a particular alpha-element.
- get Get function for CS_POMDPs.
- get Get function for DS_CO_CA_POMDPs.
- get Get function for DS_CO_DA_POMDPs.
- get Get function for DS_DO_CA_POMDPs.
- get Get function for DS_DO_DA_POMDPs.
- ComputeAlpha_j_a_o Computes a particular alpha-element.
- get Get functio for POMDPs.
- GetRewardModelFixedA Defines the reward function for a given action.
- GetRewardModelFixedA Defines the reward function for a given action.
- GetRewardModelFixedA Defines the reward function for a given action.
- GetRewardModelFixedA Defines the reward function for a given action.
- dim Dimensionality of a continuous space.
- max Upper bound of a CSpace
- dim Dimensionality of a discrete space.
This function is called by:
- Backup Backup for a given belief (continuous state version).
- Backup Backupt for a given belief (discrete state version).
SOURCE CODE 
0001 function Element_a=ComputeAlpha_a(P,V,b,a,Alphas_j_a_o)
0002
0003
0004
0005
0006
0007
0008
0009
0010
0011
0012
0013
0014
0015
0016
0017 rj=num2cell(1:size(V));
0018 O=get(P,'ObsSpace');
0019 no=dim(O);
0020 gamma=get(P,'gamma');
0021 noP=isempty(Alphas_j_a_o);
0022 Element_ao=GMixture;
0023 for o=1:no
0024 if noP
0025 Alphas_j=cellfun(@(j)(ComputeAlpha_j_a_o(P,V,j,a,o)),rj,'UniformOutput',false);
0026 else
0027 Alphas_j=Alphas_j_a_o(:,a,o);
0028 end
0029 [v_a_o nAlpha_j]=max(cellfun(@(g)(Expectation(b,g)),Alphas_j));
0030 Element_ao=Element_ao+Alphas_j{nAlpha_j};
0031 end
0032 Element_a=GetRewardModelFixedA(P,a)+gamma*Element_ao;
0033
0034
0035
|