ProbabilisticPolicyComposer

class maze.perception.models.policies.probabilistic_policy_composer.ProbabilisticPolicyComposer(action_spaces_dict: Dict[Union[str, int], gym.spaces.Dict], observation_spaces_dict: Dict[Union[str, int], gym.spaces.Dict], distribution_mapper: maze.distributions.distribution_mapper.DistributionMapper, networks: Union[List[Union[None, str, Mapping[str, Any], Any]], Mapping[str, Union[None, str, Mapping[str, Any], Any]]])

Composes networks for probabilistic policies.

Parameters
  • action_spaces_dict – Dictionary mapping each sub-step ID to its action space.

  • observation_spaces_dict – Dictionary mapping each sub-step ID to its observation space.

  • distribution_mapper – The distribution mapper.

  • networks – Policy networks as defined in the config, given either as a list or as a dictionary; each entry specifies an object's type and its parameters.
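
The two accepted shapes of the networks argument can be illustrated with a small, library-independent sketch. The helper normalize_networks below is hypothetical and not part of Maze; it assumes that list entries are matched to integer sub-step IDs by position. The "_target_" key and the class path my_project.models.PolicyNet are likewise placeholders for whatever the config actually contains.

```python
from collections.abc import Mapping as AbcMapping
from typing import Any, Dict, List, Mapping, Union

# One network specification: None, a type string, a config mapping, or an object.
NetworkSpec = Union[None, str, Mapping[str, Any], Any]


def normalize_networks(
    networks: Union[List[NetworkSpec], Mapping[Union[str, int], NetworkSpec]]
) -> Dict[Union[str, int], NetworkSpec]:
    """Hypothetical helper: return the specs as a dict keyed by sub-step ID.

    Assumption: a list is interpreted positionally, so the entry at index i
    belongs to sub-step ID i. A mapping is copied unchanged.
    """
    if isinstance(networks, AbcMapping):
        return dict(networks)
    return {step_id: spec for step_id, spec in enumerate(networks)}


# Placeholder spec; the keys and the class path are illustrative only.
policy_spec = {"_target_": "my_project.models.PolicyNet", "hidden_units": [64, 64]}

as_list = [policy_spec]     # list form: one entry per sub-step, in order
as_dict = {0: policy_spec}  # dictionary form: explicit sub-step IDs

# Under the positional assumption, both forms describe the same setup.
assert normalize_networks(as_list) == normalize_networks(as_dict)
```

The dictionary form is the more explicit choice when sub-step IDs are strings or are not contiguous integers, since a positional list cannot express those keys.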

property policy

Implementation of the BasePolicyComposer interface.