DictProbabilityDistribution¶
-
class
maze.distributions.dict.DictProbabilityDistribution(distribution_dict: Dict[str, maze.distributions.distribution.ProbabilityDistribution])¶ Dictionary probability distribution.
- The respective functions either return
the per key distribution properties or
aggregate the properties across the sub-distributions using a reduce_fun such as mean or sum.
- Parameters
distribution_dict – dictionary holding sub-probability distributions.
-
deterministic_sample() → Dict[str, torch.Tensor]¶ implementation of
TorchProbabilityDistributioninterface
-
entropy(reduce_fun: callable = torch.mean) → torch.Tensor¶ implementation of
TorchProbabilityDistributioninterface
-
kl(other: maze.distributions.dict.DictProbabilityDistribution, reduce_fun: callable = torch.mean) → torch.Tensor¶ implementation of
TorchProbabilityDistributioninterface
-
log_prob(actions: Dict[str, torch.Tensor]) → Dict[str, torch.Tensor]¶ implementation of
TorchProbabilityDistributioninterface
-
neg_log_prob(actions: Dict[str, torch.Tensor]) → Dict[str, torch.Tensor]¶ implementation of
TorchProbabilityDistributioninterface
-
sample() → Dict[str, torch.Tensor]¶ implementation of
TorchProbabilityDistributioninterface