BetaProbabilityDistribution

class maze.distributions.beta.BetaProbabilityDistribution(*args, **kwds)

Beta Torch probability distribution.

Parameters
  • logits – the logits for both mean and standard deviation.

  • action_space – the underlying gym.spaces action space.

deterministic_sample() → torch.Tensor

implementation of TorchProbabilityDistribution interface

entropy() → torch.Tensor

implementation of TorchProbabilityDistribution interface

kl(other: maze.distributions.torch_dist.TorchProbabilityDistribution) → torch.Tensor

implementation of TorchProbabilityDistribution interface

log_prob(actions: torch.Tensor) → torch.Tensor

implementation of TorchProbabilityDistribution interface

classmethod required_logits_shape(action_space: gym.spaces.Space) → Sequence[int]

implementation of TorchProbabilityDistribution interface

sample() → torch.Tensor

implementation of TorchProbabilityDistribution interface