Workflow
Policy and Value Networks
Trainers
Concepts and Structure
Environment Customization
Best Practices and Tutorials
Logging
Scaling the Training Process
maze.core.wrappers.reward_clipping_wrapper.
RewardClippingWrapper
Clips original step reward to range [min, max].
env – The underlying environment.
min_val – Minimum allowed reward value.
max_val – Maximum allowed reward value.
reward
Clips the original reward.
reward – The original reward.
The clipped reward.