Workflow
Policy and Value Networks
Trainers
Concepts and Structure
Environment Customization
Best Practices and Tutorials
Logging
Scaling the Training Process
maze.train.trainers.common.model_selection.model_selection_base.
ModelSelectionBase
Base class for model selection strategies.
update
Receives a new evaluation result from the model.
reward – mean evaluation reward