openrl.modules.utils package¶ Submodules¶ openrl.modules.utils.util module¶ openrl.modules.utils.util.get_grad_norm(it)[source]¶ openrl.modules.utils.util.huber_loss(e, d)[source]¶ openrl.modules.utils.util.mse_loss(e)[source]¶ openrl.modules.utils.util.update_linear_schedule(optimizer, epoch, total_num_epochs, initial_lr)[source]¶ Decreases the learning rate linearly openrl.modules.utils.valuenorm module¶ class openrl.modules.utils.valuenorm.ValueNorm(input_shape, norm_axes=1, beta=0.99999, per_element_update=False, epsilon=1e-05, device=device(type='cpu'))[source]¶ Bases: torch.nn.modules.module.Module Normalize a vector of observations - across the first norm_axes dimensions denormalize(input_vector)[source]¶ Transform normalized data back into original distribution normalize(input_vector)[source]¶ reset_parameters()[source]¶ running_mean_var()[source]¶ update(input_vector)[source]¶ Module contents¶