Shortcuts

openrl.modules.utils package

Submodules

openrl.modules.utils.util module

openrl.modules.utils.util.get_grad_norm(it)[source]
openrl.modules.utils.util.huber_loss(e, d)[source]
openrl.modules.utils.util.mse_loss(e)[source]
openrl.modules.utils.util.update_linear_schedule(optimizer, epoch, total_num_epochs, initial_lr)[source]

Decreases the learning rate linearly

openrl.modules.utils.valuenorm module

class openrl.modules.utils.valuenorm.ValueNorm(input_shape, norm_axes=1, beta=0.99999, per_element_update=False, epsilon=1e-05, device=device(type='cpu'))[source]

Bases: torch.nn.modules.module.Module

Normalize a vector of observations - across the first norm_axes dimensions

denormalize(input_vector)[source]

Transform normalized data back into original distribution

normalize(input_vector)[source]
reset_parameters()[source]
running_mean_var()[source]
update(input_vector)[source]

Module contents