Shortcuts

openrl.modules.utils package

Submodules

openrl.modules.utils.util module

openrl.modules.utils.util.get_grad_norm(it)[源代码]
openrl.modules.utils.util.huber_loss(e, d)[源代码]
openrl.modules.utils.util.mse_loss(e)[源代码]
openrl.modules.utils.util.update_linear_schedule(optimizer, epoch, total_num_epochs, initial_lr)[源代码]

Decreases the learning rate linearly

openrl.modules.utils.valuenorm module

class openrl.modules.utils.valuenorm.ValueNorm(input_shape, norm_axes=1, beta=0.99999, per_element_update=False, epsilon=1e-05, device=device(type='cpu'))[源代码]

基类:torch.nn.modules.module.Module

Normalize a vector of observations - across the first norm_axes dimensions

denormalize(input_vector)[源代码]

Transform normalized data back into original distribution

normalize(input_vector)[源代码]
reset_parameters()[源代码]
running_mean_var()[源代码]
update(input_vector)[源代码]

Module contents