openrl.modules package¶
Subpackages¶
- openrl.modules.common package
- openrl.modules.networks package
- Subpackages
- openrl.modules.networks.utils package
- Subpackages
- Submodules
- openrl.modules.networks.utils.act module
- openrl.modules.networks.utils.attention module
- openrl.modules.networks.utils.cnn module
- openrl.modules.networks.utils.distributed_utils module
- openrl.modules.networks.utils.distributions module
- openrl.modules.networks.utils.mix module
- openrl.modules.networks.utils.mlp module
- openrl.modules.networks.utils.popart module
- openrl.modules.networks.utils.rnn module
- openrl.modules.networks.utils.transformer_act module
- openrl.modules.networks.utils.util module
- Module contents
- openrl.modules.networks.utils package
- Submodules
- openrl.modules.networks.base_policy_network module
- openrl.modules.networks.base_value_network module
- openrl.modules.networks.policy_network module
- openrl.modules.networks.policy_value_network module
- openrl.modules.networks.policy_value_network_gpt module
- openrl.modules.networks.value_network module
- Module contents
- Subpackages
- openrl.modules.utils package
Submodules¶
openrl.modules.base_module module¶
openrl.modules.model_config module¶
openrl.modules.ppo_module module¶
- class openrl.modules.ppo_module.PPOModule(cfg, policy_input_space: gym.spaces.box.Box, critic_input_space: gym.spaces.box.Box, act_space: gym.spaces.box.Box, share_model: bool = False, device: Union[str, torch.device] = 'cpu', rank: Optional[int] = None, world_size: Optional[int] = None, model_dict: Optional[Dict[str, Any]] = None)[源代码]¶
基类:
openrl.modules.rl_module.RLModule- evaluate_actions(critic_obs, obs, rnn_states_actor, rnn_states_critic, action, masks, available_actions=None, active_masks=None, critic_masks_batch=None)[源代码]¶
openrl.modules.rl_module module¶
- class openrl.modules.rl_module.RLModule(cfg, model_configs: Dict[str, openrl.modules.model_config.ModelTrainConfig], act_space: gym.spaces.box.Box, rank: int = 0, world_size: int = 1, device: Union[str, torch.device] = 'cpu')[源代码]¶
基类:
openrl.modules.base_module.BaseModule- abstract evaluate_actions(critic_obs, obs, rnn_states_actor, rnn_states_critic, action, masks, available_actions=None, active_masks=None, critic_masks_batch=None)[源代码]¶