openrl.runners.common package¶
Submodules¶
openrl.runners.common.base_agent module¶
openrl.runners.common.chat_agent module¶
- class openrl.runners.common.chat_agent.ChatAgent(model, tokenizer, device=None)[源代码]¶
openrl.runners.common.ppo_agent module¶
- class openrl.runners.common.ppo_agent.PPOAgent(net: Optional[torch.nn.modules.module.Module] = None, env: Union[gym.core.Env, str] = None, run_dir: Optional[str] = None, env_num: Optional[int] = None, rank: int = 0, world_size: int = 1, use_wandb: bool = False, use_tensorboard: bool = False)[源代码]¶
Module contents¶
- class openrl.runners.common.ChatAgent(model, tokenizer, device=None)[源代码]¶
- class openrl.runners.common.PPOAgent(net: Optional[torch.nn.modules.module.Module] = None, env: Union[gym.core.Env, str] = None, run_dir: Optional[str] = None, env_num: Optional[int] = None, rank: int = 0, world_size: int = 1, use_wandb: bool = False, use_tensorboard: bool = False)[源代码]¶