openrl.rewards package¶ Submodules¶ openrl.rewards.base_reward module¶ class openrl.rewards.base_reward.BaseReward(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶ 基类:object batch_rewards(buffer: Any) → Dict[str, Any][源代码]¶ step_reward(data: Dict[str, Any]) → Union[numpy.ndarray, List[Dict[str, Any]]][源代码]¶ openrl.rewards.gail_reward module¶ class openrl.rewards.gail_reward.GAILReward(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶ 基类:openrl.rewards.base_reward.BaseReward set_discriminator(cfg, discriminator: torch.nn.modules.module.Module)[源代码]¶ step_reward(data: Dict[str, Any]) → Tuple[numpy.ndarray, List[Dict[str, Any]]][源代码]¶ class openrl.rewards.gail_reward.RewardPredictor(cfg, discriminator: torch.nn.modules.module.Module)[源代码]¶ 基类:object openrl.rewards.nlp_reward module¶ Module contents¶ class openrl.rewards.RewardFactory[源代码]¶ 基类:object static auto_register(reward_class: Any)[源代码]¶ static get_reward_class(reward_class: Any, env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶ static register(reward_name, reward_class)[源代码]¶