openrl.envs.vec_env.wrappers package¶
Submodules¶
openrl.envs.vec_env.wrappers.base_wrapper module¶
- class openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.base_venv.BaseVecEnv,abc.ABCWraps the vectorized environment to allow a modular transformation.
This class is the base class for all wrappers for vectorized environments. The subclass could override some methods to change the behavior of the original vectorized environment without touching the original code.
- Note:
Don't forget to call
super().__init__(env)if the subclass overrides__init__().
- property action_space: Union[gymnasium.spaces.space.Space[gymnasium.core.ActType], gymnasium.spaces.space.Space[gymnasium.core.WrapperActType]]¶
Return the
Envaction_spaceunless overwritten then the wrapperaction_spaceis used.
- property agent_num¶
- env_is_wrapped(wrapper_class: Type[openrl.envs.wrappers.base_wrapper.BaseWrapper], indices: Union[None, int, Iterable[int]] = None) List[bool][源代码]¶
Check if worker environments are wrapped with a given wrapper
- property env_name¶
- property np_random: numpy.random._generator.Generator¶
Returns the environment's internal
_np_randomthat if not set will initialise with a random seed.- Returns:
Instances of np.random.Generator
- property observation_space: Union[gymnasium.spaces.space.Space[gymnasium.core.ObsType], gymnasium.spaces.space.Space[gymnasium.core.WrapperObsType]]¶
Return the
Envobservation_spaceunless overwritten then the wrapperobservation_spaceis used.
- property parallel_env_num: int¶
- property render_mode: Optional[str]¶
Returns the
Envrender_mode.
- property reward_range: Tuple[SupportsFloat, SupportsFloat]¶
Return the
Envreward_rangeunless overwritten then the wrapperreward_rangeis used.
- set_attr(name, values)[源代码]¶
Set a property in each sub-environment.
- Args:
name (str): Name of the property to be set in each individual environment. values (list, tuple, or object): Values of the property to be set to. If values is a list or
tuple, then it corresponds to the values for each individual environment, otherwise a single value is set for all environments.
- property unwrapped¶
- property use_monitor¶
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorActionWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the actions. Equivalent of
ActionWrapperfor vectorized environments.
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorObservationWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the observation. Equivalent to
gym.ObservationWrapperfor vectorized environments.- observation(observation: gymnasium.core.ObsType) gymnasium.core.ObsType[源代码]¶
Defines the observation transformation.
- Args:
observation (object): the observation from the environment
- Returns:
observation (object): the transformed observation
- reset(**kwargs)[源代码]¶
Modifies the observation returned from the environment
resetusing theobservation().
- step(actions, *args, **kwargs)[源代码]¶
Modifies the observation returned from the environment
stepusing theobservation().
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorRewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the reward. Equivalent of
RewardWrapperfor vectorized environments.
openrl.envs.vec_env.wrappers.gen_data module¶
- class openrl.envs.vec_env.wrappers.gen_data.GenDataWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv, data_save_path: str, total_episode: int)[源代码]¶
- class openrl.envs.vec_env.wrappers.gen_data.GenDataWrapper_v1(env: openrl.envs.vec_env.base_venv.BaseVecEnv, data_save_path: str, total_episode: int)[源代码]¶
openrl.envs.vec_env.wrappers.reward_wrapper module¶
- class openrl.envs.vec_env.wrappers.reward_wrapper.RewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv, reward_class: openrl.rewards.base_reward.BaseReward)[源代码]¶
openrl.envs.vec_env.wrappers.vec_monitor_wrapper module¶
- class openrl.envs.vec_env.wrappers.vec_monitor_wrapper.VecMonitorWrapper(vec_info: openrl.envs.vec_env.vec_info.base_vec_info.BaseVecInfo, env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper- step(action: gymnasium.core.ActType, extra_data: Optional[Dict[str, Any]] = None)[源代码]¶
Step all environments.
- property use_monitor¶
openrl.envs.vec_env.wrappers.zero_reward_wrapper module¶
- class openrl.envs.vec_env.wrappers.zero_reward_wrapper.ZeroRewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[源代码]¶
基类:
openrl.envs.vec_env.wrappers.base_wrapper.VectorRewardWrapper