openrl.envs.vec_env.wrappers package¶
Submodules¶
openrl.envs.vec_env.wrappers.base_wrapper module¶
- class openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.base_venv.BaseVecEnv,abc.ABCWraps the vectorized environment to allow a modular transformation.
This class is the base class for all wrappers for vectorized environments. The subclass could override some methods to change the behavior of the original vectorized environment without touching the original code.
- Note:
Don’t forget to call
super().__init__(env)if the subclass overrides__init__().
- property action_space: Union[gymnasium.spaces.space.Space[gymnasium.core.ActType], gymnasium.spaces.space.Space[gymnasium.core.WrapperActType]]¶
Return the
Envaction_spaceunless overwritten then the wrapperaction_spaceis used.
- property agent_num¶
- call(name, *args, **kwargs)[source]¶
Call a method, or get a property, from each parallel environment.
- env_is_wrapped(wrapper_class: Type[openrl.envs.wrappers.base_wrapper.BaseWrapper], indices: Union[None, int, Iterable[int]] = None) List[bool][source]¶
Check if worker environments are wrapped with a given wrapper
- property env_name¶
- property np_random: numpy.random._generator.Generator¶
Returns the environment’s internal
_np_randomthat if not set will initialise with a random seed.- Returns:
Instances of np.random.Generator
- property observation_space: Union[gymnasium.spaces.space.Space[gymnasium.core.ObsType], gymnasium.spaces.space.Space[gymnasium.core.WrapperObsType]]¶
Return the
Envobservation_spaceunless overwritten then the wrapperobservation_spaceis used.
- property parallel_env_num: int¶
- property render_mode: Optional[str]¶
Returns the
Envrender_mode.
- property reward_range: Tuple[SupportsFloat, SupportsFloat]¶
Return the
Envreward_rangeunless overwritten then the wrapperreward_rangeis used.
- set_attr(name, values)[source]¶
Set a property in each sub-environment.
- Args:
name (str): Name of the property to be set in each individual environment. values (list, tuple, or object): Values of the property to be set to. If values is a list or
tuple, then it corresponds to the values for each individual environment, otherwise a single value is set for all environments.
- property unwrapped¶
- property use_monitor¶
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorActionWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the actions. Equivalent of
ActionWrapperfor vectorized environments.
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorObservationWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the observation. Equivalent to
gym.ObservationWrapperfor vectorized environments.- observation(observation: gymnasium.core.ObsType) gymnasium.core.ObsType[source]¶
Defines the observation transformation.
- Args:
observation (object): the observation from the environment
- Returns:
observation (object): the transformed observation
- reset(**kwargs)[source]¶
Modifies the observation returned from the environment
resetusing theobservation().
- step(actions, *args, **kwargs)[source]¶
Modifies the observation returned from the environment
stepusing theobservation().
- class openrl.envs.vec_env.wrappers.base_wrapper.VectorRewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapperWraps the vectorized environment to allow a modular transformation of the reward. Equivalent of
RewardWrapperfor vectorized environments.
openrl.envs.vec_env.wrappers.gen_data module¶
- class openrl.envs.vec_env.wrappers.gen_data.GenDataWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv, data_save_path: str, total_episode: int)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper
- class openrl.envs.vec_env.wrappers.gen_data.GenDataWrapper_v1(env: openrl.envs.vec_env.base_venv.BaseVecEnv, data_save_path: str, total_episode: int)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper
openrl.envs.vec_env.wrappers.reward_wrapper module¶
- class openrl.envs.vec_env.wrappers.reward_wrapper.RewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv, reward_class: openrl.rewards.base_reward.BaseReward)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper
openrl.envs.vec_env.wrappers.vec_monitor_wrapper module¶
- class openrl.envs.vec_env.wrappers.vec_monitor_wrapper.VecMonitorWrapper(vec_info: openrl.envs.vec_env.vec_info.base_vec_info.BaseVecInfo, env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VecEnvWrapper- step(action: gymnasium.core.ActType, extra_data: Optional[Dict[str, Any]] = None)[source]¶
Step all environments.
- property use_monitor¶
openrl.envs.vec_env.wrappers.zero_reward_wrapper module¶
- class openrl.envs.vec_env.wrappers.zero_reward_wrapper.ZeroRewardWrapper(env: openrl.envs.vec_env.base_venv.BaseVecEnv)[source]¶
Bases:
openrl.envs.vec_env.wrappers.base_wrapper.VectorRewardWrapper