openrl.buffers package¶
Subpackages¶
Submodules¶
openrl.buffers.normal_buffer module¶
- class openrl.buffers.normal_buffer.NormalReplayBuffer(cfg, num_agents, obs_space, act_space, data_client, episode_length=None)[source]¶
Bases:
object- feed_forward_critic_obs_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶
- feed_forward_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶
openrl.buffers.offpolicy_buffer module¶
openrl.buffers.offpolicy_replay_data module¶
- class openrl.buffers.offpolicy_replay_data.OffPolicyReplayData(cfg, num_agents, obs_space, act_space, data_client=None, episode_length=None)[source]¶
openrl.buffers.replay_data module¶
- class openrl.buffers.replay_data.ReplayData(cfg, num_agents, obs_space, act_space, data_client=None, episode_length=None)[source]¶
Bases:
object- feed_forward_critic_obs_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶
- feed_forward_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶
- feed_forward_generator_transformer(advantages, num_mini_batch=None, mini_batch_size=None)[source]¶
Yield training data for MLP policies. :param advantages: (np.ndarray) advantage estimates. :param num_mini_batch: (int) number of minibatches to split the batch into. :param mini_batch_size: (int) number of samples in each minibatch.
Module contents¶
- class openrl.buffers.NormalReplayBuffer(cfg, num_agents, obs_space, act_space, data_client, episode_length=None)[source]¶
Bases:
object- feed_forward_critic_obs_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶
- feed_forward_generator(advantages, num_mini_batch=None, mini_batch_size=None, critic_obs_process_func=None)[source]¶