RL4LMs Logo
latest

Getting Started

  • Installation
  • Quick Start - Train PPO/NLPO using pre-defined YAML configs
  • Custom Building Blocks

Module Guide

  • rl4lms.algorithms package
  • rl4lms.envs package
  • rl4lms.core_components package
  • rl4lms.data_pools package
RL4LMs
  • Overview: module code

All modules for which code is available

  • rl4lms.algorithms.common.algo_utils
  • rl4lms.algorithms.common.maskable.buffers
  • rl4lms.algorithms.common.maskable.utils
  • rl4lms.core_components.sampler
  • rl4lms.core_components.sweep
  • rl4lms.data_pools.custom_text_generation_pools
  • rl4lms.data_pools.text_generation_pool
  • rl4lms.envs.common.action_space
  • rl4lms.envs.common.observation
  • rl4lms.envs.common.reward
  • rl4lms.envs.text_generation.caption_metrics.cider
  • rl4lms.envs.text_generation.caption_metrics.spice.spice
  • rl4lms.envs.text_generation.kl_controllers
  • rl4lms.envs.text_generation.observation
  • rl4lms.envs.text_generation.post_processors
  • rl4lms.envs.text_generation.summ_metrics.summa_c
  • rl4lms.envs.text_generation.test_datapool

© Copyright 2023, RL4LMs Team. Revision 56f367e7.

Built with Sphinx using a theme provided by Read the Docs.