
Python Module Index

r
rl4lms
    rl4lms.algorithms
    rl4lms.algorithms.a2c
    rl4lms.algorithms.common
    rl4lms.algorithms.common.algo_utils
    rl4lms.algorithms.common.maskable
    rl4lms.algorithms.common.maskable.buffers
    rl4lms.algorithms.common.maskable.utils
    rl4lms.algorithms.ppo
    rl4lms.core_components
    rl4lms.core_components.sampler
    rl4lms.core_components.sweep
    rl4lms.data_pools
    rl4lms.data_pools.custom_text_generation_pools
    rl4lms.data_pools.text_generation_pool
    rl4lms.envs
    rl4lms.envs.common
    rl4lms.envs.common.action_space
    rl4lms.envs.common.observation
    rl4lms.envs.common.reward
    rl4lms.envs.text_generation
    rl4lms.envs.text_generation.caption_metrics
    rl4lms.envs.text_generation.caption_metrics.cider
    rl4lms.envs.text_generation.caption_metrics.spice
    rl4lms.envs.text_generation.caption_metrics.spice.spice
    rl4lms.envs.text_generation.kl_controllers
    rl4lms.envs.text_generation.observation
    rl4lms.envs.text_generation.policy
    rl4lms.envs.text_generation.post_processors
    rl4lms.envs.text_generation.summ_metrics
    rl4lms.envs.text_generation.summ_metrics.summa_c
    rl4lms.envs.text_generation.test_datapool

© Copyright 2023, RL4LMs Team. Revision 56f367e7.

Built with Sphinx using a theme provided by Read the Docs.