RL4LMs Logo
latest

Getting Started

  • Installation
  • Quick Start - Train PPO/NLPO using pre-defined YAML configs
  • Custom Building Blocks

Module Guide

  • rl4lms.algorithms package
    • Subpackages
      • rl4lms.algorithms.a2c package
      • rl4lms.algorithms.common package
      • rl4lms.algorithms.nlpo package
      • rl4lms.algorithms.ppo package
      • rl4lms.algorithms.trpo package
    • Module contents
  • rl4lms.envs package
  • rl4lms.core_components package
  • rl4lms.data_pools package
RL4LMs
  • rl4lms.algorithms package
  • Edit on GitHub

rl4lms.algorithms package

Subpackages

  • rl4lms.algorithms.a2c package
    • Submodules
    • rl4lms.algorithms.a2c.a2c module
    • Module contents
  • rl4lms.algorithms.common package
    • Subpackages
    • Submodules
    • rl4lms.algorithms.common.algo_utils module
    • Module contents
  • rl4lms.algorithms.nlpo package
    • Submodules
    • rl4lms.algorithms.nlpo.nlpo module
    • rl4lms.algorithms.nlpo.policies module
    • Module contents
  • rl4lms.algorithms.ppo package
    • Submodules
    • rl4lms.algorithms.ppo.ppo module
    • Module contents
  • rl4lms.algorithms.trpo package
    • Submodules
    • rl4lms.algorithms.trpo.policies module
    • rl4lms.algorithms.trpo.trpo module
    • Module contents

Module contents

Previous Next

© Copyright 2023, RL4LMs Team. Revision 56f367e7.

Built with Sphinx using a theme provided by Read the Docs.