Contents Menu Expand Light mode Dark mode Auto light/dark, in light mode Auto light/dark, in dark mode Skip to content
Logo

get started

  • Installation
  • Features
  • Efficiency
  • Usage Video
  • Environments Customization
  • Supported Algorithms
  • Experiment Grid

benchmark

  • On-Policy Algorithms
  • Off-Policy Algorithms
  • Offline Algorithms
  • Model-based Algorithms
  • Case Study

mathematical theory

  • Mathematical Notations
  • Vector and Martrix
  • Lagrange Duality

base rl algorithms

  • Trust Region Policy Optimization
  • Proximal Policy Optimization

safe rl algorithms

  • Constrained Policy Optimization
  • Projection-Based Constrained Policy Optimization
  • First Order Constrained Optimization in Policy Space
  • Lagrangian Methods

base rl algorithms api

  • Base On-policy Algorithms
  • Base Off-policy Algorithms
  • Base Model-based Algorithms

safe rl algorithms api

  • First Order Algorithms
  • Second Order Algorithms
  • Lagrange Algorithms
  • Penalty Function Algorithms
  • Model-based Algorithms

common api

  • OmniSafe Buffer
  • OmniSafe Experiment Grid
  • OmniSafe Lagrange Multiplier
  • OmniSafe Normalizer
  • OmniSafe Logger
  • OmniSafe Simmer Agent
  • OmniSafe Statistics Tools
  • OmniSafe Offline Data

utils api

  • OmniSafe Config
  • OmniSafe Distributed
  • OmniSafe Math
  • OmniSafe Model Utils
  • OmniSafe Tools
  • OmniSafe Plotter

models api

  • OmniSafe Actor
  • OmniSafe Critic
  • OmniSafe Actor Critic
  • OmniSafe Model-based Model
  • OmniSafe Model-based Planner
  • OmniSafe Offline Model

envs api

  • OmniSafe Core Environment
  • OmniSafe Customization Interface of Environments
  • OmniSafe Wrapper
  • Safety Gymnasium Environment
  • Mujoco Environment
  • OmniSafe Adapter
Back to top
Copyright © 2022, OmniSafe Team
Made with Sphinx and @pradyunsg's Furo