Search

get started

Installation
Features
Efficiency
Usage Video
Environments Customization
Supported Algorithms
Experiment Grid

benchmark

On-Policy Algorithms
Off-Policy Algorithms
Offline Algorithms
Model-based Algorithms
Case Study

mathematical theory

Mathematical Notations
Vector and Matrix
Lagrange Duality

base rl algorithms

Trust Region Policy Optimization
Proximal Policy Optimization

safe rl algorithms

Constrained Policy Optimization
Projection-Based Constrained Policy Optimization
First Order Constrained Optimization in Policy Space
Lagrangian Methods

base rl algorithms api

Base On-policy Algorithms
Base Off-policy Algorithms
Base Model-based Algorithms

safe rl algorithms api

First Order Algorithms
Second Order Algorithms
Lagrange Algorithms
Penalty Function Algorithms
Model-based Algorithms

common api

OmniSafe Buffer
OmniSafe Experiment Grid
OmniSafe Lagrange Multiplier
OmniSafe Normalizer
OmniSafe Logger
OmniSafe Simmer Agent
OmniSafe Statistics Tools
OmniSafe Offline Data

utils api

OmniSafe Config
OmniSafe Distributed
OmniSafe Math
OmniSafe Model Utils
OmniSafe Tools
OmniSafe Plotter

models api

OmniSafe Actor
OmniSafe Critic
OmniSafe Actor Critic
OmniSafe Model-based Model
OmniSafe Model-based Planner
OmniSafe Offline Model

envs api

OmniSafe Core Environment
OmniSafe Customization Interface of Environments
OmniSafe Wrapper
Safety Gymnasium Environment
Mujoco Environment
OmniSafe Adapter

Copyright © 2022, OmniSafe Team