Skip to content
Change the repository type filter

All

    Repositories list

    • air-dream-website

      Public template
      🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
      TeX
      MIT License
      6.3k400Updated Oct 22, 2024Oct 22, 2024
    • .github

      Public
      0000Updated Jun 4, 2024Jun 4, 2024
    • IVM

      Public
      The offical Implementation of "Instruction-Guided Visual Masking"
      Jupyter Notebook
      Apache License 2.0
      2000Updated Jun 3, 2024Jun 3, 2024
    • QPA

      Public
      Python
      MIT License
      2000Updated Apr 8, 2024Apr 8, 2024
    • [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
      Python
      MIT License
      1000Updated Mar 19, 2024Mar 19, 2024
    • official implementation of ODICE
      Python
      1000Updated Jan 31, 2024Jan 31, 2024
    • FISOR

      Public
      [ICLR 2024] The official implementation of "Feasibility-Guided Safe Offline Reinforcement Learning"
      Python
      4000Updated Jan 21, 2024Jan 21, 2024
    • OMIGA

      Public
      The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization" (NeurIPS 2023)
      Python
      3200Updated Jan 21, 2024Jan 21, 2024
    • TSRL

      Public
      Python
      MIT License
      1100Updated Oct 18, 2023Oct 18, 2023
    • D2C

      Public
      D2C(Data-driven Control Library) is a library for data-driven control based on reinforcement learning.
      Python
      MIT License
      12000Updated Oct 18, 2023Oct 18, 2023
    • openchat

      Public
      OpenChat: Advancing Open-source Language Models with Imperfect Data
      Jupyter Notebook
      Apache License 2.0
      400000Updated Oct 13, 2023Oct 13, 2023
    • d4rl

      Public
      A benchmark for offline reinforcement learning.
      Python
      Apache License 2.0
      281000Updated Sep 28, 2023Sep 28, 2023
    • H2O

      Public
      [NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
      Python
      4100Updated Sep 24, 2023Sep 24, 2023
    • onerl

      Public
      One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework
      Python
      4000Updated Sep 1, 2023Sep 1, 2023
    • IVR

      Public
      Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
      Python
      MIT License
      7000Updated Jul 27, 2023Jul 27, 2023
    • PROTO

      Public
      Python
      1000Updated May 25, 2023May 25, 2023
    • POR

      Public
      Author's implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
      Python
      MIT License
      6000Updated Apr 6, 2023Apr 6, 2023
    • DOGE

      Public
      The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
      Python
      2000Updated Mar 6, 2023Mar 6, 2023
    • RGM

      Public
      The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)
      Python
      2000Updated Mar 3, 2023Mar 3, 2023
    • CPQ

      Public
      Author's implementation of Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
      Python
      MIT License
      2000Updated Jan 26, 2023Jan 26, 2023
    • DWBC

      Public
      Author's implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"
      Python
      MIT License
      2000Updated Jan 5, 2023Jan 5, 2023
    • Author's implementation of "DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning"
      Python
      2000Updated Jul 21, 2022Jul 21, 2022
    • MOPP

      Public
      Official codebase of "Model-Based Offline Planning with Trajectory Pruning (MOPP)"
      Python
      3000Updated Dec 21, 2021Dec 21, 2021