PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.
-
Updated
Aug 14, 2025 - Python
PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research papers.
A C# Unity mod connected through a named pipe with Python for training a Reinforcement Learning agent to fight Hollow Knight Hornet Protector
End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.
A Complete Collection of Deep RL Famous Algorithms implemented in Gymnasium most Popular environments
🚦 Next-generation AI Traffic Management System with real-time computer vision, reinforcement learning optimization, emergency vehicle detection, and immersive 3D visualization
This repository is dedicated to implementing algorithms "From Scratch". It goes beyond simple API calls, diving deep into the underlying logic of everything from basic training to cutting-edge techniques like DeepSeek-R1.
A legged locomotion project
This is a project for PPO S&P 500 trading
2D orbital rocket sim with PPO in PyTorch. Models thrust, drag, gravity, fuel; agent learns efficient ascent. Includes telemetry & visualization
This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!
Autonomous driving system using PPO-based Reinforcement Learning and CARLA Simulator for lane following and navigation.
Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.
Smart Pricing for NANCY using Multi-Agent Reinforcement Learning and Reverse Auction Theory. Smart_Pricing_MARL_NANCY is an open-source, EU-co-funded Smart Pricing Module (SPM) developed for the NANCY project. It leverages Multi-Agent Reinforcement Learning (MARL) and Reverse Auction Theory to calculate optimal pricing strategies.
Autonomous Microgrid Balancer using PPO RL with Adversarial Training for resilience under High-Impact Low-Probability (HILP) disturbances.
Multi-modal RL trading agent (CNN + PPO) integrating market prices, macroeconomic indicators, and news signals . MSc dissertation artefact.
Stable Baselines3
This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.
AI agents for Trackmania using the TMRL package. Implemented DDPG, PPO, and used two SAC algorithms (with one or two critics) to train cars to navigate custom-built tracks.
End-to-end safe robot navigation in crowds using ROS2. Chains LiDAR-based pedestrian detection (DR-SPAAM), SORT tracking, Gumbel Social Transformer trajectory prediction with conformal uncertainty bounds, and a PPO-Lagrangian constrained RL policy for safety-aware velocity control.
Add a description, image, and links to the ppo-algorithm topic page so that developers can more easily learn about it.
To associate your repository with the ppo-algorithm topic, visit your repo's landing page and select "manage topics."