VachanVY / Reinforcement-Learning

PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction" by Sutton and Barto, along with various RL research papers.

reinforcement-learning deep-reinforcement-learning pytorch artificial-intelligence dqn policy-gradient deep-deterministic-policy-gradient ddpg-algorithm proximal-policy-optimization actor-critic-algorithm dqn-pytorch rl-book sutton-barto-book policy-gradient-with-baseline actor-critic-pytorch soft-actor-critic-continuous ppo-algorithm reinforcement-learning-an-introduction

Updated Aug 14, 2025
Python

Vitao2 / Hollow-Knight-Neural-Network

A C# Unity mod that communicates with Python over a named pipe to train a reinforcement learning agent to fight Hornet Protector in Hollow Knight.

reinforcement-learning hollow-knight hollow-knight-mod ppo-algorithm

Updated Apr 9, 2026
Python

amin-sharifi-github / quant-rl-trading-agent

End-to-end RL trading framework with PPO agent, self-attention neural network, custom Gym environment, and advanced backtesting.

reinforcement-learning ai algotrading reinforcement-learning-algorithms trading-algorithms quantitative-finance attention-mechanism quantitative-trading backtesting trading-systems gym-environment reinforcement-learning-agent financial-machine-learning quantitative-research market-simulation stable-baselines3 ppo-algorithm

Updated Aug 6, 2025
Python

negarhonarvar / DeepReinforcementLearning

A collection of well-known deep RL algorithms implemented in Gymnasium's most popular environments.

dqn boltzmann-exploration sarsa lunar-lander cartpole-v1 d3qn swimmer softmax-exploration drl-algorithms ppo-algorithm gymnasium-environment

Updated Apr 13, 2025
Python

Ruchit-Gaurh / AI-Traffic-Management-System

🚦 Next-generation AI Traffic Management System with real-time computer vision, reinforcement learning optimization, emergency vehicle detection, and immersive 3D visualization

Updated Oct 14, 2025
Python

RongzheZhao2R2-lab / Implementing-Core-LLM-Algorithms-from-Scratch

This repository is dedicated to implementing algorithms "From Scratch". It goes beyond simple API calls, diving deep into the underlying logic of everything from basic training to cutting-edge techniques like DeepSeek-R1.

moe knowledge-distillation multimodal-learning alignment-algorithm rag mixture-of-experts rlhf ppo-algorithm grpo

Updated Nov 26, 2025
Python

mturan33 / isaaclab-anymal-locomotion

A legged locomotion project

ppo anymal isaacsim isaac-sim locomation legged-locomotion ppo-algorithm isaac-lab isaaclab anymal-c

Updated Nov 29, 2025
Python

zxy-tech / ppo-for-S-P-500-trading-strategy

A project that applies PPO to S&P 500 trading.

time-series-forecasting stockprediction stocktrader ppo-algorithm

Updated Mar 10, 2025
Python

green-hat-001 / NASA-Space-Apps-Commercialising-LEO-by-OptimAI

2D orbital rocket sim with PPO in PyTorch. Models thrust, drag, gravity, fuel; agent learns efficient ascent. Includes telemetry & visualization

ai python3 rocketry ppo-algorithm

Updated Dec 23, 2025
Python

omerjakoby / MARIO-RL-PPO

This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!

machine-learning tensorflow keras openai-gym cnn actor-critic mario-game proximal-policy-optimization ppo reinforcement-learning-agent ppo-algorithm

Updated Dec 12, 2025
Python

crystalknife / NeuroDrive-RL

Autonomous driving system using PPO-based Reinforcement Learning and CARLA Simulator for lane following and navigation.

Updated May 25, 2026
Python

unaizaahmedk / Balancing-Inverted-Pendulum-using-RL

Reinforcement learning–based controller for balancing an inverted pendulum using Proximal Policy Optimization (PPO). Supports configurable mass, length, and gravity settings (Earth, lunar, microgravity) with automated training logs, reward visualization, and performance analysis.

reinforcement-learning openai-gym reinforcement-learning-algorithms inverted-pendulum ppo-algorithm

Updated Mar 3, 2026
Python

Eight-Bells-Ltd / Smart_Pricing_MARL_NANCY

Smart Pricing for NANCY using Multi-Agent Reinforcement Learning and Reverse Auction Theory. Smart_Pricing_MARL_NANCY is an open-source, EU-co-funded Smart Pricing Module (SPM) developed for the NANCY project. It leverages Multi-Agent Reinforcement Learning (MARL) and Reverse Auction Theory to calculate optimal pricing strategies.

reinforcement-learning game-theory reverse-auction self-play multi-agent-reinforcement-learning marl ppo-algorithm

Updated Mar 17, 2026
Python

anjaliy11 / Powerly_

Autonomous Microgrid Balancer using PPO RL with Adversarial Training for resilience under High-Impact Low-Probability (HILP) disturbances.

automation python3 reinforcement-learning-agent ppo-algorithm gymnasium-environment

Updated Jun 19, 2026
Python

Lekssz / rl_trading_agent

Multi-modal RL trading agent (CNN + PPO) integrating market prices, macroeconomic indicators, and news signals. MSc dissertation artefact.

python machine-learning reinforcement-learning deep-learning cnn pytorch supervised-learning policy-gradient quantitative-finance feature-engineering algorithmic-trading macroeconomics news-analysis financial-time-series ppo-algorithm

Updated Jan 31, 2026
Python

mafaldaaires / Reinforcement-Learning

Stable Baselines3

gymnasium a2c-algorithm car-racing-environment ppo-algorithm

Updated Dec 26, 2023
Python

sanatren / Legal-Document-Analyzer

This Legal Document Analyzer is a proof-of-concept NLP project demonstrating the potential of transformers for legal document summarization.

deep-learning transformer bart reinforcement-learning-algorithms byte-pair-encoding huggingface ppo-algorithm finetuning-transformers

Updated Jun 8, 2025
Python

Anca-Mt / TrackmaniaRL-AI

AI agents for Trackmania using the TMRL package. Implemented DDPG, PPO, and used two SAC algorithms (with one or two critics) to train cars to navigate custom-built tracks.

python ai reinforcement-learning-algorithms game-ai ddpg-algorithm ppo-algorithm sac-algorithm tmrl tmrl-package modern-game-ai

Updated Aug 20, 2024
Python

AmudhanManimaran / AutoHeal_Autonomous-Server-Remediation-via-PPO-Reinforcement-Learning

python reinforcement-learning pytorch tensorboard self-healing anomaly-detection stable-baselines3 ppo-algorithm gymnasium-environment

Updated Apr 26, 2026
Python

pavansai018 / crowd_aware_safe_navigation

End-to-end safe robot navigation in crowds using ROS2. Chains LiDAR-based pedestrian detection (DR-SPAAM), SORT tracking, Gumbel Social Transformer trajectory prediction with conformal uncertainty bounds, and a PPO-Lagrangian constrained RL policy for safety-aware velocity control.

reinforcement-learning transformers pytorch ros2 gumbel-softmax transformer-architecture ppo-algorithm gumbel-social-transformer

Updated Jun 21, 2026
Python

ppo-algorithm

Here are 31 public repositories matching this topic...
