reinfrocement-learning
Placeholder for the in-development RL-based 2D quad mesh generator (advancing-front + Pan et al.'s SAC). See GitHub for status; no functional code published yet.
Gym Armed Bandits is an environment bundle for OpenAI Gym
My implementation of Hindsight replay in PyTorch: "Hindsight Experience Replay"