Back to Portfolio
Research
Reinforcement Learning Agent
An implementation of a reinforcement learning agent using Q-learning with epsilon-greedy exploration. The agent learns optimal policies in discrete state-action spaces through iterative interaction with simulated environments. The project includes implementations of tabular Q-learning, Deep Q-Networks (DQN) with experience replay, and double DQN variants. Training visualizations show convergence curves, Q-value heatmaps, and policy evolution over episodes. The codebase is structured as an educational resource with detailed docstrings explaining the theory behind each algorithm component.
Tech Stack
PythonNumPyGymnasiumMatplotlibPyTorch