Final Lunar Lander Policy

This project is an enhancement of the final project from the Reinforcement Learning specialization on Coursera. It focuses on using the Expected Sarsa algorithm with a neural network as a function approximator to play Lunar Lander.

The model was trained using Keras and TensorFlow.

The complete source code for this project is available in this GitHub repository.