Pokémon Blue Reinforcement Learning

A deep reinforcement learning project that trains an AI agent to play Pokémon Blue using PyBoy emulation and deep Q-learning.

Overview

This project uses deep Q-learning to train an agent to navigate and play Pokémon Blue on the Game Boy. The system uses PyBoy for emulation and PyTorch for the deep learning model.

Inspired by Peter Whidden's Youtube Video.

Features

Custom Gym environment for Pokémon Blue
Deep Q-Learning implementation with PyTorch
Exploration-based reward system
Automated training pipeline
Model checkpointing

Prerequisites

Python 3.8+
PyTorch
Gymnasium
PyBoy
NumPy
A legally obtained Pokémon Blue/Red ROM file

Installation

Clone Repo
Install dependencies:

pip install torch numpy gymnasium pyboy
``

3. Place your legally obtained Pokémon ROM file in the project directory as `POKEMONR.GBC`.

4. (Optional) Create a saved state file using PyBoy and save it as `state_file.state`.

## Usage

Run the training script:

```sh

python pokemon-emulator-project.py

How It Works

Environment

The custom PokemonBlueEnv class wraps PyBoy to provide a Gym-compatible environment. It:

Captures screen state as observations
Translates actions to Game Boy button presses
Calculates rewards based on exploration of new map tiles
Manages episode termination

Agent

The DeepQLearningAgent implements deep Q-learning with:

Convolutional neural network for processing screen images
Experience replay for stable learning
Epsilon-greedy exploration strategy
Target network for stable Q-value estimation

Reward System

The agent receives rewards for:

Exploring new map tiles (visiting new coordinates)

Configuration

Modify these constants in the script to adjust training:

REPLAY_MEMORY_SIZE: Size of experience replay buffer
STEPS_PER_EPISODE: Maximum steps per episode
NUM_EPISODES: Number of episodes to train

Results

After training, model weights are saved to:

[MODEL_NAME]_model_state_dict.pth
[MODEL_NAME]_optimizer_state_dict.pth

Future Improvements

Incorporate game-specific rewards (badges, Pokémon caught, etc.)
Implement prioritized experience replay
Add visualization tools for agent behavior
Support for saving/loading training progress

Acknowledgments

Built with the guidance of Claude AI
Inspired by Peter Whidden's reinforcement learning projects
Uses PyBoy for Game Boy emulation

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
Makefile		Makefile
Readme.md		Readme.md
analysis.py		analysis.py
pokemon_blue_agent.py		pokemon_blue_agent.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Pokémon Blue Reinforcement Learning

Overview

Features

Prerequisites

Installation

How It Works

Environment

Agent

Reward System

Configuration

Results

Future Improvements

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

bobmaertz/pokemon-dqn-agent

Folders and files

Latest commit

History

Repository files navigation

Pokémon Blue Reinforcement Learning

Overview

Features

Prerequisites

Installation

How It Works

Environment

Agent

Reward System

Configuration

Results

Future Improvements

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages