Overview

An implementation of Q-learning with automated hyperparameter optimization. It is built on Gymnasium's Taxi-v3 environment and uses Optuna to find the set of hyperparameters that maximizes reward. Q-table values are updated using the Bellman equation. The project can serve as a template for other environments.
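As a minimal sketch of the update mentioned above, assuming the standard tabular Q-learning rule Q(s, a) <- Q(s, a) + lr * (r + gamma * max Q(s', a') - Q(s, a)). The function name q_update and the list-of-lists table are illustrative and not taken from the repository, which may store the table differently (for example as a NumPy array).

```python
def q_update(q_table, state, action, reward, next_state, done, lr, gamma):
    """Apply one tabular Q-learning update in place and return the new value."""
    # No future value is bootstrapped from a terminal state.
    best_next = 0.0 if done else max(q_table[next_state])
    td_target = reward + gamma * best_next
    q_table[state][action] += lr * (td_target - q_table[state][action])
    return q_table[state][action]


# Tiny 2-state, 2-action table used only for illustration.
q = [[0.0, 0.0], [1.0, 2.0]]
new_value = q_update(q, state=0, action=1, reward=-1.0, next_state=1,
                     done=False, lr=0.5, gamma=0.9)
```

Here lr controls how far the stored value moves toward the target, and gamma controls how much the best value of the next state contributes to that target.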

Scripts (entry points)

optimize.py: Wraps the agent's training to measure reward across multiple trials, each with different hyperparameter values, in order to find the best-performing set. The values that yield the best result are then stored in the local directory. The number of trials and the search range for each hyperparameter are specified in the configuration file (see the Configuration section for details).
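The trial loop described above could look roughly like the following sketch. It uses Optuna's create_study, optimize, and suggest_float calls; train_and_score is a hypothetical stand-in for the real training run, and the search ranges are placeholders rather than the values in config.json.

```python
def train_and_score(params):
    """Hypothetical stand-in for training: returns a score to maximize."""
    # Toy surrogate that peaks at lr=0.1, gamma=0.95; a real objective would
    # train the agent and return its measured reward instead.
    return -((params["lr"] - 0.1) ** 2) - ((params["gamma"] - 0.95) ** 2)


def objective(trial):
    """Sample one hyperparameter set and return the value to maximize."""
    params = {
        # Placeholder ranges; the project reads its ranges from the config file.
        "lr": trial.suggest_float("lr", 0.01, 1.0, log=True),
        "gamma": trial.suggest_float("gamma", 0.8, 0.999),
    }
    return train_and_score(params)


def run_study(n_trials=20):
    """Run the search and return the best parameters found as a dict."""
    import optuna  # imported here so objective() can be exercised without Optuna

    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=n_trials)
    return study.best_params
```

The dict returned by run_study() is plain JSON-serializable data, so it can be written to a file with json.dump for a later training run to read.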

use.py: Trains and evaluates the Q-learning agent. The training stage reads the stored hyperparameters, trains a new agent, and saves the Q-table. The evaluation stage measures the average reward over multiple episodes. Training and evaluation can be run separately using script arguments.
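One way to expose the two stages as separate script arguments is sketched below with argparse. The flag names --train and --evaluate, and the choice to run both stages when no flag is given, are assumptions made for illustration; check use.py for the arguments it actually accepts.

```python
import argparse


def build_parser():
    """Build a parser with one switch per stage (hypothetical flag names)."""
    parser = argparse.ArgumentParser(description="Train and/or evaluate the agent.")
    parser.add_argument("--train", action="store_true", help="run the training stage")
    parser.add_argument("--evaluate", action="store_true", help="run the evaluation stage")
    return parser


def select_stages(argv):
    """Return the stage names to run, in execution order."""
    args = build_parser().parse_args(argv)
    # Assumption: with no flags, run both stages.
    if not (args.train or args.evaluate):
        return ["train", "evaluate"]
    requested = (("train", args.train), ("evaluate", args.evaluate))
    return [name for name, enabled in requested if enabled]
```

Keeping training first in the returned list matters because evaluation depends on the Q-table that training saves.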

Configuration

The configuration is loaded from a JSON file, by default ./config.json, which contains the settings for training, evaluation, and hyperparameter optimization. See config.json for an example.
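A minimal sketch of loading such a file follows. The load_config helper and every key in EXAMPLE_CONFIG are hypothetical, chosen only to show one section per concern; the repository's config.json is the authoritative reference for the real layout.

```python
import json
from pathlib import Path


def load_config(path="./config.json"):
    """Read the JSON configuration file and return it as a dict."""
    with Path(path).open(encoding="utf-8") as handle:
        return json.load(handle)


# Hypothetical layout, for illustration only.
EXAMPLE_CONFIG = json.loads("""
{
  "training": {"episodes": 5000},
  "evaluation": {"episodes": 100},
  "optimization": {"n_trials": 50, "lr": [0.01, 1.0], "gamma": [0.8, 0.999]}
}
""")
```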

Hyperparameters

lr: Learning rate for the Q-learning algorithm.

gamma: Discount factor for future rewards.

epsilon: Exploration rate for the epsilon-greedy policy.

epsilon_decay: Decay rate for the exploration rate.

epsilon_min: Minimum exploration rate.
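To illustrate how the three exploration parameters interact, here is a small sketch of epsilon-greedy action selection with a decaying rate. It assumes multiplicative decay applied once per step of the loop and floored at epsilon_min; the repository may schedule the decay differently, and the function names are illustrative.

```python
import random


def choose_action(q_row, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    # Greedy choice: index of the largest Q-value in this state's row.
    return max(range(len(q_row)), key=q_row.__getitem__)


def decay_epsilon(epsilon, epsilon_decay, epsilon_min):
    """Multiplicative decay (an assumption), never dropping below epsilon_min."""
    return max(epsilon_min, epsilon * epsilon_decay)


# Example values only: start fully exploratory and decay toward the floor.
epsilon = 1.0
for _ in range(1000):
    epsilon = decay_epsilon(epsilon, epsilon_decay=0.99, epsilon_min=0.05)
```

A high starting epsilon favors random exploration early on, while epsilon_min keeps a small amount of exploration after the rate has decayed.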
