Implementation of Double Deep Q Networks and Dueling Q Networks using Keras on Space Invaders using OpenAI Gym. Code can be easily generalized to other Atari games.
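Since the repo implements Double Deep Q Networks, here is a minimal NumPy sketch of the Double DQN target computation (not the repo's exact code): the online network chooses the next action and the target network evaluates it, which reduces the overestimation bias of vanilla Q-learning targets. Function and variable names are illustrative assumptions.

```python
import numpy as np

def double_dqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Sketch of Double DQN targets.

    rewards, dones: shape (batch,); q_*_next: shape (batch, n_actions).
    """
    # Action selection uses the ONLINE network...
    best_actions = np.argmax(q_online_next, axis=1)
    # ...but the action's value comes from the TARGET network.
    evaluated = q_target_next[np.arange(len(rewards)), best_actions]
    # Terminal transitions (done=1) get no bootstrapped future value.
    return rewards + gamma * (1.0 - dones) * evaluated

# Tiny worked example with two transitions (second one terminal)
rewards = np.array([1.0, 0.0])
dones = np.array([0.0, 1.0])
q_online_next = np.array([[0.2, 0.8], [0.5, 0.1]])
q_target_next = np.array([[0.3, 0.6], [0.4, 0.2]])
targets = double_dqn_targets(rewards, dones, q_online_next, q_target_next)
print(targets)  # first target: 1.0 + 0.99 * 0.6 = 1.594; second: 0.0
```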
You can install all the prerequisites with
pip install -r requirements.txt
- Download and install the appropriate vc_redist.*.exe file, which can be found here.
- Uninstall gym and atari-py (if already installed):
pip uninstall atari-py
pip uninstall gym[atari]
- Download VS build tools here
- Run the VS build setup and select "C++ build tools" and install it.
- Restart PC.
- Install cmake, atari-py and gym
pip install cmake
pip install --no-index -f https://github.com/Kojoley/atari-py/releases atari_py
pip install gym[atari]
Now run the following code:
import atari_py
print(atari_py.list_games())
If everything worked, it should return a list of all games, as shown below:
['adventure', 'air_raid', 'alien', 'amidar', 'assault', 'asterix', 'asteroids', 'atlantis',
'bank_heist', 'battle_zone', 'beam_rider', 'berzerk', 'bowling', 'boxing', 'breakout', 'carnival',
'centipede', 'chopper_command', 'crazy_climber', 'defender', 'demon_attack', 'double_dunk',
'elevator_action', 'enduro', 'fishing_derby', 'freeway', 'frostbite', 'gopher', 'gravitar', 'hero',
'ice_hockey', 'jamesbond', 'journey_escape', 'kaboom', 'kangaroo', 'krull', 'kung_fu_master',
'montezuma_revenge', 'ms_pacman', 'name_this_game', 'phoenix', 'pitfall', 'pong', 'pooyan',
'private_eye', 'qbert', 'riverraid', 'road_runner', 'robotank', 'seaquest', 'skiing', 'solaris',
'space_invaders', 'star_gunner', 'tennis', 'time_pilot', 'tutankham', 'up_n_down', 'venture',
'video_pinball', 'wizard_of_wor', 'yars_revenge', 'zaxxon']
I used a conda virtual environment and some Chinese mirrors for faster downloads. For ~/.condarc:
channels:
- https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/
- https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
- https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
- https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
- conda-forge
Then
conda install tensorflow
conda install keras
conda install numpy
conda install opencv
conda install -c hcc gym
conda install -c likan999 atari_py
conda install -c jiayi_anaconda gym-atari
sudo apt-get install -y python-pygame
git clone https://github.com/lusob/gym-tetris.git
cd gym-tetris/
pip install -e .
In an attempt to install the NVIDIA driver and CUDA, I followed the instructions from
- https://zhuanlan.zhihu.com/p/136527897
- https://blog.csdn.net/chenguizhenaza/article/details/100131944
and used
in place of
in the Software Update. It did not lead to good results, and I am considering trying Ubuntu 18.04 instead. Or maybe my GPU, a GeForce GT 620 OEM, is not compatible with CUDA 10.2.
Details about the code are covered in the blog post here.
To run the code use
python main.py with arguments, where the arguments are given by
usage: main.py [-h] [-g GAME] -n NETWORK -m MODE [-l LOAD] [-s SAVE] [-x] [-v]
Train and test different networks on Space Invaders or Tetris
optional arguments:
-h, --help show this help message and exit
-g GAME, --game GAME Specify the game to train on. Defaults to space invaders
-n NETWORK, --network NETWORK
Please specify the network you wish to use, either DQN or DDQN
-m MODE, --mode MODE Please specify the mode you wish to run, either train or test
-l LOAD, --load LOAD Please specify the file you wish to load weights from (for example saved.h5)
-s SAVE, --save SAVE Specify folder to render simulation of network in
-x, --statistics Specify to calculate statistics of network (such as average score on game)
-v, --view Display the network playing a game of space-invaders. Is overridden by the -s command
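The usage text above can be reproduced with a standard argparse setup; the sketch below is an assumption about how main.py might wire the flags (the default game string is hypothetical), not the repo's actual code.

```python
import argparse

# Hedged sketch of an argument parser matching the usage text above.
parser = argparse.ArgumentParser(
    description="Train and test different networks on Space Invaders or Tetris")
parser.add_argument("-g", "--game", default="SpaceInvaders-v0",  # default is an assumption
                    help="Specify the game to train on. Defaults to space invaders")
parser.add_argument("-n", "--network", required=True, choices=["DQN", "DDQN"],
                    help="Network to use, either DQN or DDQN")
parser.add_argument("-m", "--mode", required=True, choices=["train", "test"],
                    help="Mode to run, either train or test")
parser.add_argument("-l", "--load",
                    help="File to load weights from (for example saved.h5)")
parser.add_argument("-s", "--save",
                    help="Folder to render simulation of network in")
parser.add_argument("-x", "--statistics", action="store_true",
                    help="Calculate statistics of network (such as average score)")
parser.add_argument("-v", "--view", action="store_true",
                    help="Display the network playing; overridden by -s")

# Parse one of the example command lines from this README
args = parser.parse_args(["-n", "DDQN", "-m", "test", "-l", "saved.h5", "-v"])
print(args.network, args.mode, args.load, args.view)
```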
For example, to test the pre-trained Double Deep Q Network architecture and view the network playing space invaders use
python main.py -n DDQN -m test -l saved.h5 -v
or to train the Dueling Q Network architecture and then save the resulting video of the network playing in the test/ directory use
python main.py -n DQN -m train -s test
Note that as the model is trained, the program saves the network weights every 10,000 images to either saved.h5 or duel_saved.h5, for DDQN and DQN respectively.
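The checkpoint cadence described above can be sketched as follows. This is a stand-in illustration: `save_weights` here is a stub in place of Keras's `model.save_weights`, and the frame-counting loop is an assumption about how the training loop schedules saves.

```python
# Sketch: save weights every 10,000 images (frames), per the note above.
SAVE_INTERVAL = 10_000

def weights_path(network):
    # Assumed mapping from the README: DDQN -> saved.h5, Dueling -> duel_saved.h5
    return "saved.h5" if network == "DDQN" else "duel_saved.h5"

saved_at = []
def save_weights(path, frame):
    # Stub standing in for model.save_weights(path) in the real code
    saved_at.append((frame, path))

for frame in range(1, 30_001):      # simulate 30k training frames
    if frame % SAVE_INTERVAL == 0:  # checkpoint every 10k frames
        save_weights(weights_path("DDQN"), frame)

print(saved_at)  # checkpoints at frames 10000, 20000, 30000
```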