A Theano-based implementation of Deep Q-learning is available on GitHub. During the DeepHack hackathon our team «’drop table table users;» (me, Alexey Ozerin, Alexander Notchenko, Mikhail, and Artur Kuzin) got this code up and running on Ubuntu in a couple of minutes thanks to the provided dep_script.sh. Running it on a Mac, however, caused some problems, which we managed to solve successfully.
First of all, to run the test script with ./run_nips.py --rom breakout, you need to make sure you have the corresponding ROMs:
$ cd ./roms
$ wget https://atariage.com/2600/roms/Breakout.zip
$ unzip Breakout.zip && rm Breakout.zip
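run_nips.py and ALE look the ROM up by its lowercase name (breakout.bin, as in the run log further below), so if the archive extracts the file with a different capitalization (the exact name inside the zip may differ; Breakout.bin here is just an assumption), rename it:
$ mv Breakout.bin breakout.bin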
You might need to alter ~/.theanorc:
$ cat ~/.theanorc
[global]
floatX = float32
device = gpu0

[nvcc]
fastmath = True
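If the config is picked up, importing Theano should also print the "Using gpu device 0: ..." line you can see in the run log further below. A quick sanity check, using the standard theano.config attributes:
$ python -c "import theano; print(theano.config.device, theano.config.floatX)"
This should mention gpu0 and float32 (Python 2 prints them as a tuple).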
An error about an unavailable numpy/multiarray.h header can be solved by exporting the proper include path variables (see the sketch right after this block) and/or by altering the ale_data_set.py file, replacing the corresponding lines:
import numpy as np
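The exact variable depends on your compiler setup; as one possible approach (CPATH is an assumption about what your gcc/clang honors), point the compiler at numpy's include directory before running the script:
$ export CPATH="$(python -c 'import numpy; print(numpy.get_include())'):$CPATH"
Alternatively, the same -I path can be added via cxxflags under the [gcc] section of ~/.theanorc.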
The annoying dnn error
File "/lib/python2.7/site-packages/Lasagne-0.1.dev0-py2.7.egg/lasagne/layers/dnn.py", line 13, in
raise ImportError("dnn not available") # pragma: no cover
ImportError: dnn not available
can be solved by copying the cuDNN header and library files to /usr/local/include and /usr/local/lib:
$ cd /cudnn-6.5-osx-v2/
$ ls
CUDNN_License.pdf    cudnn.h               libcudnn.dylib
INSTALL.txt          libcudnn.6.5.dylib    libcudnn_static.a
$ cp cudnn.h /usr/local/include/
$ cp *.dylib /usr/local/lib/
$ cp libcudnn_static.a /usr/local/lib/
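After copying the files you can quickly check whether Theano now picks cuDNN up (dnn_available() lives in theano/sandbox/cuda/dnn.py, the function mentioned in the P.S. below); this should print True:
$ python -c "from theano.sandbox.cuda import dnn; print(dnn.dnn_available())"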
$ cd /deep_q_rl/deep_q_rl
# and then running deep_q_rl looks like this:
$ ./run_nips.py --rom breakout
RL-Glue Version 3.04, Build 909
RL-Glue is listening for connections on port=4096
RL-Glue Python Experiment Codec Version: 2.1 (Build 738)
Connecting to 127.0.0.1 on port 4096...
RL-Glue :: Experiment connected.
A.L.E: Arcade Learning Environment (version 0.5.0)
[Powered by Stella]
Use -help for help screen.
Warning: couldn't load settings file: ./stellarc
Game console created:
ROM file: ../roms/breakout.bin
Cart Name: Breakout - Breakaway IV (1978) (Atari)
Cart MD5: f34f08e5eb96e500e851a80be3277a56
Display Format: AUTO-DETECT ==> NTSC
ROM Size: 2048
Bankswitch Type: AUTO-DETECT ==> 2K
Running ROM file...
Random Seed: Time
Game will be controlled through RL-Glue.
Initializing ALE RL-Glue ...
RL-Glue :: Environment connected.
Using gpu device 0: GeForce GT 750M
RL-Glue Python Agent Codec Version: 2.1 (Build 738)
Connecting to 127.0.0.1 on port 4096...
Agent Codec Connected
RL-Glue :: Agent connected.
/Lasagne-0.1.dev0-py2.7.egg/lasagne/layers/helper.py:69: UserWarning: get_all_layers() has been changed to return layers in topological order. The former implementation is still available as get_all_layers_old(), but will be removed before the first release of Lasagne. To ignore this warning, use `warnings.filterwarnings('ignore', '.*topo.*')`.
warnings.warn("get_all_layers() has been changed to return layers in "
INFO:root:training epoch: 1 steps_left: 50000
I hope these notes help you train and successfully run an Atari game agent.
P.S. If you still cannot run the code and keep getting the ImportError: dnn not available error, you might want to debug the dnn_available() function in theano/sandbox/cuda/dnn.py.
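A minimal sketch of that debugging, assuming a Theano version around 0.7 where dnn_available() records the reason for the failure in the dnn_available.msg attribute (check your version if the attribute is missing):
from theano.sandbox.cuda import dnn
# prints False if cuDNN was not picked up
print(dnn.dnn_available())
# the reason dnn_available() recorded, e.g. a missing header or library
print(dnn.dnn_available.msg)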