optimal params for training #7

Open
vinnitu opened this issue Jan 9, 2017 · 0 comments

vinnitu commented Jan 9, 2017

I am trying to train the model on the data used in your code, but my GPU has limited memory (8 GB on a GTX 1080).

What are the best options to use with this device?

For now I have reduced the dataset from 15,437,674 to 5,000,000 lines.
My ~/.theanorc:

[global]
floatX = float32
device = gpu0

[nvcc]
fastmath = True

[lib]
cnmem = .9

As for the program options, I currently have the following:

NUMBER_OF_ITERATIONS = 10000 # 20000
EPOCHS_PER_ITERATION = 1 # 5
RNN = recurrent.LSTM
INPUT_LAYERS = 2
OUTPUT_LAYERS = 2
AMOUNT_OF_DROPOUT = 0.3
BATCH_SIZE = 500
HIDDEN_SIZE = 700
INITIALIZATION = "he_normal" # Gaussian initialization scaled by fan_in (He et al., 2014)
MAX_INPUT_LEN = 40
MIN_INPUT_LEN = 3
INVERTED = True
AMOUNT_OF_NOISE = 0.2 / MAX_INPUT_LEN
NUMBER_OF_CHARS = 100 # 75
CHARS = list("abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ .")
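
To put these settings in perspective, here is a rough back-of-the-envelope estimate of where the memory might go. It is only a sketch under stated assumptions, not a measurement: it assumes standard LSTM layers (4 gates each), float32 values (as set by floatX above), a stack of INPUT_LAYERS + OUTPUT_LAYERS recurrent layers whose first layer reads one-hot vectors of width NUMBER_OF_CHARS, and a dataset that is one-hot encoded in full at one byte per cell. The helper names are hypothetical and are not part of the project; optimizer state, gate activations, dropout masks, and Theano's own workspace are ignored.

```python
# Rough memory estimate. All helper names are hypothetical, and every
# architectural detail below is an assumption, not taken from the project.
BYTES_PER_FLOAT32 = 4  # floatX = float32 in ~/.theanorc


def lstm_param_count(input_dim, hidden_size):
    """Parameters of one standard LSTM layer: 4 gates, each with input
    weights, recurrent weights, and a bias vector."""
    return 4 * (hidden_size * (input_dim + hidden_size) + hidden_size)


def stacked_lstm_param_count(input_dim, hidden_size, num_layers):
    """Parameters of a stack: the first layer reads input_dim features,
    every later layer reads the previous layer's hidden state."""
    first = lstm_param_count(input_dim, hidden_size)
    rest = (num_layers - 1) * lstm_param_count(hidden_size, hidden_size)
    return first + rest


def hidden_state_bytes(batch_size, seq_len, hidden_size, num_layers):
    """Bytes for the hidden states of every layer at every timestep of one
    batch. This is a lower bound on what backpropagation must keep."""
    return batch_size * seq_len * hidden_size * num_layers * BYTES_PER_FLOAT32


def one_hot_dataset_bytes(num_lines, seq_len, num_chars, bytes_per_cell=1):
    """Bytes needed to hold the whole dataset one-hot encoded at once."""
    return num_lines * seq_len * num_chars * bytes_per_cell


if __name__ == "__main__":
    layers = 2 + 2  # INPUT_LAYERS + OUTPUT_LAYERS (assumed to be one stack)
    params = stacked_lstm_param_count(100, 700, layers)
    print("weights (bytes):", params * BYTES_PER_FLOAT32)
    print("hidden states per batch (bytes):",
          hidden_state_bytes(500, 40, 700, layers))
    print("one-hot dataset, 5,000,000 lines (bytes):",
          one_hot_dataset_bytes(5000000, 40, 100))
```

Under these assumptions BATCH_SIZE, MAX_INPUT_LEN, and HIDDEN_SIZE scale the per-batch figure linearly, while the number of lines only affects the size of the encoded dataset, so the two can be tuned separately.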

What would you advise?

Thank you for any help.
