Hyperparameter name | Meaning of hyperparameter |
---|---|
learningrate | Learning rate for stochastic gradient descent optimization |
learningratepretraining | Learning rate for pre-training, may be specified separately |
epochs | Number of training epochs |
epochspretraining | Number of epochs for pre-training, may be specified separately |
nhiddens | Number of hidden nodes specified as a vector of numbers, containing one number for each hidden layer |
batchsizepretraining | Batch size used in pre-training |