From: An alternative approach to dimension reduction for pareto distributed data: a case study
Hyperparameter
Value
Learning rate
0.001
Beta1
0.9
Beta2
0.999
Epsilon
1 × 10−7