Model Configuration
Recommendations
L2 Regularization (Weight Decay)
1e-4
Penalizes large weights. Start with 1e-4 and adjust based on validation loss.
Dropout Rate
0.3
Apply 30% dropout before final layers. Reduces co-adaptation of neurons.
Label Smoothing
0.1
Use 0.1 label smoothing to prevent overconfident predictions.
L1 Regularization
0.0
Optional for sparse models. Usually 0 unless feature selection needed.