--stratify_folds
Switch
--stratify_folds
Description
This switch is used to specify that the data should be stratified into folds with respect to the outcome column. This will ensure that the outcome distribution of each fold is similar to the overall outcome distribution.
Argument and Default Value
None
Details
Can be used for both binary and continuous outcomes.
Other Switches
Required Switches:
Example Commands
# Runs stratified 10-fold cross validation on predicting the users ages from 1grams.
dlatkInterface.py -d dla_tutorial -t msgs -c user_id -f 'feat$1gram$msgs$user_id$16to16' --outcome_table blog_outcomes \
--outcomes age --combo_test_regression --model ridgecv --folds 10 --stratify_folds