--create_random_sample percentage [random_seed]


Creates a new table with a random subset of rows from the table specified by -t. New table has _rand appended to the table name.

You must specify the percentage of rows to keep. Optionally you can give a random seed, which is 42 by default.

Other Switches

Required Switches:

Example Commands

Example 1: create the table msgs_rand that contains a random 10% of the rows in msgs:

dlatkInterface.py -d dla_tutorial -t msgs --create_random_sample .10

Example 2: create the table blog_outcomes_rand that contains a random 50% of the rows in blog_outcomes with the random seed 567:

dlatkInterface.py -d dla_tutorial -t blog_outcomes --create_random_sample .50 567