--no_unicode
Switch
--no_unicode
Description
If set, all CHAR, VARCHAR and TEXT fields in MySQL are treated as strings. This applies both when reading from MySQL and when writing to MySQL, as well as when writing to any other output format specified (CSV, HTML, wordclouds). When printing wordclouds with this flag, all features containing non-ASCII characters are ignored.
If this flag is set during feature extraction, all non-ASCII characters are removed.
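To make "non-ASCII removed" concrete, here is a minimal Python sketch of that kind of filtering. The helper name `strip_non_ascii` is hypothetical and is not taken from the tool's source; it only illustrates the effect described above, not how the flag is actually implemented.

```python
def strip_non_ascii(text: str) -> str:
    """Drop every character outside the ASCII range (hypothetical helper)."""
    # errors="ignore" silently discards characters that ASCII cannot represent
    return text.encode("ascii", errors="ignore").decode("ascii")


if __name__ == "__main__":
    # ascii() keeps the printed output safe on consoles without Unicode support
    print(ascii(strip_non_ascii("café ☕ ok")))
```

Note that characters are dropped rather than replaced, so accented words lose letters ("café" becomes "caf") and the surrounding whitespace is left as it was.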
Argument and Default Value
Unicode is the default, so no flag is needed to enable it. Use this flag only to turn Unicode off.
This flag sets the encoding to latin1. You can override this with --encoding.
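As a rough illustration of why the encoding matters, the sketch below checks which strings fit in a single-byte Latin-1 encoding and which need UTF-8. It uses Python's `latin-1` codec as a stand-in for the MySQL setting, and the helper `fits_encoding` is hypothetical; none of this is taken from the tool's own code.

```python
def fits_encoding(text: str, encoding: str) -> bool:
    """Return True if every character in `text` can be encoded with `encoding`."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Assumed sample strings, chosen only for illustration
    for sample in ["cafe", "café", "coffee ☕"]:
        print(ascii(sample),
              "latin-1:", fits_encoding(sample, "latin-1"),
              "utf-8:", fits_encoding(sample, "utf-8"))
```

Accented Western European letters fit in Latin-1, but symbols such as emoji do not, which is one reason to override the encoding when the data contains them.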
Example Commands
This command will write all output to HTML and CSV files as strings.
./fwInterface.py -d dla_tutorial -t msgs_xxx -g user_id -f 'feat$cat_LIWC2007$msgs_xxx$user_id$16to16' --outcome_table masterstats_r500 --group_freq_thresh 1000 --outcomes demog_age demog_gender --output_name xxx_output --rmatrix --sort --csv --no_unicode