Ejects low frequency observations and subsamples

trim(wfm, min.count = 5, min.doc = 5, sample = NULL, verbose = TRUE)

Arguments

wfm

an object of class wfm, or a data matrix

min.count

the smallest permissible word count

min.doc

the fewest permissible documents a word can appear in

sample

how many words to randomly retain

verbose

whether to say what we did

Value

If sample is a number then this many words will be retained after min.doc and min.doc filters have been applied.

See also

Author

Will Lowe