I am using WEKA GUI to determine TF-IDF using unigrams and bigrams. I have used the "stringtowordvector filter) on my csv file and placed the following options to TRUE:
IDFTransform
TFTransform
OutputWordCounts and Tokenizer: N-gram tokenizer-max2-min1
However, after applying these to my data...