I'm trying to build a model using tweets. I've manually tagged 30 tweets as threatening and 50 random tweets as non-threatening. When I build the model with:

update(models2, batchSize="50",
  train(UNCLASS,
    features(UNCLASS,
      q="ProfileID:PROFCLUST1",
      featureSet="threatFeatures3",
      field="ClusterText",
      outcome="out_i",
      positiveLabel=1,
      numTerms=250),
    q="ProfileID:PROFCLUST1",
    name="threatModel3",
    field="ClusterText",
    outcome="out_i",
    maxIterations="100"))

It appears to work, but all the idfs_ds values are identical. The terms_ss values look reasonable, but nearly all the weights_ds values are 1.0. The out_i field is -1 for non-threatening tweets and +1 for threatening tweets. I'm trying to follow along with Joel Bernstein's excellent post here:
http://joelsolr.blogspot.com/2017/01/deploying-ai-alerting-system-with-solrs.html

Tips?

Thank you!

-Joe
