Re: Classify document using bag of words

2017-03-27 Thread alessandro.benedetti
Hi marotosg, John's suggestion will definitely work (I recommend a copyField for that analysis). What happens in your use case if a word is common to more than one bag of words (if that is possible at all in your use case)? Do you expect to get back all the classes? Scored in some way? In th

Re: Classify document using bag of words

2017-03-26 Thread Koji Sekiguchi
Hi, I'm not sure it will help, but here is a link to an article I wrote about document classification years ago: Comparing Document Classification Functions of Lucene and Mahout http://soleami.com/blog/comparing-document-classification-functions-of-lucene-and-mahout.

Re: Classify document using bag of words

2017-03-26 Thread John Blythe
Glad to hear it!

On Sun, Mar 26, 2017 at 3:49 PM Sergio García Maroto wrote:
> Sorry it actually works. Thanks a lot.
>
> On 26 March 2017 at 21:45, Sergio García Maroto wrote:
> > Hi John.
> > thanks for that.
> >
> > That's actually a good option but I would need the category text on the

Re: Classify document using bag of words

2017-03-26 Thread Sergio García Maroto
Hi John. Thanks for that. That's actually a good option, but I would need the category text on the field so I can facet on the field and get every category and the number.

On 26 March 2017 at 18:27, John Blythe wrote:
> You could use keepwords to filter out any other words besides your bag and
>

Re: Classify document using bag of words

2017-03-26 Thread Sergio García Maroto
Sorry it actually works. Thanks a lot.

On 26 March 2017 at 21:45, Sergio García Maroto wrote:
> Hi John.
> thanks for that.
>
> That's actually a good option but I would need the category text on the
> field so I can facet on the field and get every category and the number.
>
> On 26 March 2017

Re: Classify document using bag of words

2017-03-26 Thread John Blythe
You could use keepwords to filter out any other words besides your bag and then have a synonym filter that translates the remaining word(s) to a corresponding category/classification

On Sun, Mar 26, 2017 at 12:05 PM marotosg wrote:
> Hi,
>
> I have a very simple use case where I would need to cl
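
For readers skimming the thread, here is a rough sketch of the logic behind the keep-words-then-synonyms idea discussed above. It is not Solr configuration: in Solr this would normally live in the field type's analysis chain, and the Python below only imitates the two steps in plain code. The word list, the category names, and the helper names `classify` and `facet_counts` are all hypothetical, made up for illustration.

```python
from collections import Counter

# Hypothetical bag of words, each mapped to a category (illustrative only).
CATEGORY_BY_WORD = {
    "java": "programming",
    "python": "programming",
    "invoice": "finance",
    "budget": "finance",
}


def classify(text):
    """Return the sorted list of categories whose bag words occur in text."""
    tokens = text.lower().split()
    # Step 1 (keep-words): drop every token that is not in the bag.
    kept = [t for t in tokens if t in CATEGORY_BY_WORD]
    # Step 2 (synonym mapping): translate each kept word to its category.
    return sorted({CATEGORY_BY_WORD[t] for t in kept})


def facet_counts(docs):
    """Count how many documents fall into each category, facet-style."""
    counts = Counter()
    for doc in docs:
        counts.update(classify(doc))
    return dict(counts)
```

Because the mapped field holds the category text rather than the original word, counting over it gives one bucket per category, which is what faceting on that field would need. A document whose words hit several bags simply lands in several categories here; whether that is the desired behaviour depends on the use case.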