RE: Creating facets based on the content field

2015-03-23 Thread Markus Jelsma
r 23, 2015 at 9:41 AM, wrote: > >>> I just want a list of recurring words (for now.) > >>> > >>> I removed the manually-created facets from solrconfig.xml and SOLR > >>> "automagically" created a facet list for me. > >>> >

Re: Creating facets based on the content field

2015-03-23 Thread Erick Erickson
gically" created a facet list for me. >>> >>> But thanks for your suggestions. >>> >>> >>> >>> - Mail original - >>> De: "Charlie Hull" >>> À: solr-user@lucene.apache.org >>> Envoyé: Lundi 23 Mars 2

Re: Creating facets based on the content field

2015-03-23 Thread Philippe de Rochambeau
for your suggestions. >> >> >> >> - Mail original - >> De: "Charlie Hull" >> À: solr-user@lucene.apache.org >> Envoyé: Lundi 23 Mars 2015 17:26:18 >> Objet: Re: Creating facets based on the content field >> >>> On 23/03/2

Re: Creating facets based on the content field

2015-03-23 Thread Erick Erickson
; > But thanks for your suggestions. > > > > - Mail original - > De: "Charlie Hull" > À: solr-user@lucene.apache.org > Envoyé: Lundi 23 Mars 2015 17:26:18 > Objet: Re: Creating facets based on the content field > > On 23/03/2015 16:08, phi...@free.

Re: Creating facets based on the content field

2015-03-23 Thread phiroc
nvoyé: Lundi 23 Mars 2015 17:26:18 Objet: Re: Creating facets based on the content field On 23/03/2015 16:08, phi...@free.fr wrote: > Let's say that one pdf has the following contents: Aren't you thinking of Named Entity Recognition? We've used Stanford NLP for this in the past

Re: Creating facets based on the content field

2015-03-23 Thread phiroc
I reindexed the PDFs without specifying facets and they "magically" appeared in facets.vm! Many thanks! - Mail original - De: "Alexandre Rafalovitch" À: "solr-user" Envoyé: Lundi 23 Mars 2015 17:23:40 Objet: Re: Creating facets based on the conte

Re: Creating facets based on the content field

2015-03-23 Thread Charlie Hull
im and a "Gandhi" facet is missing from sorlconfig.xml. Is there a way to dynamically generate a list of facets based on words contained in the content field? Cheers, Philippe - Mail original - De: "Erik Hatcher" À: solr-user@lucene.apache.org Envoyé: Lundi

Re: Creating facets based on the content field

2015-03-23 Thread Alexandre Rafalovitch
instein" facet is > therefore useless to him and a "Gandhi" facet is missing from sorlconfig.xml. > > Is there a way to dynamically generate a list of facets based on words > contained in the content field? > > Cheers, > > Philippe > > > > > >

Re: Creating facets based on the content field

2015-03-23 Thread phiroc
on words contained in the content field? Cheers, Philippe - Mail original - De: "Erik Hatcher" À: solr-user@lucene.apache.org Envoyé: Lundi 23 Mars 2015 16:30:49 Objet: Re: Creating facets based on the content field Philippe - can you provide a concrete example of w

Re: Creating facets based on the content field

2015-03-23 Thread Erik Hatcher
Philippe - can you provide a concrete example of what you mean by creating facets on field’s content? Or maybe rather, what’s missing from doing &facet.field=content currently? Erik > On Mar 23, 2015, at 10:48 AM, phi...@free.fr wrote: > > Hello, > > let's say that you haved indexed

Creating facets based on the content field

2015-03-23 Thread phiroc
Hello, let's say that you haved indexed hundreds of PDFs using the following curl command: curl -Ss -X POST 'http://mysolr:8990/solr/core0/update/extract?extractFormat=text&wt=json&literal.url=/path/to/the/pdf.pdf"; The PDF's contents are now stored in core0's "content" field. I wonder how yo