Re: Taxonomy in SOLR

2011-01-24 Thread Jonathan Rochkind
There aren't any great general-purpose, out-of-the-box ways to handle hierarchical data in Solr. Solr isn't an RDBMS. There may be some particular advice on how to set up a particular Solr index to answer particular questions with regard to hierarchical data. I saw a great point made recent

Re: Taxonomy in SOLR

2011-01-24 Thread Em
3/world/europe/germany/berlin
>> >>
>> >> I need *_taxon_hierarchy for faceting and label for autocomplete.
>> >>
>> >> With an RDBMS, I have at most 100 entries per taxonomy, but with Solr and 4 million documents the redundancy is huge, no?
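One common way to facet on depth-prefixed values like the one quoted above is Solr's `facet.prefix` parameter, which restricts facet counts to terms starting with a given string. The following is only a minimal sketch of building such a request: the field name `english_taxon_hierarchy` comes from this thread, while the helper `child_facet_params`, the host, and the `/solr/select` path are assumptions for illustration and would need to match the actual installation.

```python
from urllib.parse import urlencode


def child_facet_params(field, parent_token):
    """Build facet parameters listing the direct children of parent_token.

    parent_token is a depth-prefixed value such as "2/world/europe/germany".
    Children live one level deeper, so their tokens start with
    "3/world/europe/germany/".
    """
    depth_str, _, path = parent_token.partition("/")
    child_prefix = "%d/%s/" % (int(depth_str) + 1, path)
    return {
        "q": "*:*",
        "rows": 0,                  # only facet counts are wanted, no documents
        "facet": "true",
        "facet.field": field,
        "facet.prefix": child_prefix,
    }


params = child_facet_params("english_taxon_hierarchy", "2/world/europe/germany")
# Hypothetical host and handler path; adjust to the real Solr instance.
print("http://localhost:8983/solr/select?" + urlencode(params))
```

Because faceting works on indexed terms, a query like this does not need the hierarchy field to be stored.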

Re: Taxonomy in SOLR

2011-01-24 Thread Erick Erickson
er document
> >>
> >> Damien
> >>
> >> On 24/01/2011 10:30, Em wrote:
> >>
> >>> Hi Damien,
> >>>
> >>> why are you storing the taxonomies? When it comes to faceting, it only depends on indexed values. If there is a meaningful difference between the indexed and the stored value, I would prefer to use an RDBMS or something like that to reduce redundancy.
> >>>
> >>> Does this help?
> >>>
> >>> Regards
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Taxonomy-in-SOLR-tp2317955p2320666.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Re: Taxonomy in SOLR

2011-01-24 Thread Em
re you storing the taxonomies?
>>> When it comes to faceting, it only depends on indexed values. If there is a meaningful difference between the indexed and the stored value, I would prefer to use an RDBMS or something like that to reduce redundancy.
>>>
>>> Does this help?
>>>
>>> Regards

Re: Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
Thanks Em and Erick for your answers. Now I better understand how Solr works.

Damien

On 24/01/2011 16:23, Erick Erickson wrote:
> First, the redundancy is certainly there, but that's what Solr does, handles large amounts of data. 4 million documents is actually a pretty small corpus by

Re: Taxonomy in SOLR

2011-01-24 Thread Erick Erickson
First, the redundancy is certainly there, but that's what Solr does, handles large amounts of data. 4 million documents is actually a pretty small corpus by Solr standards, so you may well be able to do exactly what you propose with acceptable performance/size. I'd advise just trying it with, say,

Re: Taxonomy in SOLR

2011-01-24 Thread Em

Re: Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
On 24/01/2011 13:10, Em wrote:
> Hi Damien, ahm, the formula I wrote was no definitive guide, just some numbers I combined to visualize the amount of data - perhaps not even a complete formula. Well, when you can use your taxonomy as indexed-only you do not double the used disk space when yo

Re: Taxonomy in SOLR

2011-01-24 Thread Em
ex is much smaller than the size of the original data. I hope what I tried to explain was easy to understand.

Regards

Re: Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
Thanks Em,

How can I calculate the index time, update time, and disk space used by one taxonomy?

On 24/01/2011 10:58, Em wrote:
> 100 Entries per taxon? Well, with Solr you got 100 taxon-entries * 4mio docs * 10 taxons. If your indexed taxon-versions are looking okay, you could leave out the DB-

Re: Taxonomy in SOLR

2011-01-24 Thread Em
100 entries per taxon? Well, with Solr you get 100 taxon entries * 4 million docs * 10 taxons. If your indexed taxon versions look okay, you could leave out the DB overhead and do everything in Solr.
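Multiplied out, the numbers in this message give a rough upper bound on the count of taxonomy values across the index. This is only the arithmetic, not a measurement of index size: it assumes every document carries the full 100 entries in each of the 10 taxonomy fields, and elsewhere in this thread Em notes the formula was no definitive guide. The variable names are illustrative.

```python
# Figures taken from the message above; the variable names are illustrative.
entries_per_taxon = 100      # maximum entries in one taxonomy
documents = 4_000_000        # documents in the index
taxon_fields = 10            # taxonomy fields per document

# Worst case: every document repeats every entry in every taxonomy field.
upper_bound = entries_per_taxon * documents * taxon_fields
print(f"{upper_bound:,} taxonomy values at most")
```

In practice a document holds only the path for its own taxon, and an inverted index keeps each distinct term once with a list of the documents containing it, so the real footprint is best found by indexing a sample and measuring.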

Re: Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
Yes, I am not obliged to store the taxonomies. My taxonomies look like this:

english_taxon_label = Berlin
english_taxon_type = location
english_taxon_hierarchy = 0/world 1/world/europe 2/world/europe/germany
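The depth-prefixed values in `english_taxon_hierarchy` can be derived mechanically from a plain path at indexing time. A minimal sketch, assuming the taxonomy is available as a `/`-separated path; `hierarchy_tokens` is a hypothetical helper written for illustration, not part of Solr:

```python
def hierarchy_tokens(path):
    """Return one depth-prefixed token per level of a '/'-separated path.

    Each token is "<depth>/<path up to that level>", with depth starting at 0.
    """
    parts = [p for p in path.strip("/").split("/") if p]
    return [
        "%d/%s" % (depth, "/".join(parts[: depth + 1]))
        for depth in range(len(parts))
    ]


# The values to index for a document located in Berlin.
for token in hierarchy_tokens("world/europe/germany/berlin"):
    print(token)
```

Each token would be indexed as a separate value of a multi-valued field, so a facet on that field can be narrowed to a single level by its depth prefix.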

Re: Taxonomy in SOLR

2011-01-24 Thread Em

Re: Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
My schema:

On 24/01/2011 09:56, Em wrote:
> Hi Damien, can you provide a schema sample plus example data? Since your information is really general, I think no one can give you situation-specific advice.
>
> Regards

Re: Taxonomy in SOLR

2011-01-24 Thread Em
Hi Damien, can you provide a schema sample plus example data? Since your information is really general, I think no one can give you situation-specific advice.

Regards

Taxonomy in SOLR

2011-01-24 Thread Damien Fontaine
Hi, I am trying Solr and I have one question. In the schema that I set up, there are 10 fields that always hold the same data (hierarchical taxonomies), but with 4 million documents, disk space and indexing time must be large. I need this field for autocomplete. Is there another way to do this type of ope