Hello Erick, Thanks a lot for your reply! Your suggestion is actually exactly the alternative solution we are thinking about and with your clarification on Solr's performance we are going to go for it! Many thanks again!
Milen ________________________________________ Von: Erick Erickson [erickerick...@gmail.com] Gesendet: Sonntag, 23. September 2012 17:50 An: solr-user@lucene.apache.org Betreff: Re: Help with new Join Functionallity in Solr 4.0 The very first thing to try is flatten your data so you don't have to use joins. I know that goes against your database instincts, but Solr easily handles millions and millions of documents. So if the cross-product of docs and modules isn't prohibitive, that's what I'd do first. Then it's just a matter of forming a search without joins.... Joins run into performance issues when the join field has many unique values, unfortunately the field people often want to join on is something like a <uniqueKey> (or PK in RDBMS terms), so be aware of that. Best Erick On Fri, Sep 21, 2012 at 5:46 AM, <milen.ti...@materna.de> wrote: > Dear Solr community, > > I am rather new to Solr, however I already find it kind of attractive. We are > developing a research application, which contains a Solr index with three > different kinds of documents, here the basic idea: > > > - A document of type "doc" consisting of fields id, docid, doctitle > and some other metadata > > - A document of type "module" consisting of fields id, modid and text > > - A document of type "docmodule" consisting of fields id, docrefid, > modrefid and some metadata about the relation between a document and a > module; filed docrefid refers to the id of a "doc" document, while field > modrefid contains the id of a "module" document > > In other words, in our model there are documents (type "doc") consisting of > several modules and there is some characterization of each link between a > document and a module. > > Almost all fields of a "doc" document are searchable, as well as the text of > a module and the metadata of the "docmodule" entries. > > We are looking for a fast way to retrieve all modules containing a certain > text and associated with a given document, preferably with a single query. > This means we want to query the text from a "module" document while we set a > restriction on the docrefid from a "docmodule" or the id from a "doc" > document. Is this possible by means of the new pseudo joins? Any ideas are > highly appreciated! > > Thanks in advance! > > Milen Tilev > Master of Science > Softwareentwickler > Business Unit Information > ________________________________________________ > > MATERNA GmbH > Information & Communications > > Voßkuhle 37 > 44141 Dortmund > Deutschland > > Telefon: +49 231 5599-8257 > Fax: +49 231 5599-98257 > E-Mail: milen.ti...@materna.de<mailto:milen.ti...@materna.de> > > www.materna.de<http://www.materna.de/> | > Newsletter<http://www.materna.de/newsletter> | > Twitter<http://twitter.com/MATERNA_GmbH> | > XING<http://www.xing.com/companies/MATERNAGMBH> | > Facebook<http://www.facebook.com/maternagmbh> > ________________________________________________ > > Sitz der MATERNA GmbH: Voßkuhle 37, 44141 Dortmund > Geschäftsführer: Dr. Winfried Materna, Helmut an de Meulen, Ralph Hartwig > Amtsgericht Dortmund HRB 5839 >