Erick, Thanks for the help, it sure helps a lot to read that, as it gives me more confidence I am not crazy about what I am thinking. The only problem I see by de-normalizing data as you said is that if any relation between customer and vendor changes, I will have to update the index for all the vendors. I could have about 10 000 customers per vendor. Anyway, by what you're saying, it's more common than I was imagining, right? I wonder how long solr will take to reindex 10000 records when this happens.
Thanks, Marcelo Valle. 2012/10/24 Erick Erickson <erickerick...@gmail.com> > One, take off your RDBMS cap <G>... > > DB folks regularly reject the idea of de-normalizing data > to make best use of Solr, but that's what I would explore > first. Yes, this repeats the, in your case, vendor information > perhaps many times, but try that first, even though that > causes you to update multiple customers whenever a vendor > changes. You haven't specified how many customers and vendors > you're talking abou there, but unless the total number of documents > (where each document is a customer+vendor combination) > is multiple tens of millions, you probably will be fine. > > You can get a list of just customers by using grouping where you > group on customer, although that may not be the most efficient. You > could index a field, call it "cust_filter" that was set to true for the > first > customer/vendor you indexed and false (or just left out) for all the > rest and q=blahblah&fq=cust_filter:true. > > Hope that helps > Erick > > On Wed, Oct 24, 2012 at 12:01 PM, Marcelo Elias Del Valle > <mvall...@gmail.com> wrote: > > Hello, > > > > I am new to Solr and I have a scenario where I want to use it, but I > > might be misunderstanding some concepts. I will explain what I want here, > > if someone has a solution for this, I would gladly accept the help. > > I have a core indexing customers. I have another core indexing > vendors. > > Both are related to each other. > > Here is what I want to do in my application: I want to find all the > > customers that follow some criteria and them find the vendors related to > > them. > > > > My first option was to to have just vendor core and in for each > > document in vendor core I would have all the customers related to it. > > However, I would write the same customer several times to the index, as > > more than one vendor could be related to the same customer. Besides, I > > wonder how would I write a query to list just the different customers. > > Another problem is that I update customers in a different frequency I > > update vendors, but have vendor + customers in a single document would > obly > > me to do the full update. > > > > Does anyone have a good solution for this I am not being able to > see? I > > might be missing some basic concept here... > > > > Thanks, > > -- > > Marcelo Elias Del Valle > > http://mvalle.com - @mvallebr > -- Marcelo Elias Del Valle http://mvalle.com - @mvallebr