Thanks David, I'm not clear on how the X value of a range pair will help me 
filter on pairs of start-end times.Can you explain how that'd work? Still, 
seems like the ability to create subobjects in solr is a huge feature, I'm 
hoping it'll eventually make it in.Eric.
 > From: dsmi...@mitre.org
> To: solr-user@lucene.apache.org
> Subject: Re: Solr 4.0 - Join performance
> Date: Tue, 14 Aug 2012 20:50:43 +0000
> 
> This one should work for now:
> https://issues.apache.org/jira/browse/SOLR-3304
> If you're comfortable with checking out Lucene/Solr and applying a patch, 
> then you can do it yourself and get it working without any real coding.  
> You'd have to use a dummy constant value for 'y' as you index rectangles, and 
> you'd configure it for non-geospatial.  The unfortunate piece is that 'x' 
> (nor 'y') can't be the full range of a double, and it's not oriented towards 
> a 'long' time value.  There's no JIRA issue for a one-dimensional spatial 
> field yet; that's pretty far down the priority list.  You are certainly not 
> the first that could use this feature, though.
> 
> ~ David Smiley
> 
> On Aug 14, 2012, at 4:19 PM, Eric Khoury wrote:
> 
> > 
> > Thanks David, that does indeed sound like it'll help.  Is there an issue 
> > number I can use to track development\availability?Eric.
> >> From: dsmi...@mitre.org
> >> To: solr-user@lucene.apache.org
> >> Subject: Re: Solr 4.0 - Join performance
> >> Date: Tue, 14 Aug 2012 20:15:27 +0000
> >> 
> >> Stepping back a bit, the reason you are using multiple cores with a join 
> >> is because Solr doesn't have a multi-valued numeric range type.  The 
> >> spatial work I'm doing in Lucene-spatial does, and it's 2-dimensional for 
> >> an x & y whereas your case calls for one dimension.  It's taking a bit of 
> >> time, but when finished you should be able to use it for your use case 
> >> ignoring the 'y'.  Eventually I'd like to develop  such a Solr field type 
> >> for a numeric/time range to do it more natively but that's a ways off.
> >> 
> >> Cheers,
> >>  ~ David Smiley
> >> 
> >> On Aug 2, 2012, at 10:45 AM, Eric Khoury wrote:
> >> 
> >>> 
> >>> 
> >>> 
> >>> 
> >>> 
> >>> 
> >>> Hello all,
> >>> 
> >>> 
> >>> 
> >>> I’m testing out the new join feature, hitting some perf
> >>> issues, as described in Erick’s article 
> >>> (http://architects.dzone.com/articles/solr-experimenting-join).
> >>> 
> >>> Basically, I’m using 2 objects in solr (this is a simplified
> >>> view):
> >>> 
> >>> 
> >>> 
> >>> Item
> >>> 
> >>> - Id
> >>> 
> >>> - Name
> >>> 
> >>> 
> >>> 
> >>> Grant
> >>> 
> >>> - ItemId
> >>> 
> >>> - AvailabilityStartTime
> >>> 
> >>> - AvailabilityEndTime
> >>> 
> >>> 
> >>> 
> >>> Each item can have multiple grants attached to it.
> >>> 
> >>> 
> >>> 
> >>> The query I'm using is the following, to find items by
> >>> name, filtered by grants availability window:
> >>> 
> >>> 
> >>> 
> >>> solr/select?fq=Name:XXX&q={!join
> >>> from=ItemId to=Id} AvailabilityStartTime:[* TO NOW] AND 
> >>> -AvailabilityEndTime:[*
> >>> TO NOW]
> >>> 
> >>> 
> >>> 
> >>> With a hundred thousand items, this query can take multiple seconds
> >>> to perform, due to the large number or ItemIds returned from the join 
> >>> query.
> >>> 
> >>> Has anyone come up with a better way to use joins for these types of 
> >>> queries?  Are there improvements planned in 4.0 rtm in this area?
> >>> 
> >>> 
> >>> 
> >>> Btw, I’ve explored simply adding Start-End times to items, but
> >>> the flat data model makes it hard to maintain start-end pairs.
> >>> 
> >>> 
> >>> 
> >>> Thanks for the help!
> >>> 
> >>> Eric.
> >>> 
> >>> 
> >>> 
> >>>                                     
> >> 
> >                                       
> 
                                          

Reply via email to