Re: Solr 4.0 segment flush times has bigger difference between two machines

2012-10-20 Thread Erick Erickson
My first question is why this matters. Is this curiosity, or is there a real performance issue you're tracking down? I don't quite understand what you mean by "machine A forwards...to machineB". Are you talking about replication here? Or SolrCloud? Details matter, a lot. DIH has nothing that I know

Re: Solr-4.0.0 DIH not indexing xml attributes

2012-10-20 Thread Billy Newman
Sorry guys, it had nothing to do with the DIH's ability to parse attributes. My xslt did not work with the DIH. I used xsltproc to test my xslt and it worked great. However the DIH xslt transformation failed. I was able to move some things around in the xslt to get things working. Not sur

Re: Multicore setup is ignored when deploying solr.war on Tomcat 5/6/7

2012-10-20 Thread Rogerio Pereira
Here's the catalina.out contents: Out 20, 2012 12:55:58 PM org.apache.solr.core.SolrResourceLoader locateSolrHome INFO: using system property solr.solr.home: /home/rogerio/Dados/salutisvitae Out 20, 2012 12:55:58 PM org.apache.solr.core.SolrResourceLoader INFO: new SolrResourceLoader for deduced

Index polygon/bbox with DIH

2012-10-20 Thread Billy Newman
Hey guys, Just started using Solr 4 and my main use case involves indexing bounding boxes/polygons. I have a pretty small dataset and am currently using the DIH (URLDatasource) to index my XML. Part of my XML comes back as minx, miny, maxx, maxy. Is it possible to index my bbox using the DIH?

Re: SimpleTextCodec usage tips?

2012-10-20 Thread Erick Erickson
Yeah, all this is new, usage tips are often something that gets done on an "as needed" basis. I've been curious about per-field codecs, and your post prompted me to create a Wiki page here: http://wiki.apache.org/solr/SimpleTextCodecExample Feel free to edit it as you try it out, I find that the

Re: SimpleTextCodec usage tips?

2012-10-20 Thread seralf
Thanks very much, Erick! I had missed the "postingsFormat="SimpleText"" part! Now it works as expected on Solr 4 :-) I look forward to finding a use case I could contribute to your example, and if I find one I promise I'll add it. Thanks, Alfredo 2012/10/20 Erick Erickson > Yeah, all this is new,

Re: number and minus operator

2012-10-20 Thread Erick Erickson
Please review: http://wiki.apache.org/solr/UsingMailingLists There's not nearly enough information here to help you. Best Erick On Fri, Oct 19, 2012 at 1:06 PM, calmsoul wrote: > I have a document with name ABC 102030 XYZ and if i search for this document > with ABC and -"10" then i dont get th

Re: Solr 4.0 copyField not applying index analyzers

2012-10-20 Thread Erick Erickson
Are you sure you're not just seeing the stored value (which never has analysis applied)? That's what you get back when you specify fl=blah,blivet. Take a look at admin/schema browser and point it at the field to see what's actually in the index. Or get a copy of Luke. Otherwise, as Jack says

Re: Easy question ? docs with empty geodata field

2012-10-20 Thread David Smiley (@MITRE.org)
That'll probably work. Or with Solr 4's new spatial field types you can do a rectangle query of the whole world: geofieldname:[-90,-180 TO 90,180]. Perhaps it'd be nice to add explicit support for [* TO *]. - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- V
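
A minimal sketch of how the whole-world rectangle query from this reply might be sent as a filter query. The field name `geofieldname` and the range syntax come from the message; the helper name `world_rectangle_params` and the choice of `q=*:*` plus `fq` are assumptions for illustration. It only builds a query string and does not contact a server.

```python
from urllib.parse import urlencode


def world_rectangle_params(field):
    """Build request parameters that match docs with any value in a spatial field.

    `field` is the name of the spatial field (hypothetical helper, not a Solr API).
    """
    # Range syntax quoted in the message: lower-left "lat,lon" TO upper-right "lat,lon".
    whole_world = f"{field}:[-90,-180 TO 90,180]"
    # Match everything, then restrict to docs that fall inside the world rectangle.
    return {"q": "*:*", "fq": whole_world}


params = world_rectangle_params("geofieldname")
query_string = urlencode(params)  # URL-encoded form, ready to append after "select?"
```

Negating the same clause (prefixing the `fq` value with `-`) would be one way to look for documents with an empty field, which is what the thread title asks about; whether that behaves as hoped on a given schema is something to verify.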

Re: Data Writing Performance of Solr 4.0

2012-10-20 Thread Nagendra Nagarajayya
You may want to look at realtime NRT for this kind of performance: https://issues.apache.org/jira/browse/SOLR-3816 You can download realtime NRT integrated with Apache Solr from here: http://solr-ra.tgels.org Regards, - Nagendra Nagarajayya http://solr-ra.tgels.org http://rankingalgorithm.tgel

Doing facet count using group truncating with distributed search

2012-10-20 Thread Kenneth Vindum
Hi Solr users! Could any of you tell me how to do a facet count across several cores excluding duplicates. Eg. Core A: Page 1 Id=a Text=hello world Page 2 Id=b Text=hello again Core B: Page 1 Id=a Text=Hej verden Id=c Text=Ny besked Doing a facet count on core A giv

Re: Easy question ? docs with empty geodata field

2012-10-20 Thread darul
Thank you Amit, I'll let you know on Monday when I'm at the office, because I do not have access to Solr from home... But I guess I failed to use the dynamic field in the right way; it has been a long time since I read up on the basics ;) -- View this message in context: http://lucene.472066.n3.nabble.com/Easy-question-docs-wi

Re: Easy question ? docs with empty geodata field

2012-10-20 Thread darul
Indeed, it would be nice if we could use [* TO *]. Then, is it possible to do the following on Solr 3.6: geofieldname:[-90,-180 TO 90,180] -- View this message in context: http://lucene.472066.n3.nabble.com/Easy-question-docs-with-empty-geodata-field-tp4014751p4014944.html Sent from the Sol

Understanding Filter Queries

2012-10-20 Thread Amit Nithian
Hi all, Quick question. I've been reading up on the filter query and how it's implemented and the multiple articles I see keep referring to this notion of leap frogging and filter query execution in parallel with the main query. Question: Can someone point me to the code that does this so I can be

log4j binding finally working, more problems

2012-10-20 Thread Shawn Heisey
I managed to get a setup with a log4j binding working. I modified the build script so that the dist-war-excl-slf4j target excludes all jars with slf4j in the name. Then I put jars from the newest versions of slf4j and log4j into lib/ext under the jetty home. Then I added -Dlog4j.configuratio

Re: Understanding Filter Queries

2012-10-20 Thread Mikhail Khludnev
Amit, Sure. This method https://github.com/apache/lucene-solr/blob/trunk/solr/core/src/java/org/apache/solr/search/SolrIndexSearcher.java#L796 besides some other stuff, calculates the fq's docset intersection, which is supplied to the filtered search call https://github.com/apache/lucene-solr/blob/trunk/sol

Re: Understanding Filter Queries

2012-10-20 Thread Amit Nithian
Thanks. So I was poking through this and see that the filters are calculated up front and stored as docsets that get intersected and passed into Lucene in the filter. The question though is that somewhere in the IndexSearcher and somewhere into the scorer it does this but I can't quite find where.

Re: Understanding Filter Queries

2012-10-20 Thread Amit Nithian
Okay I think I found it. Let me know if this makes sense (also for those curious about this). 1) The IndexSearcher will create a FilteredQuery using the RANDOM_ACCESS_STRATEGY by default (IndexSearcher#wrapFilter). 2) When the searcher requests the scorer, the FilteredQuery uses the FilterStrateg
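
The "leap frogging" idea raised earlier in this thread can be illustrated with a small sketch. This is not Lucene's code: it is a hypothetical Python illustration over two sorted lists of document ids, and the function name `leapfrog_intersect` is made up for the example. Each side skips ahead to the other side's current candidate until both agree on a document.

```python
from bisect import bisect_left


def leapfrog_intersect(query_docs, filter_docs):
    """Intersect two sorted lists of doc ids by letting each side skip ahead in turn."""
    matches = []
    i = j = 0
    while i < len(query_docs) and j < len(filter_docs):
        q, f = query_docs[i], filter_docs[j]
        if q == f:
            # Both sides agree on this doc id: it passes the query and the filter.
            matches.append(q)
            i += 1
            j += 1
        elif q < f:
            # Query side is behind: jump to the first query doc >= the filter's candidate.
            i = bisect_left(query_docs, f, i)
        else:
            # Filter side is behind: jump to the first filter doc >= the query's candidate.
            j = bisect_left(filter_docs, q, j)
    return matches
```

The point of the skipping is that neither list has to be walked one document at a time when the other side is far ahead.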

Re: Solr Partial word search in a sentence.

2012-10-20 Thread Amit Nithian
On the surface this looks like you could use the minimum should match feature of the dismax handler and alter that behavior depending on whether or not the search is your main search or your fallback search as you described in your (c) case. On Sat, Oct 20, 2012 at 1:13 AM, Uma Mahesh wrote: > Hi
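
A rough sketch of what switching minimum-should-match between a main and a fallback search might look like on the client side. `defType` and `mm` are the usual dismax request parameters; the helper name `dismax_params` and the specific values `100%` and `50%` are placeholders chosen for illustration, not values taken from the thread.

```python
def dismax_params(user_query, fallback=False):
    """Build dismax request parameters, loosening mm for the fallback search."""
    # Hypothetical thresholds: require every term on the main search,
    # and only half of the terms on the fallback search.
    mm = "50%" if fallback else "100%"
    return {"defType": "dismax", "q": user_query, "mm": mm}


main_search = dismax_params("partial word search")
fallback_search = dismax_params("partial word search", fallback=True)
```

A client could run the main search first and issue the fallback only when the first one returns no results.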

SOLR capacity planning and Disaster relief

2012-10-20 Thread Worthy LaFollette
CAVEAT: I am a newbie w/r to SOLR (some Lucene experience, but not SOLR itself). Trying to come up to speed. What have you all done w/r to SOLR capacity planning and disaster relief? I am curious about the following metrics: - File handles and other ulimit/profile concerns - Space calculations (p