RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread Juan Pablo Mora
Some versions of the OpenJDK doesn´t include the Rhino Engine to run javascript dataimport. You have to use the Oracle JDK. Juampa. De: randolf.julian [randolf.jul...@dominionenterprises.com] Enviado el: martes, 20 de marzo de 2012 5:41 Para: solr-user@luc

Re: is the SolrJ call to add collection of documents a blocking function call ?

2012-03-20 Thread Michael Kuhlmann
Hi Ramdev, add() is a blocking call. Otherwise it had to start an own background thread which is not what a library like Solrj should do (how many threads at most? At which priority? Which thread group? How long keep them pooled?) And, additionally, you might want to know whether the transmi

Why does parameter "useCompoundFile" not work?

2012-03-20 Thread cheermc
Dear all, I want to generate compound type index instead of files contain fdt,fdx etc. I follow the suggestion to change the "useCompoundFile" parameter to true (both in indexDefaults and mainIndex) in solrconfig.xml, but when i use post.jar to post example xml file, i find the index is the same

Re: PorterStemmer using example schema and data

2012-03-20 Thread Birkmann, Magdalena
I tried that, and it seems like "recharging" and "rechargeable", for example, actually do stem to the same root (recharg). So why is it not working when I'm searching on my indexed sampledocs? The stemming works when I search for "videos" and it's actually "video" in the document, etc., but not for

Staggering Replication start times

2012-03-20 Thread Eric Pugh
I am playing with an index that is sharded many times, between 64 and 128. One thing I noticed is that with replication set to happen every 5 minutes, it means that each slave hits the master at the same moment asking for updates: :00:00, :05:00, :10:00, :15:00 etc. Replication takes very li

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-20 Thread Erick Erickson
Faceting is orthogonal to grouping, so be careful what you ask for. So adding faceting would be easy, the only reason I suggested grouping is your requirement that your brands be just a count of the number of distinct ones found, not the number of matching docs. So a really simple solution would b

Re: is the SolrJ call to add collection of documents a blocking function call ?

2012-03-20 Thread Erick Erickson
Also consider StreamingUpdateSolrServer if you want multiple threads to operate from your client. Best Erick On Tue, Mar 20, 2012 at 4:12 AM, Michael Kuhlmann wrote: > Hi Ramdev, > > add() is a blocking call. Otherwise it had to start an own background thread > which is not what a library like S

Re: is the SolrJ call to add collection of documents a blocking function call ?

2012-03-20 Thread darul
Hmm nice feature Erik -- View this message in context: http://lucene.472066.n3.nabble.com/is-the-SolrJ-call-to-add-collection-of-documents-a-blocking-function-call-tp3839387p3842232.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Why my email always been rejected?

2012-03-20 Thread Travis Low
I received it...sometimes it just needs some time. 2012/3/20 怪侠 <87863...@qq.com> > I send email to :solr-user@lucene.apache.org, but I always receive the > rejected email. It can't send successful. -- ** *Travis Low, Director of Development* ** * * *Centurion Research Solutions, LLC*

SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
I'm trying to figure out how it's possible for 2 solr instances (1 which is leader 1 is replica) to be out of sync. I've done commits to the solr instances, forced replication but still the solr instances have different info. The relevant snippet from my clusterstate.json is listed below. \

RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread randolf.julian
Thanks Mikhail and Juampa. How can I prove to our Systems guys that the Rhino Engine is not installed? This is the only way that I can prove that it's not installed and we have to have it for SOLR data importhandler script to run. Thanks again. - Randolf -- View this message in context: http://l

Re: querying on shards

2012-03-20 Thread Shawn Heisey
On 3/19/2012 11:55 PM, Ankita Patil wrote: Hi, I wanted to know whether it is feasible to query on all the shards even if the query yields data only from a few shards n not all. Or is it better to mention those shards explicitly from which we get the data and only query on them. for example : I

RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread Dyer, James
Taking a quick look at the code, it seems this exception could have been thrown for four reasons: (see org.apache.solr.handler.dataimport.ScriptTransformer#initEngine) 1. Your JRE doesn't have class "javax.script.ScriptEngineManager" (pre 1.6, loaded here via reflection) 2. Your JRE doesn't

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Mark Miller
Do you have the logs for this? Either around startup or when you are forcing replication. Logs around both would be helpful. Also the doc counts for each shard? On Mar 20, 2012, at 10:16 AM, Jamie Johnson wrote: > I'm trying to figure out how it's possible for 2 solr instances (1 > which is lea

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
DocCounts are the same. I am going to disable my custom component to see if that is mucking with something but it seems to be working properly. After looking at the results a little closer (expanding the number of results coming back) it seems that the same information is in both but the order in

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
ok, with my custom component out of the picture I still have the same issue. Specifically, when sorting by score on a leader and replica I am getting different doc orderings. Is this something anyone has seen? On Tue, Mar 20, 2012 at 11:09 AM, Jamie Johnson wrote: > DocCounts are the same.  I a

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Yonik Seeley
On Tue, Mar 20, 2012 at 11:17 AM, Jamie Johnson wrote: > ok, with my custom component out of the picture I still have the same > issue.  Specifically, when sorting by score on a leader and replica I > am getting different doc orderings.  Is this something anyone has > seen? This is certainly poss

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
HmmmOk, I don't see how it's possible for me to ensure that there are no ties. If a query were for *:* everything has a constant score, if the user requested 1 page then requested the next the results on the second page could be duplicates from what was on the first page. I don't remember ever

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Yonik Seeley
On Tue, Mar 20, 2012 at 11:39 AM, Jamie Johnson wrote: > HmmmOk, I don't see how it's possible for me to ensure that there > are no ties.  If a query were for *:* everything has a constant score, > if the user requested 1 page then requested the next the results on > the second page could be d

org.apache.solr.common.SolrException: Internal Server Error

2012-03-20 Thread qingwei201314
I use the solrJ to index a pdf file. File file = new File("1.pdf"); String urlString = constant.getUrl(); StreamingUpdateSolrServer solr = new StreamingUpdateSolrServer( urlString, 1, 1);

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
I believe we're using replication to only duplicate the index (standard SolrCloud nothing special on our end) so I don't see why the docids wouldn't be the sameam I missing something that is happening there that I am unaware of? On Tue, Mar 20, 2012 at 11:50 AM, Yonik Seeley wrote: > On Tue,

Re: Replication with different schema

2012-03-20 Thread in.abdul
Thanks .. i need to index data from one solr to another solr with different analyser .. Now i am able to do this by querying from solr which will be index into another solr NOTE: As the field which i need to reindex is stored so it is easy by as my index has 31 lakh record it is taking lot of time

Re: SolrCloud replica and leader out of Sync somehow

2012-03-20 Thread Jamie Johnson
Thanks Yonik, I really appreciate the explanation. It sounds like the best solution for me to solve this is to add the additional sort parameter. That being said is there a significant memory increase to do this when sorting by score? I don't see how with SolrCloud I can avoid doing this, and ho

RE: SOLR 3.3 DIH and Java 1.6

2012-03-20 Thread Dyer, James
I also applied a fix to both Trunk/4.x and the 3.x branch (will be in 3.6 when it is released). This should give you better error messages when something goes wrong when ScriptTransformer is invoked. It will tell you that you need >1.6 only if the functionality is absent (case #1 in my last me

Multi-valued polyfields - Do they exist in the wild ?

2012-03-20 Thread ramdev.wudali
Hi: We have been keen on using polyfields for a while. But we have been restricted from using it because they do not seem to support Multi-values (yet). I am wondering if there are any Custom implementations or is there any ETA on the Solr releases to include Multivalued PolyFields . Thank

SV: To truncate or not to truncate (group.truncate vs. facet)

2012-03-20 Thread rasser
Thanks for taking the time to help me Erick! Just to clarify my desired behavior from the facets. This is the index, notice color is multivalued to represent a model of car that has more than one color: Audi A4 audi A4_black black white Audi A4 audi A4_white white Volvo V50 volvo Volvo_V50

Re: Multi-valued polyfields - Do they exist in the wild ?

2012-03-20 Thread Yonik Seeley
On Tue, Mar 20, 2012 at 2:17 PM, wrote: > Hi: >   We have been keen on using polyfields for a while. But we have been > restricted from using it because they do not seem to support Multi-values > (yet). Poly-fields should support multi-values, it's more what uses them may not. For example LatL

Thanks All

2012-03-20 Thread vybe3142
Here is the core of the SOLRJ client that ended up accomplishing what I wanted String fileName2 = "C:\\work\\SolrClient\\data\\worldwartwo.txt"; SolrServer server = new StreamingUpdateSolrServer("http://localhost:8080/solr/",20,8); UpdateRequest req = new UpdateRequest("/up

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-20 Thread rasser
Thanks for taking the time to help me Erick! Just to clarify my desired behavior from the facets. This is the index, notice color is multivalued to represent a model of car that has more than one color: Audi A4 audi A4_black black white Audi A4 audi A4_white white Volvo V50 volvo Volvo_V50 b

Re: Any way to get reference to original request object from within Solr component?

2012-03-20 Thread SUJIT PAL
Hi Hoss, Thanks for the pointers, and sorry, it was a bug in my code (was some dead code which was alphabetizing the facet link text and also the parameters themselves indirectly by reference). I actually ended up building a servlet and a component to print out the multi-valued parameters usin

Re: Replication with different schema

2012-03-20 Thread Erick Erickson
OK, I was thrown off by your use of "schema", I thought you were talking about schema.xml Anyway, assuming you have some kind of loop that pages through the documents via Solr, gets the results and then sends them to another Solr server... yeah, that'll be slow. You have the "deep paging" prob

Re: To truncate or not to truncate (group.truncate vs. facet)

2012-03-20 Thread Erick Erickson
Ok, assuming sku is an un-tokenized field (and if it isn't, use a copyField) then just facet on that field. Then, at the app layer, combine them to get your aggregate counts. So your raw return would have Audi A4 (2) Audi A5 (1) Audi S8 (2) Volvo V50 (1) The app would have to be smart enough to s

Re: Thanks All

2012-03-20 Thread Chris Hostetter
: To get this to work correctly, the following server side config was needed : (I started from a barebones solr config) : 1. Add apache-solr-cell-3.5.0.jar to the /lib directory (or : wherever solr can access jars) as this contains the class : ExtractingRequestHandler : 2. Add the appropriate han

Re: StreamingUpdateSolrServer - thread exit timeout?

2012-03-20 Thread Chris Hostetter
: > Is there any way to get get the threads within SUSS objects to immediately : > exit without creating other issues? Alternatively, if immediate isn't : > possible, the exit could take 1-2 seconds. I could not find any kind of : > method in the API that closes down the object. you should take

Re: Thanks All

2012-03-20 Thread Lance Norskog
If you build it, they will come! On Tue, Mar 20, 2012 at 12:59 PM, vybe3142 wrote: > I'm still puzzled that there are no readily available alternatives to using > the Tika based ExtractingRequestHandler in the situation where the input > data is plain UTF-8 text files that SOLR needs to injest a

Re: Staggering Replication start times

2012-03-20 Thread William Bell
For our use case this is a no-no. When the index is updated, we need all indexes to be updated at the same time. We put all indexes (slaves) behind a load balancer and the user would expect the same results from page to page. On Tue, Mar 20, 2012 at 5:36 AM, Eric Pugh wrote: > I am playing with

Re: StreamingUpdateSolrServer - thread exit timeout?

2012-03-20 Thread Shawn Heisey
On 3/20/2012 8:11 PM, Chris Hostetter wrote: :> Is there any way to get get the threads within SUSS objects to immediately :> exit without creating other issues? Alternatively, if immediate isn't :> possible, the exit could take 1-2 seconds. I could not find any kind of :> method in the API