Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-22 Thread I-Chiang Chen
At this time we are not leveraging the NRT functionality. This is the initial data load process where the idea is to just add all 200 millions records first. Than do a single commit at the end to make them searchable. We actually disabled auto commit at this time. We have tried to leave auto commi

RequestHandler versus SearchComponent

2012-03-22 Thread Benson Margulies
I'm looking at the following. I want to (1) map some query fields to some other query fields and add some things to FL, and then (2) rescore. I can see how to do it as a RequestHandler that makes a parser to get the fields, or I could see making a SearchComponent that was stuck into the list just

Re: confirm subscribe to solr-user@lucene.apache.org

2012-03-22 Thread Andreas Kalla
Am 22.03.2012 23:47, schrieb solr-user-h...@lucene.apache.org: Hi! This is the ezmlm program. I'm managing the solr-user@lucene.apache.org mailing list. I'm working for my owner, who can be reached at solr-user-ow...@lucene.apache.org. To confirm that you would like m...@akalla.de added

Re: Unable to get solr server to initialize

2012-03-22 Thread Chris Hostetter
: Iam trying to get Solr installed using apache solr 3.5.0, Java 1.6.0, and : Drupal 7. I am able to log in through ssh, navigate to : apache-solr-3.5.0/example, and run java -jar start.jar. After that, however, : trying to access either localhost:8983/solr/admin or localhost:8983/solr just : cont

Re: MLT and solrcloud?

2012-03-22 Thread Darren Govoni
Ok, I'll do what I can to help! As always, appreciate the hard work Mark. On Thu, 2012-03-22 at 17:31 -0400, Mark Miller wrote: > On Mar 22, 2012, at 5:22 PM, Darren Govoni wrote: > > > Hi, > > It was mentioned before that SolrCloud has all the capability of > > regular solr (including handler

Re: MLT and solrcloud?

2012-03-22 Thread Mark Miller
On Mar 22, 2012, at 5:22 PM, Darren Govoni wrote: > Hi, > It was mentioned before that SolrCloud has all the capability of > regular solr (including handlers) with the exception of the MLT handler. > As this is a key capability for Solr, is there work planned to include > the MLT in SolrCloud? I

MLT and solrcloud?

2012-03-22 Thread Darren Govoni
Hi, It was mentioned before that SolrCloud has all the capability of regular solr (including handlers) with the exception of the MLT handler. As this is a key capability for Solr, is there work planned to include the MLT in SolrCloud? If so when? Our efforts greatly depend on it. As such, I'm hap

Re: Commit Strategy for SolrCloud when Talking about 200 million records.

2012-03-22 Thread Mark Miller
On Mar 21, 2012, at 9:37 PM, I-Chiang Chen wrote: > We are currently experimenting with SolrCloud functionality in Solr 4.0. > The goal is to see if Solr 4.0 trunk with is current state is able to > handle roughly 200million documents. The document size is not big around 40 > fields no more than

Re: url-encoded strings from Admin Query String search

2012-03-22 Thread Aaron Couch
hi Chris and Hoss: Thanks for the feedback. This is useful to hear. This seems like a bug to me but not a very important one. I'm new to Solr and seems like you have a great community here. -Aaron On Thu, Mar 22, 2012 at 1:34 PM, Chris Hostetter wrote: > > : The admin screen is made for doing

Re: url-encoded strings from Admin Query String search

2012-03-22 Thread Chris Hostetter
: The admin screen is made for doing a quick query against the default field : with the settings defined in the default search handler. To that end, it : assumes that all entered characters should be part of the search string, so it : encodes them accordingly. correct ... that text box in questi

org.apache.solr.common.SolrException: parsing error

2012-03-22 Thread qingwei201314
I post a *.doc file to the solr server, but I always get the error: org.apache.solr.common.SolrException: parsing error at org.apache.solr.client.solrj.impl.BinaryResponseParser.processResponse(BinaryResponseParser.java:43) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.

Boosting StandardQuery scores with a "subquery"?

2012-03-22 Thread Sean O'Connor
Hi all, I'm having some trouble wrapping my head around boosting StandardQueries. It looks like the function: query(subquery, default) is what I want, but the examples seem to focus on just returning a score (e.g. product of popularity and

org.apache.solr.common.SolrException: Internal Server Error

2012-03-22 Thread qingwei201314
My solr server is running, and following is my client code: File file = new File("1.pdf"); String urlString = constant.getUrl(); StreamingUpdateSolrServer solr = new StreamingUpdateSolrServer(

Re: problems with search in solr

2012-03-22 Thread Tomás Fernández Löbbe
Or if you still want to have stemming, you could use a Spanish stemmer, like: or Tomás On Thu, Mar 22, 2012 at 11:09 AM, Juan Pablo Mora wrote: > Remove the stemmer filter. "Caso" and "casa" are transformed into "cas" if > you use the stemmer filter. > > En español: > Quita el filtro de stem

Re: copyField question

2012-03-22 Thread ramdev.wudali
Hi Tomas: These fields are for searching only. Currently we have around 1.8M docs indexed.and Assuming each Doc has about 20 of these additional fields to be created as dynamic fields (worst case scenario), and also there are about 6K if these different values (I.e. If we were to create static fi

Re: Disseminate results from different sources

2012-03-22 Thread fbrisbart
Thanks for the replies, it fixed my mind, and I now have something to implement :o) I will try to do that with 2 requests: - 1 grouped by source to retrieve the documents to boost - 1 with a FunctionQuery to add the boosts computed during the first request It won't be easy to do that with 1 reque

RE: problems with search in solr

2012-03-22 Thread Juan Pablo Mora
Remove the stemmer filter. "Caso" and "casa" are transformed into "cas" if you use the stemmer filter. En español: Quita el filtro de stemmer, que se usa para sacar la raiz de las palabras, pero en tu caso la raíz de "casa" y "caso" es la misma, "cas". Un saludo. __

Re: problems with search in solr

2012-03-22 Thread Rafał Kuć
Hello! The probable cause is the use of solr.PorterStemFilterFactory. You can check it using the Solr admin or by removing that filter and reindexing your data. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Good morning: > I have problems with the result

problems with search in solr

2012-03-22 Thread PINA CORONADO, RAFAEL
Good morning: I have problems with the results obtained Solr search string (eg caso). Me back records with similar terms (in this example would return the same as if looking casa). The 1.4.1 version of Solr is The definition of type text in the file schema.xml is:

Re: whethere solr 3.3 index file is compatable with solr 4.0

2012-03-22 Thread Jan Høydahl
That's correct. Solr4 will read your existing index and let you use it with the feature set it already has. But in order for you to use new fieldTypes, you need to re-index your data. -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com

Re: Can solr index folder can be moved from one system to another?

2012-03-22 Thread olivier sallou
The index is not directory related, there is no path information in the index. You can create an index then move it anywhere (or merge it with an other one). I often do this, there is no issue. Olivier 2012/3/22 ravicv > Hi Tomás, > > I can not use Solr replcation in my scenario. My requiremen

Re: copyField question

2012-03-22 Thread Tomás Fernández Löbbe
I meant, how many values in total? A single document may have 20, but are those 20 shared with other document (even if they have different score) or each document will have 10-20 completely different values? I think Solr could handle a couple hundred of fields, but I don't know how it would behave

Re: Can solr index folder can be moved from one system to another?

2012-03-22 Thread Tomás Fernández Löbbe
When you say "and send to dotnet system through webservice", you mean that the client will be dotnet, but Solr is still going to be Solr, in Java, right? I'm sure that if you stop Solr, change the "index" directory (like if you unzip the one you brought from the other server) and start Solr again,

Re: Solr / Tika crashing when attempting to index large files

2012-03-22 Thread Erick Erickson
Why stop at 1G? But no, it's really all-or-nothing when you blast a file at Solr. But be sure you're bumping the _solr_ heap, not just Tomcat's heap. Best Erick On Wed, Mar 21, 2012 at 5:42 PM, vybe3142 wrote: > While waiting for someohe to help answer my multicore config issue :),... I > decide

Re: Bulk insertion in solr

2012-03-22 Thread Erick Erickson
Here's a sample of indexing with SolrJ instead of DIH, you could consider partitioning your problem to N copies of this and running in parallel. http://www.lucidimagination.com/blog/2012/02/14/indexing-with-solrj/ But you haven't indicated whether your speed issue is on the query side or the index

Re: Rescoring queries

2012-03-22 Thread Erick Erickson
You probably want to provide a custom Similarity class. Here's a start: http://wiki.apache.org/solr/SolrPlugins#Similarity That's just a brief hint, but it should get you started. From there you'll have to dig into the docs. Do take some care here. I believe this is called for _every_ doc that m

Re: Best way to index huge data quickly in solr multi core configuration

2012-03-22 Thread Erick Erickson
First question: What's taking the time? The data acquisition or the actual indexing process? Until you answer that question, you don't know where to spend your efforts Best Erick On Wed, Mar 21, 2012 at 4:10 AM, ravicv wrote: > Hi > > I am using Oracle Exadata as my DB. I want to index nearl

Re: possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-22 Thread tom
same On 22.03.2012 10:00, Markus Jelsma wrote: Can you try spellcheck.q ? On Thu, 22 Mar 2012 09:57:19 +0100, tom wrote: hi folks, i think i found a bug in the spellchecker but am not quite sure: this is the query i send to solr: http://lh:8983/solr/CompleteIndex/select? &rows=0 &echoParam

Re: url-encoded strings from Admin Query String search

2012-03-22 Thread fbrisbart
Maybe you don't use special characters such as '?', '&', ... in your query, but other guys do. If someone want to search for '? & the mysterians', it's impossible if you don't encode it. As the 'admin' interface must be used by anyone, the query has to be url-encoded. Franck Le mercredi 21 mars

Re: possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-22 Thread Markus Jelsma
Can you try spellcheck.q ? On Thu, 22 Mar 2012 09:57:19 +0100, tom wrote: hi folks, i think i found a bug in the spellchecker but am not quite sure: this is the query i send to solr: http://lh:8983/solr/CompleteIndex/select? &rows=0 &echoParams=all &spellcheck=true &spellcheck.onlyMorePopula

possible spellcheck bug in 3.5 causing erroneous suggestions

2012-03-22 Thread tom
hi folks, i think i found a bug in the spellchecker but am not quite sure: this is the query i send to solr: http://lh:8983/solr/CompleteIndex/select? &rows=0 &echoParams=all &spellcheck=true &spellcheck.onlyMorePopular=true &spellcheck.extendedResults=no &q=a+bb+ccc++ and this is the resul