distributed search is significantly slower than direct search

2013-10-28 Thread Elran Dvir
Hi all, I am using Solr 4.4 with multi cores. One core (called template) is my "routing" core. When I run http://127.0.0.1:8983/solr/template/select?rows=5000&q=*:*&shards=127.0.0.1:8983/solr/core1, it consistently takes about 7s. When I run http://127.0.0.1:8983/solr/core1/select?rows=5000&q=

Re: Apache-Solr with Tomcat: displaying the format of search result

2013-10-28 Thread Mamta Alshi
HI Ramesh, Have you documented how you have integrated Solr with Tomcat..? I have to do the same..was trying sometime ago and then left it...will be helpful if you have steps on how to integrate the two. Thanks, Mamta On Mon, Oct 28, 2013 at 2:40 PM, pyramesh wrote: > Hi All, > > Recently I

Re: Apache-Solr with Tomcat: displaying the format of search result

2013-10-28 Thread pyramesh
Thanks Shawn for quick response... As suggested, I verified my configuration to check whether the update processors configured or not and found no processors configured. I am just wonder how the format getting changed. Let explain my problem in details I am indexing the .xml file to solr.

Re: Solr 4.5.1 replication Bug? "Illegal to have multiple roots (start tag in epilog?)."

2013-10-28 Thread Sai Gadde
Hi Michael, I downgraded to Solr 4.4.0 and this issue is gone. No additional settings or tweaks are done. This is not a fix or solution I guess but, in our case we wanted something working and we were running out of time. I will watch this thread if there are any suggestions but, possibly we wil

Re: how to avoid recover? how to ensure a recover success?

2013-10-28 Thread deniz
I have had a similar problem before but the patch which was included with the version 4.1 fixed that... I couldnt reproduce the problem with the patch... anyone is able to reproduce this exception? - Zeki ama calismiyor... Calissa yapar... -- View this message in context: http://lucene.47

Re: Solr 4.5.1 Overseer error

2013-10-28 Thread Shawn Heisey
On 10/28/2013 5:50 PM, dboychuck wrote: I am upgrading from 4.4 to 4.5.1 I used to just upload my configurations to zookeeper and then install solr with no default core Solr would give me an error that no cores were created when I tried to access until I ran the collections API create command

Re: Single multilingual field analyzed based on other field values

2013-10-28 Thread Trey Grainger
Hi David, What version of the Solr in Action MEAP are you looking at (current version is 12, and version 13 is coming out later this week, and prior versions had significant bugs in the code you are referencing)? I added an update processor in the most recent version that can do language identifi

Solr 4.5.1 Overseer error

2013-10-28 Thread dboychuck
I am upgrading from 4.4 to 4.5.1 I used to just upload my configurations to zookeeper and then install solr with no default core Solr would give me an error that no cores were created when I tried to access until I ran the collections API create command to make a collection however now when I tr

Re: Background merge errors with Solr 4.4.0 on Optimize call

2013-10-28 Thread Matthew Shapiro
Sorry for reposting after I just sent in a reply, but I just looked at the error trace closer and noticed 1. Caused by: java.lang.IllegalArgumentException: no such field what The 'what' field was removed by request of the customer as they wanted the logic behind what gets queried in the "wha

Re: Background merge errors with Solr 4.4.0 on Optimize call

2013-10-28 Thread Matthew Shapiro
Thanks for your response. You were right, solr is logging to the catalina.out file for tomcat. When I click the optimize button in solr's admin interface the following logs are written: http://apaste.info/laup About JVM memory, solr's admin interface is listing JVM memory at 3.1% (221.7MB is dar

Re: Global User defined properties - solr.xml from Solr 4.4 to Solr 4.5

2013-10-28 Thread marotosg
Done https://issues.apache.org/jira/browse/SOLR-5398 -- View this message in context: http://lucene.472066.n3.nabble.com/Global-User-defined-properties-solr-xml-from-Solr-4-4-to-Solr-4-5-tp4097740p4098143.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Index JTS Point in Solr/Lucene index

2013-10-28 Thread David Smiley (@MITRE.org)
Just follow-ing up with this thread after a round of emails between Shahbaz and I… David Smiley wrote > Ooooh, I see your confusion. You looked at code in an > UpdateRequestProcessor and expected it to work on the client in SolrJ. It > won't work for the reason that the code in the URP is creat

Re: Single multilingual field analyzed based on other field values

2013-10-28 Thread Jack Krupansky
Consider an update processor - it can operate on any field and has access to all fields. You could have one update processor to combine all the fields to process, into a temporary, dummy field. Then run a language detection update processor on the combined field. Then process the results and p

Single multilingual field analyzed based on other field values

2013-10-28 Thread David Anthony Troiano
Hello, First some background... I am indexing a multilingual document set where documents themselves can contain multiple languages. The language(s) within my documents are known ahead of time. I have tried separate fields per language, and due to the poor query performance I'm seeing with that

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Roman, thanks for the link, will go through it. Erick, will try with expand=true once and check out the results. Will update this thread with the findings. I remember we rejected expand=true because of some weird spaghetti problem. Will check it out again. Thanks, Parvesh Garg http://www.zett

Re: Solr 4.5.1 replication Bug? "Illegal to have multiple roots (start tag in epilog?)."

2013-10-28 Thread Michael Tracey
Hey, this is Michael, who was having the exact error on the Jetty side with an update. I've upgraded jetty from the 4.5.1 embedded version (in the example directory) to version 9.0.6, which means I had to upgrade my OpenJDK from 1.6 to 1.7.0_45. Also, I added the suggested (very large) setting

Solr block join

2013-10-28 Thread Simon
Hi, The block join feature introduced in Solr 4.5 is really helpful in solving some of the issues in my project. I am able to get it working in simple cases. However, I couldn't figure out how to use it in some more complex cases and I could find very little reference about it. 1) how to return bo

Re: When is/should qf different from pf?

2013-10-28 Thread Upayavira
There'd be no point having them the same. You're likely to include boosts in your pf, so that docs that match the phrase query as well as the term query score higher than those that just match the term query. Such as: qf=text description&pf=text^2 description^4 Upayavira On Mon, Oct 28, 2013

Re: When is/should qf different from pf?

2013-10-28 Thread Amit Nithian
Thanks Erick. Numeric fields make sense as I guess would strictly string fields too since its one term? In the normal text searching case though does it make sense to have qf and pf differ? Thanks Amit On Oct 28, 2013 3:36 AM, "Erick Erickson" wrote: > The facetious answer is "when phrases aren

Re: Need idea to standardize keywords - ring tone vs ringtone

2013-10-28 Thread Jonathan Rochkind
Do you know about the Solr synonym feature? That seems more applicable to what you're describing then stopwords. I'd stay away from stopwords entirely here, and try to do what you want with synonyms. Multi-word synonyms can be tricky, I'm not entirely sure the right way to do it for this use

Re: Apache-Solr with Tomcat: displaying the format of search result

2013-10-28 Thread Shawn Heisey
On 10/28/2013 4:40 AM, pyramesh wrote: > But this is not I want.. I want to display data as same as input format. > > can anyone please help on this What Solr outputs in its fields for search results is identical to what it receives when data is indexed, unless you have update processors configur

Replace document title with filename if it's empty

2013-10-28 Thread Bayu Widyasanyata
Hi, I just found that some of PDFs files crawled has no (empty) 'title' metadata. How to define or fetch the filename, and use it (filename) replacing empty 'title' field? I didn't found "filename" field on schema.xml, and don't know how to make conditional for above conditions (if title is empty

Re: Need idea to standardize keywords - ring tone vs ringtone

2013-10-28 Thread Developer
Thanks for your response Eric. Sorry for the confusion. I currently display both 'ring tone' as well as 'ringtone' when the user types in 'r' but I am trying to figure out a way to display just 'ringtone' hence I added 'ring tone' to stopwords list so that it doesn't get indexed. I have the list

Re: Solr - what's the next big thing?

2013-10-28 Thread Otis Gospodnetic
Hi, On Sun, Oct 27, 2013 at 2:57 PM, Saar Carmi wrote: > If I get it right, Solr can store its data files on HDFS but it will not Correct. And can be used to build indices in parallel, using MapReduce, from data living on HDFS. > use map reduce to process the data (e.g. evaluating queries). Ri

Re: Compound words

2013-10-28 Thread Roman Chyla
Hi Parvesh, I think you should check the following jira https://issues.apache.org/jira/browse/SOLR-5379. You will find there links to other possible solutions/problems:-) Roman On 28 Oct 2013 09:06, "Erick Erickson" wrote: > Consider setting expand=true at index time. That > puts all the tokens i

Re: Proposal for new feature, cold replicas, brainstorming

2013-10-28 Thread Toke Eskildsen
On Sat, 2013-10-26 at 02:14 +0200, Chris Hostetter wrote: > I suspect that the most straight forward way to achieve what you are > folks seem to be describing would be to add a hook into the request > distribution processing so that you could have a custom plugin used when > solr does "Replica r

Re: return value from SolrJ client to php

2013-10-28 Thread Anshum Gupta
Hi Amit, I haven't personally tried it, but have a look at the options listed here: http://wiki.apache.org/solr/IntegratingSolr Also, just check if the library you try is known to work with the version of Solr you'd want to use. Otherwise, how about just using a serialization library for apps in

Re: Field Value depending on another field value

2013-10-28 Thread Anshum Gupta
Hi Ben, You can actually look at indexing single valued documents i.e. a different one for every store and then group by on the product id. Have a look at this presentation by Adrian Trenaman at the Lucene Revolution earlier this year: Presentation: http://www.slideshare.net/trenaman/personalized

return value from SolrJ client to php

2013-10-28 Thread Amit Aggarwal
Hello All, I have a requirement where I have to conect to Solr using SolrJ client and documents return by solr to SolrJ client have to returned to PHP. I know its simple to get document from Solr to SolrJ But how do I return documents from SolrJ to PHP ? Thanks Amit Aggarwal

Re: One of all shard stopping, all shards stop

2013-10-28 Thread Erick Erickson
I think if you set shards.tolerant=true you get information in the return packet if a shard is completely down. The other thing you can do is query the ZooKeeper cluster state directly. But I have to ask why you're not using a replica or two per shard. That should provide automatic fail-over etc

Re: Compound words

2013-10-28 Thread Erick Erickson
Consider setting expand=true at index time. That puts all the tokens in your index, and then you may not need to have any synonym processing at query time since all the variants will already be in the index. As it is, you've replaced the words in the original with synonyms, essentially collapsed t

Re: Data import handler with multi tables

2013-10-28 Thread dtphat
Hi, is there no another way to import all data for this case instead Only the way using compound key? Thanks. - Phat T. Dong -- View this message in context: http://lucene.472066.n3.nabble.com/Re-Data-import-handler-with-multi-tables-tp4098048p4098056.html Sent from the Solr - User mailing

Re: One of all shard stopping, all shards stop

2013-10-28 Thread hongkeun.yoo
Thanks for your reply. If one of server have stop and error, this option(distrib=false) is good work. Similarly option is "shards.tolerant=true". but I don't want to using this option. because the died server isn't show error message. only return not nothing data. I want to show error message at d

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Erick, Thanks for the suggestion. Like I said, I'm an infant. We tried synonyms both ways. sea biscuit => seabiscuit and seabiscuit => sea biscuit and didn't understand exactly how it worked. But I just checked the analysis tool, and it seems to work perfectly fine at index time. Now, I can ha

Re: Data import handler with multi tables

2013-10-28 Thread Stefan Matheis
> I think because is unique. When importing tbl_tableA import first, > tbl_tableB import after. tbl_tableB has id which the same id in tableA, so > only data of tableB had indexed with unique id. > > That's exactly what happens here :) If the second table would have fewer records than the firs

Field Value depending on another field value

2013-10-28 Thread bengates
Hello, I'm pretty new to Solr, and I have a question about best practice. I want to handle a Solr collection with products that are available in different shops. For several reasons, the price of a product may be the same or vary, depending the shop's location. What I don't know how to handle co

Solr 4.5.1 replication Bug? "Illegal to have multiple roots (start tag in epilog?)."

2013-10-28 Thread Sai Gadde
we have a similar error as this thread. http://www.mail-archive.com/solr-user@lucene.apache.org/msg90748.html Tried tomcat setting from this post. We used exact setting sepecified here. we merge 500 documents at a time. I am creating a new thread because Michael is using Jetty where as we use Tom

Re: Solr Update URI is not found

2013-10-28 Thread Bayu Widyasanyata
Hi Erick and All, The problem is solved by copying schema-solr4.xml into my collection's Solr conf (renamed to schema.xml). I didn't use hadoop there, and apologize if it's better to post on this Solr list since the problem appeared first on Solr Indexer step. Regarding "/2" option it's "e-mail b

Re: Optimal interval for soft commit

2013-10-28 Thread Erick Erickson
To reply to your original question, when you soft commit the top-level caches are thrown away. I.e. the filterCache, documentResultCache, all the ones in solrconfig.xml. And if you have a high autowarm count on them, you wind up doing a lot of work for no gain. Say your soft commit interval is 1 s

Re: Compound words

2013-10-28 Thread Erick Erickson
Why did you reject using synonyms? You can have multi-word synonyms just fine at index time, and at query time, since the multiple words are already substituted in the index you don't need to do the same substitution, just query the raw strings. I freely acknowledge you may have very good reasons

Apache-Solr with Tomcat: displaying the format of search result

2013-10-28 Thread pyramesh
Hi All, Recently I have integrated Apache solr with Tomcat server.everything is working fine. I am displaying the search result using velocity template. But Here is my problem. search results are displaying the correct format as input data format. For Example: input data (whole data contains in

Re: Optimal interval for soft commit

2013-10-28 Thread Mugoma Joseph O.
Hello, > How do you add the documents to the index - one by one, batches of n ? Documents are added one by one using solrj > When do you do your commits ? We have the following settings in solrconfig.xml: 180 false 15000 Thanks. Mug

Re: Newbie to Solr

2013-10-28 Thread michael.boom
I don't see the mentioned attachement. Try using http://snag.gy/ to provide it. As for where do you find it, the default is http://localhost:8983/solr/collection1/query - Thanks, Michael -- View this message in context: http://lucene.472066.n3.nabble.com/Newbie-to-Solr-tp4092876p4098041.ht

Re: Solr Update URI is not found

2013-10-28 Thread Erick Erickson
This seems like a better question for the Nutch list. I see hadoop in there, so unless you've specifically configured solr to use the HDFS directory writer factory, this has to be coming from someplace else. And there are map/reduce tasks in here. BTW, it would be more helpful if you posted the UR

Re: When is/should qf different from pf?

2013-10-28 Thread Erick Erickson
The facetious answer is "when phrases aren't important in the fields". If you're doing a simple boolean match, adding phrase fields will add expense, to no good purpose etc. Phrases on numeric fields seems wrong. FWIW, Erick On Mon, Oct 28, 2013 at 1:03 AM, Amit Nithian wrote: > Hi all, > > I

Re: Newbie to Solr

2013-10-28 Thread Mamta Alshi
how do I get the solr admin web user interface? On Mon, Oct 28, 2013 at 2:32 PM, Mamta Alshi wrote: > Hi Michael, > > Thanks for the prompt response. Have a look at my attached admin user > interfaces. > > I do not quite see the options you mention. > > > On Mon, Oct 28, 2013 at 2:18 PM, michae

Re: Newbie to Solr

2013-10-28 Thread Mamta Alshi
Hi Michael, Thanks for the prompt response. Have a look at my attached admin user interfaces. I do not quite see the options you mention. On Mon, Oct 28, 2013 at 2:18 PM, michael.boom wrote: > Put "*:*" in the q field > Then check the facet check box (look lower close to the Execute button) a

Re: Background merge errors with Solr 4.4.0 on Optimize call

2013-10-28 Thread Erick Erickson
For Tomcat, the Solr is often put into catalina.out as a default, so the output might be there. You can configure Solr to send the logs most anywhere you please, but without some specific setup on your part the log output just goes to the default for the servlet. I took a quick glance at the code

Re: AW: AW: auto completion search with solr using NGrams in SOLR

2013-10-28 Thread anurag.sharma
Hi ... I am trying to build autocomplete functionality using your post. But I am getting the following error *2577 [coreLoadExecutor-3-thread-1] WARN org.apache.solr.spelling.suggest.Suggester – Loading stored lookup data failed java.io.FileNotFoundException: /home/anurag/Downloads/solr-4.4.0/ex

Re: Newbie to Solr

2013-10-28 Thread michael.boom
Put "*:*" in the q field Then check the facet check box (look lower close to the Execute button) and in the facet.field insert "Name". This should do the trick. - Thanks, Michael -- View this message in context: http://lucene.472066.n3.nabble.com/Newbie-to-Solr-tp4092876p4098031.html Sent f

Re: Newbie to Solr

2013-10-28 Thread Mamta Alshi
Hi Alex, I have been able to run a few simple queries with my own schema.xml and data file. My concern now is that i'm able to run queries like http://localhost:8983/solr/select/?q=*:* http://localhost:8983/solr/select/?q=*:*&facet=true&facet.field=Name from the url However, when I try to run

error in suggester component in solr

2013-10-28 Thread anurag.sharma
I am working with solr auto complete functionality,I am using solr 4.50 to build my application, and I am following this link as a reference. http://lucene.472066.n3.nabble.com/auto-completion-search-with-solr-using-NGrams-in-SOLR-td3998559i20.html My suggest component is something like this

Data import handler with multi tables

2013-10-28 Thread dtphat
Hi, I wanna to import many tables from MySQL. Assume that, I have two tables: *** Tables 1: tbl_tableA(id, nameA) with data (1, A1), (2, A2), (3, A3). *** Tables 2: tbl_tableB(id, nameB) with data (1, B1), (2, B2), (3, B3), (4, B4), (5, B5). I configure:    

Re: Solr For

2013-10-28 Thread michael.boom
You're describing two different entities: Job and Employee. Since they are clearly different in any way you will need two different cores with two different schemas. - Thanks, Michael -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-For-tp4097928p4098025.html Sent fr

Re: One of all shard stopping, all shards stop

2013-10-28 Thread michael.boom
When one of your shards dies, your index becomes incomplete. By default the querying is distributed (on all shards - &distrib=true) and if one of them (shard X) is down, then you get an error stating that there are "no servers hosting shard X". If the other shards are still up you can query them d

Re: Optimal interval for soft commit

2013-10-28 Thread michael.boom
How do you add the documents to the index - one by one, batches of n ? When do you do your commits ? Because 8k docs per day is not a lot. Depending on the above, commiting with softCommit=true might also be a solution. - Thanks, Michael -- View this message in context: http://lucene.472066

Re: Solr search in case the first keyword are not index

2013-10-28 Thread dtphat
I have solve it. Thanks. - Phat T. Dong -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-search-in-case-the-first-keyword-are-not-index-tp4097699p4098021.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: SolrCloud: optimizing a core triggers optimizations of all cores in that collection?

2013-10-28 Thread michael.boom
Thanks @Mark & @Erick Should I create a JIRA issue for this ? - Thanks, Michael -- View this message in context: http://lucene.472066.n3.nabble.com/SolrCloud-optimizing-a-core-triggers-optimization-of-another-tp4097499p4098020.html Sent from the Solr - User mailing list archive at Nabble.c

Re: Compound words

2013-10-28 Thread Parvesh Garg
One more thing, Is there a way to remove my "accidentally sent phone number in the signature" from the previous mail? aarrrggghhh

Compound words

2013-10-28 Thread Parvesh Garg
Hi, I'm an infant in Solr/Lucene family, just a couple of months old. We are trying to find a way to combine words into a single compound word at index and query time. E.g. if the document has "sea bird" in it, it should be indexed as seabird and any query having sea bird in it should also look f

Optimal interval for soft commit

2013-10-28 Thread Mugoma Joseph O.
Hello, We have solr index with about 1m docs. Every day we add 5,000 to 8,000 docs. We have defined 15 sec interval for soft commit. But for the impatient user 15 secs looks like eternity. The wiki http://wiki.apache.org/solr/NearRealtimeSearch advises on 1s soft commit interval but warns "Be s

One of all shard stopping, all shards stop

2013-10-28 Thread hongkeun.yoo
Hi. I hava 3 shard solr cloud version 4.4.0 not replication. for example, if one shard(leader) died for OOM, all shard is stop. is it just the way that it is? I want to find a option this problem. I want to change if 1 shard died, rem