Re: solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-23 Thread Erick Erickson
Hmmm, does your master have an index? In other words have you added anything to it? I actually doubt that's an issue, but An aside, a polling interval of 20 seconds is rather short, beware of your autowarming time exceeding your index updates But my _first_ guess is that somehow you're So

Re: java 1.6 requirement not documented clearly?

2012-04-23 Thread jmlucjav
oh, then it should work with 1.5?? OK i know what happened then. I did not see it happening myself, but he unzipped 3.6, started solr with the example config and got the error. He had java1.5, so I told him to upgrade and it worked, so I assumed Solr required 1.6 But this was in a linux box, so mo

Re: Deciding whether to stem at query time

2012-04-23 Thread Walter Underwood
Right. Stemming is less useful for author fields, you don't need to match "bill gate" or "steve job". Also, if you want to do fuzzy matching, you should only do that on the exact fields, not the stemmed fields. wunder On Apr 23, 2012, at 3:45 PM, Michael Sokolov wrote: > Yes, and you might ch

Re: Deciding whether to stem at query time

2012-04-23 Thread Michael Sokolov
Yes, and you might choose to use different options for different fields. For dictionary searches, where users are searching for specific words, and a high degree of precision is called for, stemming is less helpful, but for full text searches, more so. -Mike On 4/23/2012 3:35 PM, Walter Unde

Re: java 1.6 requirement not documented clearly?

2012-04-23 Thread Chris Hostetter
: Both wiki http://wiki.apache.org/solr/SolrInstall and tutorial : http://lucene.apache.org/solr/api/doc-files/tutorial.html state java 1.5 is : required, but trying to run solr3.6 with java 1.5 was giving some cryptic : error to a colleague. You'll have to be more specific about what you (or your

java 1.6 requirement not documented clearly?

2012-04-23 Thread jmlucjav
Both wiki http://wiki.apache.org/solr/SolrInstall and tutorial http://lucene.apache.org/solr/api/doc-files/tutorial.html state java 1.5 is required, but trying to run solr3.6 with java 1.5 was giving some cryptic error to a colleague. xab -- View this message in context: http://lucene.472066.n3.

Re: Language Identification

2012-04-23 Thread Jan Høydahl
I think nothing has "moved". We just offer Solr users to do language detection inside of Solr, using any of these two libs. If you choose to do language detection on client side instead, using any of these, what is stopping you? -- Jan Høydahl, search solution architect Cominvent AS - www.cominv

Re: FastVectorHighlighter -> no highlights

2012-04-23 Thread Jeffrey Schmidt
This does not appear to be shingle specific. A non-shingled field is also NOT highlighted in the same manner with FVH. I can see in the timing information that it takes much longer to run FVH than no highlighting at all, so Solr must be doing something. But why it just lists the document IDs

Re: null pointer error with solr deduplication

2012-04-23 Thread Peter Markey
Thanks for the response. Yes, I agree with you that I have to check for the uniqueness of doc ids but our requirement is such that we need to send it to solr and I know that solr discards duplicate documents and it does not work fine when we manually create the unique id. But I just wanted to repor

RE: Apache Tomcat 6 service terminated unexpectedly. It has done this 2 time(s).

2012-04-23 Thread Husain, Yavar
I am sorry, i should have raised this issue on tomcat forums. However just was trying my luck here as it was indirectly related to solr. From: Husain, Yavar Sent: Monday, April 23, 2012 11:07 PM To: solr-user@lucene.apache.org Subject: Apache Tomcat 6 servi

Re: Language Identification

2012-04-23 Thread Robert Muir
On Mon, Apr 23, 2012 at 1:27 PM, Bai Shen wrote: > I was under the impression that solr does Tika and the language identifier > that Shuyo did.  The page at > http://wiki.apache.org/solr/LanguageDetectionlists them both. > > class="org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProc

Re: Spatial4j

2012-04-23 Thread Eric Grobler
Thank you David, it is fantastic what people like you do for the Solr community. On Mon, Apr 23, 2012 at 8:08 PM, David Smiley (@MITRE.org) < dsmi...@mitre.org> wrote: > Yes, I definitely think so. At a minimum, I expect there will at least be > a > patch or built jar file for you to get going

Re: Deciding whether to stem at query time

2012-04-23 Thread Walter Underwood
There is a third approach. Create two fields and always query both of them, with the exact field given a higher weight. This works great and performs well. It is what we did at Netflix and what I'm doing at Chegg. wunder On Apr 23, 2012, at 12:21 PM, Andrew Wagner wrote: > So I just realized t

Re: Spatial4j

2012-04-23 Thread David Smiley (@MITRE.org)
Yes, I definitely think so. At a minimum, I expect there will at least be a patch or built jar file for you to get going by 1 June. - Author: http://www.packtpub.com/apache-solr-3-enterprise-search-server/book -- View this message in context: http://lucene.472066.n3.nabble.com/Spatial4j-tp3

Re: Solr Core Admin Question on Trunk

2012-04-23 Thread Jamie Johnson
No problem, created this. https://issues.apache.org/jira/browse/SOLR-3401 and related to 2605. On Mon, Apr 23, 2012 at 11:39 AM, Stefan Matheis wrote: > Jamie, right .. that makes sense. right now the core-admin will not work in > singlecore-mode because we have no core-name there. > https://i

Re: # open files with SolrCloud

2012-04-23 Thread Gopal Patwa
Great! I am going to try new Solr 4 build from April 23rd On Sun, Apr 22, 2012 at 11:35 PM, Sami Siren wrote: > On Sat, Apr 21, 2012 at 9:57 PM, Yonik Seeley > wrote: > > I can reproduce some kind of searcher leak issue here, even w/o > > SolrCloud, and I've opened > > https://issues.apache.org

Apache Tomcat 6 service terminated unexpectedly. It has done this 2 time(s).

2012-04-23 Thread Husain, Yavar
Solr 3.5 was not returning results. To my surprise Tomcat 6.x (64 bit) was not running on my Windows. There were absolutely no errors in the logs, no crash dumps nothing. I restarted it and everything seems to be fine now. Went to the Windows Event viewer and exported the following information

Re: Language Identification

2012-04-23 Thread Bai Shen
I was under the impression that solr does Tika and the language identifier that Shuyo did. The page at http://wiki.apache.org/solr/LanguageDetectionlists them both. Again, I'm just trying to understand why it was moved to solr. On Fri, Apr 20, 2012 at 6:02 PM, Jan Høydahl wrote: > Hi, > >

Kernel methods in SOLR

2012-04-23 Thread Peyman Faratin
Hi Has there been any work that tries to integrate Kernel methods [1] with SOLR? I am interested in using kernel methods to solve synonym, hyponym and polysemous (disambiguation) problems which SOLR's Vector space model ("bag of words") does not capture. For example, imagine we have only 3 wo

Re: Spatial4j

2012-04-23 Thread Eric Grobler
Hi David, Thank you for the information. I am glad to hear that is basically ready to be integrated into lucene. Regarding your backlog, is it realistic to expect 3304 resolved before June? Best Regards Ericz On Mon, Apr 23, 2012 at 4:38 PM, Smiley, David W. wrote: > Ericz, > > See this issue

solr replication failing with error: Master at: is not available. Index fetch failed

2012-04-23 Thread geeky2
hello all, enviornment: centOS and solr 3.5 i am attempting to set up replication betweeen two solr boxes (master and slave). i am getting the following in the logs on the slave box. 2012-04-23 10:54:59,985 SEVERE [org.apache.solr.handler.SnapPuller] (pool-12-thread-1) Master at: http://someip:

Re: Solr Core Admin Question on Trunk

2012-04-23 Thread Stefan Matheis
Jamie, right .. that makes sense. right now the core-admin will not work in singlecore-mode because we have no core-name there. https://issues.apache.org/jira/browse/SOLR-2605 should fix this, afterwards we can show the core-admin for every configuration. would you mind to open a ticket for tha

Re: Spatial4j

2012-04-23 Thread Smiley, David W.
Ericz, See this issue: https://issues.apache.org/jira/browse/SOLR-3304 It's just a TODO issue right now but when it's completed, you'll be able to do polygon spatial queries. All the software is written to do it right now but the missing Solr piece is temporarily at Spatial4j.com. If you were

Re: Solr Core Admin Question on Trunk

2012-04-23 Thread Jamie Johnson
So I believe I see the reason now. Basically in app.js we check to see if there is more than 1 core deployed to decide if we show the core admin or not. I am not sure this is intended or not, but I would think this isn't what we want the default action to be. Shouldn't we always show the core ad

Spatial4j

2012-04-23 Thread Eric Grobler
Hello Solr Community, We are interested in polygon spatial queries. I believe that Spatial4j supports it. Is there a solr branch available that includes Spatial4j? Will this be part of a furure solr release? Thank you. Best Regards Ericz

Re: The index speed in the solr

2012-04-23 Thread Smiley, David W.
On Apr 23, 2012, at 9:27 AM, Erick Erickson wrote: > 50 hours is a really long time for 2M docs though, so something > doesn't seem right unless the docs are really unusual. Don't forget he's n-gramming ;-) There's not much more demanding you could ask of text analysis except for throwing shin

RE: Performance problem with DIH in solr 3.3

2012-04-23 Thread Dyer, James
See this page for an alternate way to use DIH for Delta updates that does not generate n+1 Selects: http://wiki.apache.org/solr/DataImportHandlerDeltaQueryViaFullImport James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: Pravin Agrawal [mailto:pra

Re: Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: com.mysql.jdbc.CommunicationsException: Communications link failure due to underlying exception

2012-04-23 Thread Jeevanandam
On 23-04-2012 8:18 pm, sivaprasad wrote: Hi, When i am trying to index 16 millions of documents using dataimport handler, intermittently i am getting the below exception and the indexing get stopped. STACKTRACE: java.io.EOFException: Can not read response from server. Expected to read 4 by

Re: Using two repeater to rapidly switching Master and Slave (Replication)?

2012-04-23 Thread Jeevanandam
On 23-04-2012 10:28 am, A Vorderegger wrote: This setup would be highly convenient and perfect for the purpose of failing over the Master role however it does not work for me. Resolving http://slave_host:port/solr/replication?command=enablepoll I am met with: ERRORNo slave configured no matter

Full Import failed:org.apache.solr.handler.dataimport.DataImportHandlerException: com.mysql.jdbc.CommunicationsException: Communications link failure due to underlying exception

2012-04-23 Thread sivaprasad
Hi, When i am trying to index 16 millions of documents using dataimport handler, intermittently i am getting the below exception and the indexing get stopped. STACKTRACE: java.io.EOFException: Can not read response from server. Expected to read 4 bytes, read 0 bytes before connection was unexpec

Re: The index speed in the solr

2012-04-23 Thread Erick Erickson
Hard to say. Here's the basic approach I'd use to try to narrow it down: 1> take out ngrams. What does that do to your speed? 2> are you committing very often? Lengthen the time here if so. 3> Posting is probably not the more performant thing in world. Consider using SolrJ. 4> What does a docu

RE: StandardTokenizer and domain names containing digits

2012-04-23 Thread Steven A Rowe
Hi Alex, Thanks for reporting back with concrete details of what worked for you - very helpful for others with similar projects. Steve -Original Message- From: Alex Willmer [mailto:al.will...@logica.com] Sent: Monday, April 23, 2012 5:35 AM To: solr-user@lucene.apache.org Subject: Re:

Re: Solr Hanging

2012-04-23 Thread Mark Miller
And see https://issues.apache.org/jira/browse/SOLR-683 as it also may be related or have helpful info... On Apr 23, 2012, at 8:17 AM, Mark Miller wrote: > Perhaps related is > http://www.lucidimagination.com/search/document/6d0e168c82c86a38#45c945b2de6543f4 > > On Apr 23, 2012, at 5:37 AM, Try

Re: Solr Hanging

2012-04-23 Thread Mark Miller
Perhaps related is http://www.lucidimagination.com/search/document/6d0e168c82c86a38#45c945b2de6543f4 On Apr 23, 2012, at 5:37 AM, Trym R. Møller wrote: > Hi > > I have succeeded in reproducing the scenario with two Solr instances running. > They cover a single collection with two slices and tw

Synonyms file in solr

2012-04-23 Thread ggggGuys
I have some problems with the synonyms file, it seems i can't make it work the way i'd want. Here is an exemple : I have these words : cat, animal, dog, living thing, baby shark if i search for animal OR animals, i'd like to have the results for cat, animal, dog, baby shark as well as their plu

Re: null pointer error with solr deduplication

2012-04-23 Thread Mark Miller
A better error would be nicer. In the past, when I have had docs with the same id on multiple shards, I never saw an NPE problem. A lot has changed since then though. I guess, to me, checking if the id is stored sticks out a bit more. Roughly based on the stacktrace, it looks to me like it's not f

Performance problem with DIH in solr 3.3

2012-04-23 Thread Pravin Agrawal
Hi All, I am using Delta import handler(solr 3.3) to index data from my database (using 19 tables) Total Number of solr documents that get created from these 19 table is 444 Total number of request send to data source during clean full import is 91083. My problem is that, DIH makes too many

Facing problem to integrate UIMA in SOLR

2012-04-23 Thread dsy99
Hello all, I am facing problem to integrate the UIMA in SOLR. I followed the following steps, provided in README file shipped along with Uima to integrate it in Solr Step1. I set tags in solrconfig.xml appropriately to point the jar files. Step2. modified my "schema.xml" adding th

Re: Exception fixing docBase for context [error in opening zip file]

2012-04-23 Thread Yung-chung Lin
Hi, I have figured out this on my own. It was just a stupid permission thing. This error Exception fixing docBase for context java.util.zip.ZipException: error in opening zip file can be fixed by changing the permission of parent paths to 0755. find PARENT_PATH -type d -exec chmod 0755 {} \;

Re: Solr Hanging

2012-04-23 Thread Trym R. Møller
Hi I have succeeded in reproducing the scenario with two Solr instances running. They cover a single collection with two slices and two replica, two cores in each Solr instance. I have changed the number of threads that Jetty is allowed to use as follows: 3 3 0 And when indexing a single do

Re: StandardTokenizer and domain names containing digits

2012-04-23 Thread Alex Willmer
Steven A Rowe syr.edu> writes: > StandardTokenizer in Lucene/Solr v3.1+ implements the Word Boundary rules > from Unicode 6.0.0 Standard > Annex #29, a.k.a. UAX#29: . > These rules don't include recognition of URLs or domain nam

Re: 'Error 404: missing core name in path' in Solr

2012-04-23 Thread Jan Høydahl
Hi, Perhaps your search server uses a multi core setup? In that case you need your core name as part of the URL http://wiki.apache.org/solr/CoreAdmin#Example -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com Solr Training - www.solrtraining.com On 23. apr. 2012, at 01:5

Re: 'Error 404: missing core name in path' in Solr

2012-04-23 Thread Dan Tuffery
Looks like you need to select a core name on the admin UI before select search. Have a look in the solr.xml file in your solr home directory, what cores are defined? Solr is expecting the core name in the URL: http://localhost:8080/solr//admin/ On Mon, Apr 23