Re: Reading files in default Conf dir

2014-09-10 Thread Jorge Luis Betancourt Gonzalez
What are you developing a custom search component? update processor? a different class for one of the zillion moving parts of Solr? If you have access to a SolrCore instance you could use it to get access of, essentially using the SolrCore instance specific to the current core will cause the l

FileNotFoundException, Error closing IndexWriter, Error opening new searcher

2014-09-10 Thread Oliver Schrenk
Hi, in the last few days we had some troubles with one of our clusters (5 machines each running 4.7.2 inside jetty container, no replication, Java 1.7.21). Two time we had troubles to restart one server (same machine) because of some FileNotFoundException. 1. First time: Stopping Solr while

Solr Spellcheck suggestions only return from /select handler when returning search results

2014-09-10 Thread Thomas Michael Engelke
Hi, I'm experimenting with the Spellcheck component and have therefor used the example configuration for spell checking to try things out. My solrconfig.xml looks like this: spell default spell solr.DirectSolrSpellChecker internal wordbreak solr.WordBreakSolrSpellChecker

Re: Integrate solr with openNLP

2014-09-10 Thread Aman Tandon
Hi, What is the progress of integration of nlp with solr. If you have achieved this integration techniques successfully then please share with us. With Regards Aman Tandon On Tue, Jun 10, 2014 at 11:04 AM, Vivekanand Ittigi wrote: > Hi Aman, > > Yeah, We are also thinking the same. Using UIMA

Genre classification/Document classification for apache solr

2014-09-10 Thread vineet yadav
Hi, I want to crawl links and want to identify if link is company website. For example, If I use word 'financial advisory' in google search engine. I will get list of urls in search result. Some of links are company website. I want to identify those links which are company website and index them in

Re: Edismax mm and efficiency

2014-09-10 Thread Peter Keegan
I implemented a custom QueryComponent that issues the edismax query with mm=100%, and if no results are found, it reissues the query with mm=1. This doubled our query throughput (compared to mm=1 always), as we do some expensive RankQuery processing. For your very long student queries, mm=100% woul

Re: Integrate solr with openNLP

2014-09-10 Thread Vivekanand Ittigi
Actually we dropped integrating nlp with solr but we took two different ideas: * we're using nlp seperately not with solr * we're taking help of UIMA for solr. Its more advanced. If you've a specific question. you can ask me. I'll tell you if i know. -Vivek On Wed, Sep 10, 2014 at 3:46 PM, Aman

Installing solr on tomcat 7.x | Window 8

2014-09-10 Thread Umesh Awasthi
I am trying to follow official document as well other resource available on the net but unable to run solr on my tomcat. I am trying to install and run `solr-4.10.0` on tomcat. this is what I have done so far 1. Copy solr-4.10.0.war to tomcat web-app and renamed it to solr.war. 2. Created a fol

Problem while extending TokenizerFactory in Solr 4.4.0

2014-09-10 Thread Francesco Valentini
Hi All, I’m using Solr 4.4.0 distro and now, I have a strange issue while extending TokenizerFactory with a custom class. This is an excerpt of pom I use: 4.4.0 org.apache.lucene *lucene*-core ${solr.version} org.apache.lucene *lucene*-anal

RE: Solr Spellcheck suggestions only return from /select handler when returning search results

2014-09-10 Thread Dyer, James
Thomas, It looks like you've set things up correctly in that while the user is searching against a stemmed field ("name"), spellcheck is checking against a lightly-analyzed copy of it ("spell"). This is the right way to do it as spellcheck against stemmed forms is usually undesirable. But as

Re: Problem while extending TokenizerFactory in Solr 4.4.0

2014-09-10 Thread Shawn Heisey
On 9/10/2014 7:14 AM, Francesco Valentini wrote: > I’m using Solr 4.4.0 distro and now, I have a strange issue while > extending TokenizerFactory with a custom class. I think what we have here is a basic Java error, nothing specific to Solr. This jumps out at me: Caused by: java.lang.NoSuchMet

Re: Installing solr on tomcat 7.x | Window 8

2014-09-10 Thread Shawn Heisey
On 9/10/2014 6:45 AM, Umesh Awasthi wrote: > I am trying to follow official document as well other resource available on > the net but unable to run solr on my tomcat. > > I am trying to install and run `solr-4.10.0` on tomcat. this is what I have > done so far > > 1. Copy solr-4.10.0.war to tom

Re: Problem while extending TokenizerFactory in Solr 4.4.0

2014-09-10 Thread Francesco Valentini
Hi Shawn, thank you very much for your quick anwser, I fixed it. Thanks Francesco 2014-09-10 15:34 GMT+02:00 Shawn Heisey : > On 9/10/2014 7:14 AM, Francesco Valentini wrote: > > I’m using Solr 4.4.0 distro and now, I have a strange issue while > > extending TokenizerFactory with a custom clas

Modify Schema - Schema API

2014-09-10 Thread Joseph Obernberger
In addition to adding new fields to the schema, is there a way to modify an existing field? If I created a field called userID as a long, but decided later that it should be a string? Thank you! -Joe

Re: Edismax mm and efficiency

2014-09-10 Thread Walter Underwood
We do that strict/loose query sequence, but on the client side with two requests. Would you consider contributing the QueryComponent? wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Sep 10, 2014, at 3:47 AM, Peter Keegan wrote: > I implemented a custom QueryCo

Re: Modify Schema - Schema API

2014-09-10 Thread Anshum Gupta
Hi Joseph, It isn't supported by an exiting REST API (if that was your question) but you can always edit the schema manually (if it isn't managed), upload the new schema and reload the collections (or cores in case of non-SolrCloud mode). Do remember that changing the field type might require you

Re: Modify Schema - Schema API

2014-09-10 Thread Joseph Obernberger
Thank you - yes that was my question. I should have stated that it was for SolrCloud and hence a managed schema. Could I bring down the shards, edit the managed schema on zookeeper, fire the shards back up and re-index? -Joe On Wed, Sep 10, 2014 at 11:50 AM, Anshum Gupta wrote: > Hi Joseph, >

Problems for indexing large documents on SolrCloud

2014-09-10 Thread Olivier
Hi, I have some problems for indexing large documents in a SolrCloud cluster of 3 servers (Solr 4.8.1) with 3 shards and 2 replicas for each shard on Tomcat 7. For a specific document (with 300 K values in a multivalued field), I couldn't index it on SolrCloud but I could do it in a single insta

Re: Wildcard in FL parameter not working with Solr 4.10.0

2014-09-10 Thread Mike Hugo
This may have been introduced by changes made to solve https://issues.apache.org/jira/browse/SOLR-5968 I created https://issues.apache.org/jira/browse/SOLR-6501 to track the new bug. On Tue, Sep 9, 2014 at 4:53 PM, Mike Hugo wrote: > Hello, > > With Solr 4.7 we had some queries that return dyna

Re: Modify Schema - Schema API

2014-09-10 Thread Anshum Gupta
You don't need to bring down the shards/collections, instead here's what you can do: * Retain the filename ("managed_schema", if you didn't change the default resource name). * Edit the file locally * Upload it to replace the current zk file. * Reload the collection(s). * Reindex Here's another th

RE: [Announce] Apache Solr 4.10 with RankingAlgorithm 1.5.4 available now with complex-lsa algorithm (simulates human language acquisition and recognition)

2014-09-10 Thread nnagarajayya
Hi Deigo: Not sure of solr-sense, but complex-lsa is an enhanced lsa implementation with TERM-DOCUMENT Similarity, etc. (not found in lsa). The relevance/ranking is again different and is more accurate as it uses the RankingAlgorithm scoring model. The query performance gain with this version

Re: Modify Schema - Schema API

2014-09-10 Thread Joseph Obernberger
Wow - that's really cool! Thank you! -Joe On Wed, Sep 10, 2014 at 12:29 PM, Anshum Gupta wrote: > You don't need to bring down the shards/collections, instead here's what > you can do: > * Retain the filename ("managed_schema", if you didn't change the default > resource name). > * Edit the fi

Re: Edismax mm and efficiency

2014-09-10 Thread Peter Keegan
Sure. I created SOLR-6502. The tricky part was handling the behavior in a sharded index. When the index is sharded. the response from each shard will contain a parameter that indicates if the search results are from the conjunction of all keywords (mm=100%), or from disjunction (mm=1). If the shard

Re: Reading files in default Conf dir

2014-09-10 Thread Ramana OpenSource
Thank you for the inputs Jorge. Now i am getting the ResourceLoader using SolrCore API. Before: return new HashSet(new SolrResourceLoader(null). getLines("stopwords.txt")); After: return new HashSet(core.getResourceLoader().getLines( "stopwords.txt")); I am able to load the resource successfull

How to get access to SolrCore in init method of Handler Class

2014-09-10 Thread Ramana OpenSource
Hi, I need to load a file in instance's conf directory and this data is going to be used in handleRequestBody() implementation. As of now, i am loading the file in the handleRequestBody method like below. SolrCore solrCore = req.getCore(); solrCore .getResourceLoader().getLines(fileToLoad); But,

Re: How to get access to SolrCore in init method of Handler Class

2014-09-10 Thread Chris Hostetter
: But, To make it better, I would like to load this file only once and in the : init() method of handler class. I am not sure how to get the access of : SolrCore in the init method. you can't access the SolrCore during hte init() method, because at the time it's called the SolrCore itself is not

Inconsistent relevancy score between browser refreshes

2014-09-10 Thread Tao, Jing
I am seeing different relevancy scores for the same documents, between browser refreshes. Any ideas why? The query is the same, index is the same - why would score change? Example: First request returns: Stroke Anticoagulation and Prophylaxis 3.463463 Hemorrhagic Stroke 3.463463 Vertebrob

Re: Creating Solr servers dynamically in Multicore folder

2014-09-10 Thread Erick Erickson
You should be good to go. Do note that you can the variables that were defined in your schema.xml in the individual core.properties file for the core in question if you need to, although the defaults work for most people's needs. Best, Erick On Tue, Sep 9, 2014 at 9:15 PM, nishwanth wrote: > He

Re: Problem while extending TokenizerFactory in Solr 4.4.0

2014-09-10 Thread Erick Erickson
Francesco: What was the fix? It'll help others with the same issue. On Wed, Sep 10, 2014 at 6:53 AM, Francesco Valentini wrote: > Hi Shawn, > thank you very much for your quick anwser, > I fixed it. > > Thanks > Francesco > > 2014-09-10 15:34 GMT+02:00 Shawn Heisey : > >> On 9/10/2014 7:14 A

Re: Problems for indexing large documents on SolrCloud

2014-09-10 Thread Erick Erickson
bq: org.apache.solr.common.SolrException: Unexpected end of input block; expected an identifier This is very often an indication that your packets are being truncated by "something in the chain". In your case, make sure that Tomcat is configured to handle inputs of the size that you're sending. T

Re: Inconsistent relevancy score between browser refreshes

2014-09-10 Thread Erick Erickson
More info please. 1> Are there replicas involved? 2> Is there any indexing going on? 3> If more than one node, did you optimize? 4> Did you optimize between refreshes? Best, Erick On Wed, Sep 10, 2014 at 12:28 PM, Tao, Jing wrote: > I am seeing different relevancy scores for the same documents,

RE: Inconsistent relevancy score between browser refreshes

2014-09-10 Thread Tao, Jing
1) It is a SolrCloud setup on 4 servers, 4 shards, replication factor of 2. 2) There is no indexing going on. 3) No, I did not optimize. 4) Did not optimize between refreshes. Thanks, Jing -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Wednesday, September

Re: ExtractingRequestHandler indexing zip files

2014-09-10 Thread keeblerh
Thanks for the info Sergio. I updated my 4.8.1 version with that patch and SOLR 4216 (which was really the same thing). It took a day to get it to compile on my network and it still doesn't work. Did my config file look correct? I'm wondering if I need another param somewhere. "Patch has to be

Re: Solr WARN Log

2014-09-10 Thread Chris Hostetter
: > I'm trying to upgrade Solr from version 4.2 to 4.9, since then I'm ... : haven't configured it. You can ignore this message. To get it to go The fact that a WARN is logged at all was a bug in 4.9 that got fixed in 4.10... https://issues.apache.org/jira/browse/SOLR-6179 -Hoss http

Re: Problems for indexing large documents on SolrCloud

2014-09-10 Thread Shawn Heisey
On 9/10/2014 2:05 PM, Erick Erickson wrote: > bq: org.apache.solr.common.SolrException: Unexpected end of input > block; expected an identifier > > This is very often an indication that your packets are being > truncated by "something in the chain". In your case, make sure > that Tomcat is configur

Re: How to get access to SolrCore in init method of Handler Class

2014-09-10 Thread Ramana OpenSource
Thanks Chris. I have implemented SolrCoreAware interface and loading the required file in the inform method. Thanks, Ramana. On Wed, Sep 10, 2014 at 10:59 PM, Chris Hostetter wrote: > > : But, To make it better, I would like to load this file only once and in > the > : init() method of handler

Re: Creating Solr servers dynamically in Multicore folder

2014-09-10 Thread nishwanth
Hello Erick, Thanks for the response I have attached the core.properties and solr.xml for your reference. . solr.xml core.properties Below is our plan on the crea