Re: Question about autoAddReplicas

2017-03-31 Thread Sumit Nigam
Hi all, I have exactly the same problem as mentioned in this thread. I would assume that handling the stale write lock should be automatically handled by this feature of add replica automatically. Can anyone provide inputs on what is missing (in configuration or otherwise) for auto add replicas t

Suggestions with EdgeNGramFilterFactory and FuzzyLookupFactory

2017-03-31 Thread Alexis Aravena Silva
Hello All, I'm using the suggester component in Solr 6.4 with FuzzyLookupFactory and AnalyzingInfixLookupFactory, everything was ok until added EdgeNGramFilterFactory to my field type definition, after loading 8 documents, I index manually, the process of indexing consumes 16GB of my hard disk

Re: Searchable archive of this mailing list

2017-03-31 Thread Alexandre Rafalovitch
I love the catch-22 of this question. It is absolutely useless to exactly the people who could benefit from it most :-) Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 31 March 2017 at 18:07, Erick Erickson wrote: > Couple of other places: >

Re: Is CloudSolrClient thread-safe?

2017-03-31 Thread Ishan Chattopadhyaya
> I use SolrJ 6.4.0 and SolrCloud 6.4.0 Please upgrade to 6.4.2 immediately. 6.4.0 has major performance problems. On Wed, Mar 29, 2017 at 12:12 AM, Mikhail Ibraheem < mikhail.ibrah...@oracle.com> wrote: > Thanks Shawn so much. > I use SolrJ 6.4.0 and SolrCloud 6.4.0 > The code is very simple: >

solr learning_to_rank (normalizer) unmatched argument type issue

2017-03-31 Thread Jianxiong Dong
Hi, I created a toy learning-to-rank model in solr in order to show the issues. Feature.json - [ { "store" : "wikiFeatureStore", "name" : "doc_len", "class" : "org.apache.solr.ltr.feature.FieldLengthFeature", "params" : {"field":"a_text"} }, { "store" : "wikiF

Re: Searchable archive of this mailing list

2017-03-31 Thread Erick Erickson
Couple of other places: http://jirasearch.mikemccandless.com/search.py?index=jira for JIRAs that have already been raised if you see behavior you think is a bug. and https://lucidworks.com/resources/searchhub Best, Erick On Fri, Mar 31, 2017 at 2:29 PM, Walter Underwood wrote: > MarkMail is a

Re: Searchable archive of this mailing list

2017-03-31 Thread Walter Underwood
MarkMail is also good. http://markmail.org/search/?q=solr-user#query:solr-user%20list%3Aorg.apache.lucene.solr-user+page:1+state:facets wunder Walter Underwood wun...@wunderwood.org ht

Re: Searchable archive of this mailing list

2017-03-31 Thread Kevin Risden
Google usually does a pretty good job of indexing this mailing list. The other place I'll usually go is here: http://search-lucene.com/?project=Solr Kevin Risden On Fri, Mar 31, 2017 at 4:18 PM, OTH wrote: > Hi all, > > Is there a searchable archive of this mailing list? > > I'm asking just so

Searchable archive of this mailing list

2017-03-31 Thread OTH
Hi all, Is there a searchable archive of this mailing list? I'm asking just so I don't have to post a question in the future which may have been answered before already. Thanks

Re: Solr performance issue on indexing

2017-03-31 Thread Erick Erickson
If, by chance, the docs you're sending get routed to different Solr nodes then all the processing is in parallel. I don't know if there's a good way to insure that the docs get sent to different replicas on different Solr instances. You could try addressing specific Solr replicas, something like "b

Re: Solr performance issue on indexing

2017-03-31 Thread tstusr
Hi, thanks for the feedback. Yes, it is about OOM, indeed even solr instance makes unavailable. As I was saying I can't find more relevant information on logs. We're are able to increment JVM amout, so, the first thing we'll do will be that. As far as I know, all documents are bounded to that am

Phrase Fields performance

2017-03-31 Thread David Hastings
Hey all. I ran into an issue recently. I have a rather large index and in my application I had defined values for the "pf" parameter, but I had commented it out years ago not really knowing why I did it. Obviously the point of the pf helps rank fields higher if they are in close proximity, and th

Re: Solr performance issue on indexing

2017-03-31 Thread Erick Erickson
First, running multiple threads with PDF files to a Solr running 4G of JVM is...ambitious. You say it crashes; how? OOMs? Second while the extracting request handler is a fine way to get up and running, any problems with Tika will affect Solr. Tika does a great job of extraction, but there are so

RE: SOLr 6.2.1, dealing with the redirected SOLr web admin

2017-03-31 Thread Stewart, Scott A. CTR OSD/DoDEA
It seems to be working once I created a dummy core... Thanks sas Scott Stewart, ACE-CF, ACI-CF. CSM, CTT+ Senior Systems Engineer -- IBSS Corporation DoD Education Activity 4800 Mark Center Drive, Alexandria, Va 22350 PH: 571.372.0602 Email:scott.

RE: SOLr 6.2.1, dealing with the redirected SOLr web admin

2017-03-31 Thread Stewart, Scott A. CTR OSD/DoDEA
Thanks Alex, I'm getting a little further with this Scott Stewart, ACE-CF, ACI-CF. CSM, CTT+ Senior Systems Engineer -- IBSS Corporation DoD Education Activity 4800 Mark Center Drive, Alexandria, Va 22350 PH: 571.372.0602 Email:scott.stewart@hq

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Yonik Seeley
On Fri, Mar 31, 2017 at 1:53 PM, Nilesh Kamani wrote: > @Alexandre - Could you please point me to reference doc to remove default > cache settings ? > > @Yonik - The code change is in Solr Indexer to sort the results. OK, so to test indexing performance, there are no caches to worry about (as lon

storing the analyzed value

2017-03-31 Thread John Blythe
hey all i'm wanting to store one of my field's analyzed token for retrieval. is there any way to do this? the preliminary googling i'd done had discussions from 2007-2010, i didn't notice anything very recent touching on the concept. thanks-

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Nilesh Kamani
@Alexandre - Could you please point me to reference doc to remove default cache settings ? @Yonik - The code change is in Solr Indexer to sort the results. On Fri, Mar 31, 2017 at 10:34 AM, Yonik Seeley wrote: > On Fri, Mar 31, 2017 at 9:44 AM, Nilesh Kamani > wrote: > > I am planning to do lo

Solr performance issue on indexing

2017-03-31 Thread tstusr
Hi there. We are currently indexing some PDF files, the main handler to index is /extract where we perform simple processing (extract relevant fields and store on some fields). The PDF files are about 10M~100M size and we have to have available the text extracted. So, everything works correct on

Re: Fieldtype json supported in SOLR 5.4.0 or 5.4.1

2017-03-31 Thread Abhijit Pawar
Hi Rick, I tried installing SOLR 6.0 since SOLR 6.0 has managed-schema and tried index the data from mongoDB : ​This is the error I get : getNext() failed for query 'SELECT orgidStr,idStr,name,code,description,price,images,categoriesStr,enddate_solar,begin_date_solar,status_solar,c

Re: SOLr 6.2.1, dealing with the redirected SOLr web admin

2017-03-31 Thread Alexandre Rafalovitch
The Admin UI is a Javascript interface over the API calls Solr provides. The # part is Angular.JS way of doing UI state management. You are very unlikely to need to provide the Admin UI URL to any system. They will talk directly to Solr at /solr/collectionname/handlername, e.g. /solr/atom/select?q

Re: Indexing speed reduced significantly with OCR

2017-03-31 Thread Zheng Lin Edwin Yeo
This is my comparison of the indexing speed with and without Tesseract OCR. The smaller file is taking longer to index, probably because there are more text to do the OCR, as compared to the bigger file, which has lesser text. Is that usually the case? *With Tesseract OCR* 174KB - 5.20 sec 446KB

SOLr 6.2.1, dealing with the redirected SOLr web admin

2017-03-31 Thread Stewart, Scott A. CTR OSD/DoDEA
Hi all, I'm another SOLr virgin. :p I've got version 6.2.1 set up on a Windows 2012 Ent. Server, even have it running as a Windows service. My issue is with the redirect that happens when you got to the admin url. My instance is set up as localhost:8983, when I go to that page it redirects to

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Yonik Seeley
On Fri, Mar 31, 2017 at 9:44 AM, Nilesh Kamani wrote: > I am planning to do load testing for some of my code changes and I need to > disable all kind of caching. Perhaps you should be aiming to either: 1) seek a config + query load that maximizes time spent in your code in order to optimize it 2)

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Alexandre Rafalovitch
I think there are default caching settings. You may need to explicitly disable them. Regards, Alex On 31 Mar 2017 9:44 AM, "Nilesh Kamani" wrote: > Hello All, > > I am planning to do load testing for some of my code changes and I need to > disable all kind of caching. > > I removed all cachi

Re: Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Charlie Hull
On 31/03/2017 14:44, Nilesh Kamani wrote: Hello All, I am planning to do load testing for some of my code changes and I need to disable all kind of caching. I removed all caching related elements from solr config (in zookeeper). This is the document I referred. https://cwiki.apache.org/confluen

Disable All kind of caching in Solr/Lucene

2017-03-31 Thread Nilesh Kamani
Hello All, I am planning to do load testing for some of my code changes and I need to disable all kind of caching. I removed all caching related elements from solr config (in zookeeper). This is the document I referred. https://cwiki.apache.org/confluence/display/solr/Query+Settings+in+SolrConfig

Re: Solr Shingle is not working properly in solr 6.5.0

2017-03-31 Thread Aman Deep Singh
Hi Rich, Query creation is correct only thing what causing the problem is that Boolean + query while building the lucene query which causing all tokens to be matched in the document (equivalent of mm=100%) even though I use mm=1 it was using BOOLEAN + query as normal query one plus one abc Lucene q

Re: Solr Shingle is not working properly in solr 6.5.0

2017-03-31 Thread Rick Leir
Hi Aman Did you try the Admin Analysis tool? It will show you which filters are effective at index and query time. It will help you understand why you are not getting a mach. Cheers -- Rick On March 31, 2017 2:36:33 AM EDT, Aman Deep Singh wrote: >Hi, >I was trying to use the shingle filter bu