Re: Problem starting solr 6.5

2017-04-03 Thread Erick Erickson
Looks like a permissions issue. Best, Erick On Mon, Apr 3, 2017 at 7:42 PM, wlee wrote: > Try to start solr and get this error message. What is the problem ? > > > $ bin/solr start > > Exception in thread "main" java.nio.file.AccessDeniedException: > /usr/local/solr-6/solr-6.5.0/server/logs >

Re: How to Insert and retrieve data from specific shard in Solr 6.1.0

2017-04-03 Thread Erick Erickson
Why? By it's nature, SolrCloud usually doesn't care about what shard a document came from. Unless you use "implicit" routing, even _you_ don't know what shard the doc landed on. But if you insist, address the request to a particular _replica_ that happens to belong to the shard and add &distrib=fa

Re: Problem starting solr 6.5

2017-04-03 Thread Yasufumi Mizoguchi
Hi, I think you should check the permission of /usr/local/solr-6/solr-6.5.0/server/log (maybe, you do not have write permission on the directory) regards, Yasufumi On 2017/04/04 11:42, wlee wrote: Try to start solr and get this error message. What is the problem ? $ bin/solr start Exce

Re: Using Tesseract OCR to extract PDF files in EML file attachment

2017-04-03 Thread Rick Leir
Tesseract prolly knows nothing of the EML format. Your scripts could pull EML's apart. On April 4, 2017 2:00:19 AM EDT, Zheng Lin Edwin Yeo wrote: >Hi, > >Currently, I am able to extract scanned PDF images and index them to >Solr >using Tesseract OCR, although the speed is very slow. > >However

How to Insert and retrieve data from specific shard in Solr 6.1.0

2017-04-03 Thread Ketan Thanki
Hi, Please help for the below mention query. I need to insert/update data to specific shard and also retrieve the data from specific shard in solr 6.1.0. Pease let me for the configuration/code changes required for the existing solr collections. Regards, [CC Award Winners!]

Using Tesseract OCR to extract PDF files in EML file attachment

2017-04-03 Thread Zheng Lin Edwin Yeo
Hi, Currently, I am able to extract scanned PDF images and index them to Solr using Tesseract OCR, although the speed is very slow. However, for EML files with PDF attachments that consist of scanned images, the Tesseract OCR is not able to extract the text from those PDF attachments. Can we use

Problem starting solr 6.5

2017-04-03 Thread wlee
Try to start solr and get this error message. What is the problem ? $ bin/solr start Exception in thread "main" java.nio.file.AccessDeniedException: /usr/local/solr-6/solr-6.5.0/server/logs at sun.nio.fs.UnixException.translateToIOException(UnixException.java:84) at sun.nio.fs.UnixException.

Do streaming expressions support range facets?

2017-04-03 Thread adfel70
Specifically date ranges? I would like to perform some kind of OLAP cube on the data in solr, and looking at streaming expressions for this. -- View this message in context: http://lucene.472066.n3.nabble.com/Do-streaming-expressions-support-range-facets-tp4328233.html Sent from the Solr - Us

Re: edismax parsing confusion

2017-04-03 Thread Greg Pendlebury
eDismax uses 'mm', so knowing what that has been set to is important, or if it has been left unset/default you would need to consider whether 'q.op' has been set. Or the default operator from the config file. Ta, Greg On 3 April 2017 at 23:56, Steve Rowe wrote: > Hi Abhishek, > > Which version

Str vs Int/Long uniqueKey field performance

2017-04-03 Thread Dorian Hoxha
Hey friends, Is there any difference on the index-size/performance of having the 'uniqueKey` field a long vs a string ? Meaning, does it use a different data-structure ? Cause I remember elasticsearch always uses a string (since it adds the #type, which solr doesn't have). Regards, Dorian

Re: Solr Delete By Id Out of memory issue

2017-04-03 Thread Rohit Kanchan
Thanks everyone for replying to this issue. Just a final comment on this issue which I was closely working on. We have fixed this issue. It was a bug in our custom component which we wrote to convert delete by query to delete by id. We were using BytesRef differently, we were not making a deep copy

Handling of local params in QParserPlugin.createParser

2017-04-03 Thread Ken Krugler
Hi all, As part of some interesting work creating a custom query parser, I was writing unit tests that exercised ExtendedDismaxQParser. So I first created the ExtendedDismaxQParserPlugin, and then used that to create the QParser via: QParser parser = plugin.createParser(query, localPar

Ping handler queuing/problem?

2017-04-03 Thread Will Miller
I have 3 different SolrCloud clusters that share a single set (3) of zookeeper servers. Each SolrCloud cluster has its own set of collections stored on Zookeeper. Twice in the past week all 3 clusters have had about a 1 minute period where all requests stopped coming in. Solr recovers and furthe

Solr4 To Solr5 Upgrade Causing CPU issues

2017-04-03 Thread ~$alpha`
Hi Recently upgraded from solr4.0(tomcat) to solr5.3(tomcat) and my performance degraded to a very high level. Issues: My CPU consumption increased to the very high level of the same traffic. Note: My queries contains lots of in queries. What can be the issue here. I also tried solr6 with jet

NLP4j

2017-04-03 Thread Joel Bernstein
Hi, At Alfresco we've become interested in NLP4j. I think we'd prefer to use this library rather OpenNLP. This is due to some internal work that we've already done with NLP4j. LUCENE-2899 is working on an OpenNLP integration, but is not committed yet. This brings up two questions: 1) Does it ma

Solr 6.4.1 Issue

2017-04-03 Thread Islam Omar
Hi All , I have a trouble problem when doing *full import in solr 6.4.1 using MySQL DB , the problem is : * i need to create 1 core which will be around 9,500,000 documents , when i do full import with *batchSize* *= -1* in datasource , everything was ok but when the solr finish fetching data fr

Re: Solr Cloud 6.5.0 Replicas go down while indexing

2017-04-03 Thread Erick Erickson
bq: We set Auto hardcommit time to 15sec and 1 docs, and soft commit to 6 sec and 5000 seconds Just a sanity check, the commit intervals are in milliseconds, your units look mixed up above, I'm guessing it's just a typo though. I usually don't use maxDocs because it's unpredictable. Say yo

Re: edismax parsing confusion

2017-04-03 Thread Steve Rowe
Hi Abhishek, Which version of Solr are you using? I can see that the parsed queries are different, but they’re also very similar, and there’s a lot of detail there - can you be more specific about what the problem is? -- Steve www.lucidworks.com > On Apr 3, 2017, at 4:54 AM, Abhishek Mishra

Solr Cloud 6.5.0 Replicas go down while indexing

2017-04-03 Thread Salih Sen
Hi, We have a three server set up with each server having 756G ram, 48 cores, 4SSDs (each having tree solr instances on them) and a dedicated mechanical disk for zookeeper (3 zk instances total). Each Solr instances have 31G of heap space allocated to them. In total we have 36 Solr Instances and 3

Re: Disable All kind of caching in Solr/Lucene

2017-04-03 Thread Nilesh Kamani
@Yonik even though the code change is in SolrIndexer class, it has nothing do with index itself. After fetching docIds, I am filtering them on one more criteria. (Very weird code it is). I tried q={!cache=false}, but not working. Subsequent search is done under 2 milliseconds. Does anybdody have

Error whan using percentile facet on multivalued fields (again)

2017-04-03 Thread ron visbord
Hi, I'm using Solr 5.3.1. When trying to do a percentile facet on a multivalued field I get the following exception - *org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://host_name:8983/solr/core_name : can not us

edismax parsing confusion

2017-04-03 Thread Abhishek Mishra
Hi all i am running solr query with these parameter bf: "sum(product(new_popularity,100),if(exists(third_price),50,0))" qf: "test_product^5 category_path_tf^4 product_id gender" q: "handbags between rs150 and rs 400" defType: "edismax" parsed query is like below one for q:- (+(DisjunctionMaxQuer

Use of facet.pivot possible when there are multiple paths per document

2017-04-03 Thread Andreas Kahl
Hello everyone, we intend to index a set of documents with a monohierarchical classification. For the classification we need hierarchical facets in our UI. We would like to use Pivot facets because they are more flexible than hierarchical facets; but we are wondering if it is possible to index mul