Re: regarding Extracting text from Images

2020-01-17 Thread Retro
Hello, can you please advise me, how to configure Solr so that embedded Tika is able to use Tesseract to do the ocr of images? I have installed the following software - SOLR - 7.4.0 Tesseract - 4.1.1-rc2-20-g01fb TIKA - TIKA 1.18 Tesseract is installed in to the following directory: /u

Re: regarding Extracting text from Images

2020-01-17 Thread Jörn Franke
Have you checked this? https://cwiki.apache.org/confluence/display/TIKA/TikaOCR > Am 17.01.2020 um 10:54 schrieb Retro : > > Hello, can you please advise me, how to configure Solr so that embedded Tika > is able to use Tesseract to do the ocr of images? I have installed the > following softwar

Re: regarding Extracting text from Images

2020-01-17 Thread Marco Reis
Are you intending to use the solution in production? If so, combining Tika and Tesseract on the same server could not be a good choice. Tika and Tesseract are heavy processing consumers, harming the main service on the solution, in your case, Solr service. I had the same situation here, and the com

Handling overlapping synonyms

2020-01-17 Thread fiedzia
Having synonyms defined for new york -> new_york new york city -> new_york_city I'd like the phrase new york city to be indexed as both, but SynonymGraphFilter picks only one. Is there a way around that? -- Maciej Dziardziel fied...@gmail.com -- Sent from: https://lucene.472066.n3.nabble.co

Re: Handling overlapping synonyms

2020-01-17 Thread Audrey Lorberfeld - audrey.lorberf...@ibm.com
If you instead write "new york => new_york, new_york_city" it should work (https://doc.lucidworks.com/fusion/3.1/Collections/Synonyms-Files.html) On 1/17/20, 6:29 AM, "fiedzia" wrote: Having synonyms defined for new york -> new_york new york city -> new_york_city I'd

Re: Handling overlapping synonyms

2020-01-17 Thread fiedzia
> If you instead write "new york => new_york, new_york_city" it should work I can't do that, as that would turn "new york" into "new york_city", which is not what I want. Doing it the other way (new york city -> new_york_city, new_york) makes more sense, though I expect this to get positions wrong

Re: Dependency log4j-slf4j-impl for solr-core:7.5.0 causing a number of build problems

2020-01-17 Thread Mark H. Wood
On Thu, Jan 16, 2020 at 03:13:17PM +, Wolf, Chris (ELS-CON) wrote: > --- original message --- > It looks to me as though solr-core is not the only artifact with that > dependency. The first thing I would do is examine the output of 'mvn > dependency:tree' to see what has dragged log4j-slf4j-im

Re: Handling overlapping synonyms

2020-01-17 Thread fiedzia
> Doing it the other way (new york city -> new_york_city, new_york) makes more sense, Just checked it, that way does the matching as expected, but highlighting is wrong ("new york: query matches "new york city" as it should, but also highlights all of it) -- Sent from: https://lucene.472066.n3.

Re: Failed to connect to server

2020-01-17 Thread rhys J
On Thu, Jan 16, 2020 at 3:48 PM David Hastings wrote: > > 'Error: Solr core is loading' > > do you have any suggesters or anything configured that would get rebuilt? > > > I don't think so? But I'm not quite sure what you are asking? > Rhys

Re: Failed to connect to server

2020-01-17 Thread David Hastings
something like this in your solr config: autosuggest false text 0.005 DocumentDictionaryFactory title weight true true On Fri, Jan 17, 2020 at 12:02 PM rhys J wrote: > On Thu, Jan 16, 2020 at 3:48 PM David Hastings < > hastings.recurs...@gmail.com> > wrote: > > > > 'Error: Solr core is

Re: Failed to connect to server

2020-01-17 Thread rhys J
On Fri, Jan 17, 2020 at 12:10 PM David Hastings < hastings.recurs...@gmail.com> wrote: > something like this in your solr config: > > autosuggest "exactMatchFirst">false text str> 0.005 > DocumentDictionaryFactory title "weightField">weight true "buildOnOptimize">true > > I checked both /var

Re: Failed to connect to server

2020-01-17 Thread Dave
It doesn’t need to be identical, just anything with a buildon reload statement > On Jan 17, 2020, at 12:17 PM, rhys J wrote: > > On Fri, Jan 17, 2020 at 12:10 PM David Hastings < > hastings.recurs...@gmail.com> wrote: > >> something like this in your solr config: >> >> autosuggest > "exactMa

Solr cloud production set up

2020-01-17 Thread Rajdeep Sahoo
Hi all, We are using solr cloud 7.7.1 In a live production environment how many solr cloud server do we need, Currently ,we are using master slave set up with 16 slave server with solr 4.6. In solr cloud do we need to scale it up or 16 server will suffice the purpose.

Re: Failed to connect to server

2020-01-17 Thread Edward Ribeiro
> I have increased the number of maxConnections to see if this fixes the problem. This solved the "connection refused" issue? > I noticed in the log that there was an error from a curl statement that said 'Error: Solr core is loading' This is weird. Solr usually don't just reload cores. Are you

Re: Re: Handling overlapping synonyms

2020-01-17 Thread Audrey Lorberfeld - audrey.lorberf...@ibm.com
Hmm what is the reasoning behind adding the bigrams and trigrams manually like that? Maybe if we knew the end goal, we could figure out a different strategy. Happy that at least the matching is working now! On 1/17/20, 10:28 AM, "fiedzia" wrote: > Doing it the other way (new york cit

Solr 7.7.1: Disaster Recovery options for an out-of-region data center

2020-01-17 Thread Ahlberg, Christopher C.
Hello Solr Community, I understand that for an out-of-region disaster recovery solution with Solr, we basically have two choices: 1. Backup and Restore API for Collections 2. Solr CDCR Has anyone ever used EMC VMAX storage and SRDF replication for their Solr environments? We do this for

Re: Solr cloud production set up

2020-01-17 Thread Rajdeep Sahoo
Please reply anyone On Sat, 18 Jan, 2020, 12:13 AM Rajdeep Sahoo, wrote: > Hi all, > We are using solr cloud 7.7.1 > In a live production environment how many solr cloud server do we need, > Currently ,we are using master slave set up with 16 slave server with > solr 4.6. > In solr cloud do we

Re: Solr cloud production set up

2020-01-17 Thread Walter Underwood
Why do you want to change to Solr Cloud? Master/slave is a great, stable cluster architecture. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Jan 17, 2020, at 6:19 PM, Rajdeep Sahoo wrote: > > Please reply anyone > > On Sat, 18 Jan, 2020, 12:13 A