Re: Re: Re: Protecting Tokens from Any Analysis

2019-10-09 Thread David Hastings
majority of you all do NOT use stop > words? > > -- > Audrey Lorberfeld > Data Scientist, w3 Search > IBM > audrey.lorberf...@ibm.com > > > On 10/9/19, 11:14 AM, "David Hastings" > wrote: > > However, with all that said, stopwords CAN be useful

Re: Re: Re: Re: Protecting Tokens from Any Analysis

2019-10-09 Thread David Hastings
the majority of you all do NOT use > stop words? > > -- > Audrey Lorberfeld > Data Scientist, w3 Search > IBM > audrey.lorberf...@ibm.com > > > On 10/9/19, 11:14 AM, "David Hastings" > wrote: > > However, with all th

Re: Re: Re: Re: Re: Protecting Tokens from Any Analysis

2019-10-09 Thread David Hastings
r corpus is ~5MM documents, multi-language, > and we get ~80k-100k queries/day) > > Are you using edismax? > > -- > Audrey Lorberfeld > Data Scientist, w3 Search > IBM > audrey.lorberf...@ibm.com > > > On 10/9/19, 3:11 PM, "David Hastings" > wrote:

AutoAddReplicas doesn't work with TLOG and PULL replicas

2019-10-10 Thread David Kovář
ver restarting. Do you have any solution how to automatically survive one server crash (auto create replica on new server with correct type and migrate data) when using TLOG and PULL replicas? Thank you for answer. David Kovar

Re: POS Tagger

2019-10-25 Thread David Hastings
Do you mean for entity extraction? I make a LOT of use from the stanford nlp project, and get out the entities and use them for different purposes in solr -Dave On Fri, Oct 25, 2019 at 10:16 AM Audrey Lorberfeld - audrey.lorberf...@ibm.com wrote: > Hi All, > > Does anyone use a POS tagger with t

Re: POS Tagger

2019-10-25 Thread David Hastings
https://nlp.stanford.edu/ On Fri, Oct 25, 2019 at 10:29 AM David Hastings < hastings.recurs...@gmail.com> wrote: > Do you mean for entity extraction? > I make a LOT of use from the stanford nlp project, and get out the > entities and use them for different purposes in solr >

Re: Re: POS Tagger

2019-10-25 Thread David Hastings
ntist, w3 Search > IBM > audrey.lorberf...@ibm.com > > > On 10/25/19, 10:30 AM, "David Hastings" > wrote: > > Do you mean for entity extraction? > I make a LOT of use from the stanford nlp project, and get out the > entities > and use them f

Re: POS Tagger

2019-10-25 Thread David Hastings
d POS tagger for french. The processing time is > mitigated by the spark-corenlp package which distribute the process over > multiple node. > > Also I am interesting in the way you use POS information within solr > queries, or solr fields. > > Thanks, > On Fri, Oct 25, 2019 at

Re: Re: POS Tagger

2019-10-25 Thread David Hastings
M > audrey.lorberf...@ibm.com > > > On 10/25/19, 12:06 PM, "David Hastings" > wrote: > > I use them for query boosting, so if someone searches for: > > i dont want to rush limbaugh out the door > vs > i talked to rush limbaugh through t

Re: Re: POS Tagger

2019-10-25 Thread David Hastings
oh i see what you mean, sorry, i explained it incorrectly. those sentences are what would be in the index, and a general search for 'rush limbaugh' would come back with results where he is an entity higher than if it was two words in a sentence On Fri, Oct 25, 2019 at 12:12 PM Davi

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread David Hastings
when you add a new document using the same "id" value as another it just over writes it On Mon, Nov 4, 2019 at 9:30 AM Khare, Kushal (MIND) < kushal.kh...@mind-infotech.com> wrote: > Could you please let me know how to achieve that ? > > > -Original Message- > From: Jörn Franke [mailto:jo

Re: Delete documents from the Solr index using SolrJ

2019-11-04 Thread David Hastings
In my case, id won't be same. > Suppose, I have a doc with id : 20 > Now, it's newer version would be either 20.1 or 22 > What in this case? > -Original Message- > From: David Hastings [mailto:hastings.recurs...@gmail.com] > Sent: 04 November 2019 20:04 > To: solr

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-05 Thread David Hastings
The first thing you should do is remove any reference to stop words and never use them, then re-index your data and try it again. On Tue, Nov 5, 2019 at 9:14 AM Guilherme Viteri wrote: > Hi, > > I am performing a search to match a name (text_field), however this term > contains 'and' and 'a' and

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-05 Thread David Hastings
e this here ? >positionIncrementGap="100" omitNorms="false" > > > > > > > words="stopwords.txt"/> > > > > > On 5 Nov 2019, at 14:15, David

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-07 Thread David Hastings
> paras.leh...@indiamart.com> > >>> wrote: > >>>>>> > >>>>>> Hi Walter, > >>>>>> > >>>>>> The solr.StopFilter removes all tokens that are stopwords. Those > words > >>> will > >>>>>

Re: Good Open Source Front End for Solr

2019-11-07 Thread David Hastings
well thats pretty slick On Thu, Nov 7, 2019 at 1:59 PM Erik Hatcher wrote: > Blacklight: http://projectblacklight.org/ > > ;) > > > > > On Nov 6, 2019, at 11:16 PM, Java Developer > wrote: > > > > Hi, > > > > What is the best open source front-end for Solr >

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-08 Thread David Hastings
the covers > is to > >>>>>> add “&debug=true” to the query and look at the parsed query. Ignore > all the > >>>>>> relevance calculations for the nonce, or specify “&debug=query” to > skip > >>>>>> that part. > >>>>>>>> >

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-08 Thread David Hastings
dd > about the > >>>>>>>> interactions. > >>>>>>>>>> > >>>>>>>>>> Second, I have no idea what this will do. Are the equal signs > typos? > >>>>>>>> Used by custom code?

Re: When search term has two stopwords ('and' and 'a') together, it doesn't work

2019-11-08 Thread David Hastings
a hack that mostly works. > > Infoseek had phrase IDF and it was a killer algorithm for relevance. > > wunder > Walter Underwood > wun...@wunderwood.org > http://observer.wunderwood.org/ (my blog) > > > On Nov 8, 2019, at 11:08 AM, David Hastings < > hastings.recu

Re: Highlighting on typing in search box

2019-11-21 Thread David Hastings
you can modify the result in this SO question to fit your needs: https://stackoverflow.com/questions/16742610/retrieve-results-from-solr-using-jquery-calls On Thu, Nov 21, 2019 at 10:42 AM rhys J wrote: > Are there any recommended APIs or code examples of using Solr and then > highlighting resu

Re: How to tell which core was used based on Json or XML response from Solr

2019-11-22 Thread David Hastings
2 things (maybe 3): 1. dont have this code facing a client thats not you, otherwise anyone could view the source and see where the solr server is, which means they can destroy your index or anything they want. put at the very least a simple api/front end in between the javascript page for the use

Re: How to tell which core was used based on Json or XML response from Solr

2019-11-22 Thread David Hastings
2019 at 1:43 PM rhys J wrote: > On Fri, Nov 22, 2019 at 1:39 PM David Hastings < > hastings.recurs...@gmail.com> > wrote: > > > 2 things (maybe 3): > > 1. dont have this code facing a client thats not you, otherwise anyone > > could view the source and see w

Re: How to tell which core was used based on Json or XML response from Solr

2019-11-25 Thread David Hastings
my $agent = new LWP::UserAgent; my $request = POST($url, $data); my $response = $agent->request($request)->decoded_content; On Mon, Nov 25, 2019 at 10:37 AM rhys J wrote: > On Mon, Nov 25, 2019 at 1:10 AM Paras Lehana > wrote: > > > Hey rhys, > >

Re: Using an & in an indexed field and then querying for it.

2019-11-25 Thread David Hastings
its breaking on the & because its in the url and you are most likely sending a get request to solr. you should send it as post or as %26 On Mon, Nov 25, 2019 at 2:32 PM rhys J wrote: > I have some fields that have text like so: > > Reliable Van & Storage. > > They indexed fine when I used curl
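A minimal sketch of the two options mentioned above (percent-encoding the & as %26, or sending the query as a POST body), using only the Python standard library. The host, core name, and the "name" field are hypothetical placeholders, not taken from the thread, and nothing is actually sent to a server here.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical Solr location and field name, for illustration only.
SOLR_SELECT = "http://localhost:8983/solr/mycore/select"


def build_get_url(query: str) -> str:
    """Return a GET URL in which '&' inside the query is encoded as %26."""
    return f"{SOLR_SELECT}?{urlencode({'q': query})}"


def build_post_request(query: str) -> Request:
    """Return an unsent POST request carrying the query in a form-encoded body."""
    body = urlencode({"q": query}).encode("utf-8")
    return Request(SOLR_SELECT, data=body, method="POST")


query = 'name:"Reliable Van & Storage"'
get_url = build_get_url(query)
post_request = build_post_request(query)
```

Either way the literal & never reaches Solr as a parameter separator; urlencode takes care of the quoting so the query string does not need to be escaped by hand.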

Re: Using an & in an indexed field and then querying for it.

2019-11-25 Thread David Hastings
wrote: > On Mon, Nov 25, 2019 at 2:36 PM David Hastings < > hastings.recurs...@gmail.com> > wrote: > > > its breaking on the & because its in the url and you are most likely > > sending a get request to solr. you should send it as post or as %26 > > >

Re: A Last Message to the Solr Users

2019-11-27 Thread David Hastings
Personally I found nothing in solr cloud worth changing from standalone for, and just added more complications, more servers, and required becoming an expert/knowledgeable in zoo keeper, id rather spend my time developing than becoming a systems administrator On Wed, Nov 27, 2019 at 3:45 AM Mark M

Re: Exact match

2019-12-02 Thread David Hastings
if the query is in quotes it will work. also, not sure if youve been following, but get rid of: StopFilterFactory and all stopwords, or just make your stop word file empty if you need it to work in non quotes, add them to the query post submission ? On Mon, Dec 2, 2019 at 3:44 PM OTH wrote: > H

xms/xmx choices

2019-12-05 Thread David Hastings
Hey all, over time ive adjusted and changed the solr Xms/Xmx various times with not too much thought aside from more is better, but ive noticed in many of the emails the recommended values are much lower than the numbers ive historically put in. i never really bothered to change them as the perfor

Re: From solr to solr cloud

2019-12-05 Thread David Hastings
are you noticing performance decreases in stand alone solr as of now? On Thu, Dec 5, 2019 at 2:29 PM Vignan Malyala wrote: > Hi > I currently have 500 collections in my stand alone solr. Bcoz of day by day > increase in Data, I want to convert it into solr cloud. > Can you suggest me how to do i

Re: xms/xmx choices

2019-12-05 Thread David Hastings
youd like I would be happy to provide, this is interesting. On Thu, Dec 5, 2019 at 2:41 PM Shawn Heisey wrote: > On 12/5/2019 11:58 AM, David Hastings wrote: > > as of now we do an xms of 8gb and xmx of 60gb, generally through the > > dashboard the JVM hangs around 16gb. I know

Re: xms/xmx choices

2019-12-05 Thread David Hastings
That probably isnt enough data, so if youre interested: https://gofile.io/?c=rZQ2y4 On Thu, Dec 5, 2019 at 2:52 PM David Hastings wrote: > I know theres no hard answer, and I know the Xms and Xmx should be the > same, but it was a set it and forget it sort of thing from years ago. I

Re: xms/xmx choices

2019-12-05 Thread David Hastings
and if this may be of use: https://imgur.com/a/qXBuSxG just been more or less winging the options since solr 1.3 On Thu, Dec 5, 2019 at 2:41 PM Shawn Heisey wrote: > On 12/5/2019 11:58 AM, David Hastings wrote: > > as of now we do an xms of 8gb and xmx of 60gb, generally th

Re: xms/xmx choices

2019-12-06 Thread David Hastings
s) > > > > We added about 2.3m docs, then I replicated it to the production master > and since there was a change it replicated out to the slave node the gc > came from > > > > I’ll set one of the slaves to 31/31 and force all load to that one and > see how she doe

Re: Search returning unexpected matches at the top

2019-12-06 Thread David Hastings
whats the field type for: clt_ref_no *_no isnt a default dynamic character, and owl-2924-8 usually translates into owl 2924 8 David J. Hastings | Lead Developer dhasti...@wshein.com | 716.882.2600 x 176 William S. Hein & Co., Inc. 2350 North Forest Road | Getzville, NY 14068 www.wshein

Re: How to add a new field to already an existing index in Solr 6.6 ?

2019-12-08 Thread David Barnett
There are a few ways to add fields. Adding the field definition to the managed-schema will do this for you, but make sure you have downloaded the current config before you edit and reload the schema. Google "solr 6.6 upconfig downconfig" for lots of guides on this. On Tue, 3 Dec 2019, 13:21 Erick Er

Re: How to add a new field to already an existing index in Solr 6.6 ?

2019-12-08 Thread David Barnett
Also, look at adding fields using the Solr admin UI; these will be available to use (I believe) without the need to restart, and it is very easy to do. On Sun, 8 Dec 2019, 13:03 David Barnett wrote: > There is a few ways to add fields, adding the field definition in the > managed-schem

Re: [ANNOUNCE] Apache Solr 8.3.1 released

2019-12-09 Thread David Smiley
Thanks. I observe we too often write in that way and leave it up to the reader to assume we don’t intentionally add bugs :-) On Mon, Dec 9, 2019 at 5:45 AM Colvin Cowie wrote: > Oh, just looking at the way the announcement reads on > http://lucene.apache.org/solr/news.html : > Solr 8.3.1 Releas

Re: xms/xmx choices

2019-12-09 Thread David Hastings
SOLR Raw SOLR over 1/1000 secs : 208, 0.47% Raw SOLR over 1000/1000 secs : 5261, 11.97% post solr changes: 28369 searches Complete SOLR average : 4.77 / 10th seconds for SOLR Raw SOLR over 1/1000 secs : 94, 0.33% Raw SOLR over 1000/1000 secs : 3583, 12.63% On Fri, Dec 6, 2019 at 9:39 AM

Re: Help for importing large data (approx. 8GB) from old solr version to new solr version

2019-12-25 Thread David Hastings
Exactly. Although I’m a bit curious why you’re going a .1 version up, I always wait until an x2, so I won’t be upgrading until 9.3. > On Dec 25, 2019, at 9:45 AM, Erick Erickson wrote: > > Should work. At any rate, just try it. Since all you’re doing is copying > data, even if the new installat

Re: does copyFields increase indexe size ?

2019-12-26 Thread David Hastings
The field is stored somewhere > On Dec 26, 2019, at 3:22 PM, Nicolas Paris wrote: > > Hi Eric > > Below a part of the managed-schema. There is 1k section* fields. The > second experience, I removed the copyField, droped the collection and > re-indexed the whole. To mesure the index size, I we

Solr 7.3 cluster issue

2019-12-27 Thread David Barnett
Happy holidays folks, we have a production deployment using Solr 7.3 in a three-node cluster. We have a number of collections set up, with three shards and a replication factor of 2. The system has been fine, but we experienced issues with disk space on one of the nodes. Node 0 starts but does not show any

Re: Solr 7.3 cluster issue

2019-12-28 Thread David Barnett
l collections redundant and then remove replicas from the defunct node > and finally decommission it. > > Jan Høydahl > > > 28. des. 2019 kl. 02:17 skrev David Barnett : > > > > Happy holidays folks, we have a production deployment usage Solr 7.3 in a > > three n

Fwd: Solr spatial search - overlapRatio of polygons

2020-01-08 Thread David Smiley
on. https://lucene.apache.org/solr/guide/8_3/query-re-ranking.html -- Forwarded message - From: Marc Date: Tue, Jan 7, 2020 at 6:14 AM Subject: Solr spatial search - overlapRatio of polygons To: David Smiley Dear Mr Smiley, I have a tricky question concerning the spatial search f

Re: Solr spatial search - overlapRatio of polygons

2020-01-08 Thread David Smiley
an 8, 2020 at 1:16 PM David Smiley wrote: > My response to a direct email (copying here with permission): > > It's possible; you'll certainly have to write some code here to make this > work, including some new Solr plugin; perhaps ValueSourceParser that can > compute a m

Re: SolrCloud upgrade concern

2020-01-16 Thread David Hastings
ha, im on that thread, didnt know they got stored on a site, thats good to know! -i stand by what i said in there. so i have nothing more to add On Thu, Jan 16, 2020 at 3:29 PM Arnold Bronley wrote: > Hi, > > I am trying to upgrade my system from Solr master-slave architecture to > SolrCloud a

Re: Failed to connect to server

2020-01-16 Thread David Hastings
> 'Error: Solr core is loading' do you have any suggesters or anything configured that would get rebuilt? On Thu, Jan 16, 2020 at 3:41 PM rhys J wrote: > On Thu, Jan 16, 2020 at 3:27 PM Edward Ribeiro > wrote: > > > A regular update is a delete followed by an indexing of the document. So >

Re: Dependency log4j-slf4j-impl for solr-core:7.5.0 causing a number of build problems

2020-01-16 Thread David Smiley
Ultimately if you deduce the problem, file a JIRA issue and share it with me; I will look into it. I care about this matter too; I hate having to exclude logging dependencies on the consuming end. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed

Re: Failed to connect to server

2020-01-17 Thread David Hastings
something like this in your solr config: autosuggest false text 0.005 DocumentDictionaryFactory title weight true true On Fri, Jan 17, 2020 at 12:02 PM rhys J wrote: > On Thu, Jan 16, 2020 at 3:48 PM David Hastings < > hastings.recurs...@gmail.com> > wrote: > > >

Re: Anyone have experience with Query Auto-Suggestor?

2020-01-20 Thread David Hastings
Ive used this quite a bit, my biggest piece of advice is to choose a field that you know is clean, with well defined terms/words, you dont want an autocomplete that has a massive dictionary, also it will make the start/reload times pretty slow On Mon, Jan 20, 2020 at 11:47 AM Audrey Lorberfeld - a

Re: Re: Anyone have experience with Query Auto-Suggestor?

2020-01-20 Thread David Hastings
Not a bad idea at all, however ive never used an external file before, just a field in the index, so not an area im familiar with On Mon, Jan 20, 2020 at 11:55 AM Audrey Lorberfeld - audrey.lorberf...@ibm.com wrote: > David, > > Thank you, that is useful. So, would you recommend using

Re: SPLITSHARD - data loss of child documents

2020-12-19 Thread David Smiley
https://issues.apache.org/jira/browse/SOLR-11191 and I assigned it to myself just now. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Thu, Dec 17, 2020 at 9:50 AM Mike Drob wrote: > I was under the impression that split shard doesn’t work w

Re: Highlighting large text fields

2021-01-11 Thread David Smiley
enefit from putting offsets into the search index (and re-index) -- storeOffsetsWithPositions. That's an option on the field/fieldType in your schema; it may not be obvious reading the docs. You have to opt-in to that; Solr doesn't normally store any info in the index for highlighting. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley

Re: Highlighting large text fields

2021-01-12 Thread David Smiley
On Tue, Jan 12, 2021 at 9:39 AM Shaun Campbell wrote: > Hi David > > First of all I wanted to say I'm working off your book!! Third edition, > and I think it's a bit out of date now. I was just going to try following > the section on the Postings highlighter, but I

Re: Highlighting large text fields

2021-01-12 Thread David Smiley
On Tue, Jan 12, 2021 at 1:08 PM Shaun Campbell wrote: > Hi David > > Getting closer now. > > First of all, a bit of a mistake on my part. I have two cores set up and I > was changing the solrconfig.xml on the wrong core doh!! That's why > highlighting wasn't bei

Re: Highlighting large text fields

2021-01-12 Thread David Smiley
likely to not highlight as much as you are highlighting now, and highlighting more is your goal right now it appears. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Tue, Jan 12, 2021 at 2:45 PM Shaun Campbell wrote: > That's great Da

Re: Exact matching without using new fields

2021-01-19 Thread David R
We had the same requirement. Just to echo back your requirements, I understand your case to be this. Given these 2 doc titles: doc 1: "information retrieval" doc 2: "Advanced information retrieval with Solr" You want a phrase search for "information retrieval" to find both documents, but an EXA

Re: Exact and non exact highlighting

2021-01-22 Thread David Smiley
uching the Solr schema. If you are up for it, comment on that issue to let the original contributor know you want to help move this forward. Maybe they do too. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Fri, Jan 22, 2021 at 12:46 PM d

Re: Performance issue with Solr 8.6.1 Unified Highlighter does not occur on Solr 6.

2021-01-28 Thread David Smiley
x2=4 tests). ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Jan 27, 2021 at 2:20 AM Kerwin wrote: > Hi, > > While upgrading to Solr 8 from 6 the Unified highlighter begins to have > performance issues going from approximately 100ms

Re: Performance issue with Solr 8.6.1 Unified Highlighter does not occur on Solr 6.

2021-01-29 Thread David Smiley
r than the original highlighter. Just because hl.requireFieldMatch=false is the default, doesn't mean it's the _right_ choice for everyone's app :-). I tend to think Solr should flip this in 9.0 for both accuracy & performance sake. And unset hl.maxAnalyzedChars -- mostly an obsolet

Re: Incorrect distance returned for indexed polygone shape

2021-01-31 Thread David Smiley
enough for what you want to do. Basically, calculate the geodist but subtract the radius field... maybe something like this (untested!): sort=sub(geodist(),radius) desc. Use LatLonPointSpatialField to store point data if you can (is appropriate), which succeeded RPT for that. ~ David Smiley
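A small sketch of how the sort expression suggested above might be assembled into request parameters. The "radius" field and the desc ordering come from the message (which marks the expression as untested); the spatial field name "location" and the coordinates are hypothetical placeholders, and the parameters are only built, not sent.

```python
from urllib.parse import urlencode


def radius_adjusted_sort_params(sfield, lat, lon, radius_field="radius", order="desc"):
    """Build Solr params that sort by geodist() minus a per-document radius field."""
    return {
        "q": "*:*",
        "sfield": sfield,      # spatial field that geodist() reads
        "pt": f"{lat},{lon}",  # reference point as "lat,lon"
        "sort": f"sub(geodist(),{radius_field}) {order}",
    }


params = radius_adjusted_sort_params("location", 45.15, -93.85)
query_string = urlencode(params)
```

geodist() with no arguments picks up sfield and pt from the request, which is why both are set alongside the sort.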

Re: SOLR upgrade

2021-02-12 Thread David Hastings
i generally will only upgrade every other release. since i started with 1.4, went to 3->5->7.X, and never EVER a .0 or an even .X release, On Fri, Feb 12, 2021 at 12:01 PM Ishan Chattopadhyaya < ichattopadhy...@gmail.com> wrote: > Just avoid 8.8.0 for the moment, until 8.8.1 is released. 8.7.x s

Re: Atomic Update (nested), Unified Highlighter and Lazy Field Loading => Invalid Index

2021-02-14 Thread David Smiley
ata; maybe that can illustrate the problem? It's not clear if nested schema or nested docs are actually required in your example. If you share the JIRA issue with me, I'll chase this one down. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley

Re: Atomic Update (nested), Unified Highlighter and Lazy Field Loading => Invalid Index

2021-02-17 Thread David Smiley
th it here. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Feb 17, 2021 at 6:36 AM Nussbaum, Ronen wrote: > Hello David, > > Thank you for your reply. > It was very hard but finally I discovered how to reproduce it. I thought >

Re: Atomic Update (nested), Unified Highlighter and Lazy Field Loading => Invalid Index

2021-02-17 Thread David Smiley
ing a query that only returns the "id" field. No highlighting. ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Wed, Feb 17, 2021 at 10:28 AM David Smiley wrote: > Thanks for more details. I was able to reproduce this locally! I hacke

Re: Atomic Update (nested), Unified Highlighter and Lazy Field Loading => Invalid Index

2021-02-18 Thread David Smiley
ple imagine if you stored the entire input data as JSON in a _json_ field or some-such. Nowadays, I'd set large="true" on such a field, which is a much newer option. I was able to tweak my test to have only alphabetic IDs, and the test still failed. I don't see how the ID'

Re: Congratulations to the new Apache Solr PMC Chair, Jan Høydahl!

2021-02-18 Thread David Smiley
Congratulations Jan! ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Thu, Feb 18, 2021 at 1:56 PM Anshum Gupta wrote: > Hi everyone, > > I’d like to inform everyone that the newly formed Apache Solr PMC nominated > and elected Jan Høy

Re: Atomic Update (nested), Unified Highlighter and Lazy Field Loading => Invalid Index

2021-02-19 Thread David Smiley
nd that ends up being LazyField if you have that feature enabled, or possible wasted space if you don't have that enabled. So I don't think the ability to exclude fields in "fl" would obsolete enableLazyFieldLoading which I think you are implying? ~ David Smiley Apach

Re: [ANNOUNCE] Apache Solr 8.8.1 released

2021-02-27 Thread David Smiley
The corresponding docker image has been released as well: https://hub.docker.com/_/solr (credit to Tobias Kässmann for helping) ~ David Smiley Apache Lucene/Solr Search Developer http://www.linkedin.com/in/davidwsmiley On Tue, Feb 23, 2021 at 10:39 AM Timothy Potter wrote: > The Lucene PMC

collection API timeout

2015-11-04 Thread Julien DAVID - Decalog
Hi all, We have a production environment composed of 6 SolrCloud servers and 3 ZooKeeper nodes. We've got around 30 collections, with 6 shards each. We recently moved from 3 Solr servers to 6, splitting the shards (3 to 6). As the last few weeks were a low period, we didn't notice any problems. But since Monday,

Re: [EXTERNAL] Re: OOM during indexing with 24G heap - Solr 6.5.1

2017-10-16 Thread David M Giannone
Original message From: Randy Fradin Date: 10/16/17 7:38 PM (GMT-05:00) To: solr-user@lucene.apache.org Subject: [EXTERNAL] Re: OOM during indexing with 24G heap - Solr 6.5.1 Each shard has around 4.2 million docum

Solr Cloud - How to maintain the addresses of the zookeeper servers

2017-01-26 Thread David Michael Gang
add 2 zookeeper nodes, I would have to update many places. This makes it hard to maintain. How do you manage this? Is it possible to get the list of zookeeper services dynamically? Any other ideas? I wanted to hear from your experience how to achieve this task effectively. Thanks, David

Re: Solr Cloud - How to maintain the addresses of the zookeeper servers

2017-01-26 Thread David Michael Gang
Hi Markus and Jan, Thanks for the quick response and good ideas. I will look into the puppet direction. We already use puppet, so this is easy to add. Thanks a lot, David On Thu, Jan 26, 2017 at 3:38 PM Markus Jelsma wrote: > Or you can administate the nodes via configuration managem

minimal solrconfig example

2017-03-01 Thread David Michael Gang
i miss is the most minimal file. Where can I find a minimal solrconfig.xml file with just the required options? Thanks, David

Re: minimal solrconfig example

2017-03-02 Thread David Michael Gang
I use the latest version. Solr 6.4.1 On Thu, Mar 2, 2017 at 9:15 AM Aravind Durvasula wrote: > Hi David, > > What is the solr version you are using? > To get started, it's better to use the config file that comes out of the > box. > > Thanks, > Aravind > >

Re: minimal solrconfig example

2017-03-02 Thread David Michael Gang
Thanks Charly. This is what i looked for. On Thu, Mar 2, 2017 at 11:07 AM David Michael Gang wrote: I use the latest version. Solr 6.4.1 On Thu, Mar 2, 2017 at 9:15 AM Aravind Durvasula wrote: Hi David, What is the solr version you are using? To get started, it's better to use the c

messages in gc log not connected to gcs in indexing time

2017-03-02 Thread David Michael Gang
: 0.771 seconds They are not connected to GCs, but to safepoints: http://blog.ragozin.info/2012/10/safepoints-in-hotspot-jvm.html What causes these safepoints in Solr? Do I have to take care of this? If yes, what can I do about it? Thanks, David

recommended zookeeper version for solr cloud

2017-04-26 Thread David Michael Gang
Hi all, Which version of external ZooKeeper is recommended for use in production environments? 3.4.6, which is the version shipped with Solr, or 3.4.10, which is the latest stable? Thanks, David

Re: Polygon search returning "InvalidShapeException: incompatible dimension (2)... error.

2014-03-10 Thread Smiley, David W.
You need to either quote your query (after the colon, and another at the very end), or escape any special characters, or use a different query parser like “field”. I prefer to use the field query parser: {!field f=loc}Intersects(POLYGON(... ~ David On 3/6/14, 10:52 AM, "leevduhl"
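As a rough illustration of the field query parser approach described above, the following sketch assembles an fq value of the form {!field f=loc}Intersects(POLYGON((...))). The field name "loc" and the Intersects predicate come from the message; the helper names and the polygon coordinates are made up for the example, since the original polygon is elided.

```python
def polygon_wkt(points):
    """Format (x, y) pairs as a WKT polygon, closing the ring if needed."""
    ring = list(points)
    if ring[0] != ring[-1]:
        ring.append(ring[0])  # WKT rings must end on their starting point
    coords = ", ".join(f"{x} {y}" for x, y in ring)
    return f"POLYGON(({coords}))"


def field_parser_filter(field, predicate, wkt):
    """Build a filter query that uses the 'field' query parser, so no quoting is needed."""
    return f"{{!field f={field}}}{predicate}({wkt})"


# Hypothetical polygon, x y (longitude latitude) order.
wkt = polygon_wkt([(-10, 30), (-40, 40), (-10, -20)])
fq = field_parser_filter("loc", "Intersects", wkt)
```

Because the whole remainder of the string is handed to the field's own parser, the parentheses and spaces in the WKT do not have to be escaped or wrapped in quotes; the value still needs normal URL encoding when placed in a request.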

Re: Solr spatial search within the polygon

2014-03-10 Thread Smiley, David W.
& >fq=LOCATION:"IsWithin(POLYGON(( >-4.0594 40.8708, >-4.0621 40.7211 , >-3.8095 40.7127, >-3.8232 40.8687, >-4.0594 40.8708 >))) distErrPct=0" > >***I also tried modifying the order of lat/lon but I am not able to find >out >the solution to make it work. The “x y” order looks good. “IsWithin” should work, but if all your indexed data is points then use “Intersects”, which is much faster. As a sanity check, can you simply do a {!geofilt} query with “pt” set to Madrid and “d” set to a hundred kilometers or so? ~ David
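The sanity check suggested above could look roughly like this: a {!geofilt} filter on the LOCATION field with the point set to central Madrid and a distance of 100 km. The coordinates (40.4168, -3.7038) and the helper name are assumptions added for illustration; the parameters are only constructed here, not sent to Solr.

```python
from urllib.parse import urlencode


def geofilt_params(field, lat, lon, distance_km):
    """Build params for a circle filter: documents within distance_km of (lat, lon)."""
    fq = f"{{!geofilt sfield={field} pt={lat},{lon} d={distance_km}}}"
    return {"q": "*:*", "fq": fq}


# Approximate coordinates for Madrid, given as lat,lon (an assumption for this example).
params = geofilt_params("LOCATION", 40.4168, -3.7038, 100)
query_string = urlencode(params)
```

If this simple circle query returns the expected documents, the indexed points are fine and the problem is more likely in the polygon query itself.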

Re: Solr spatial search within the polygon

2014-03-10 Thread Smiley, David W.
On 3/10/14, 12:12 PM, "Smiley, David W." wrote: >> >> >> >>c) I tried no WKT format by adding a comma and using "longitude,latitude" >> >> >> >> 40.442179,-3.69278 >> >> > >That is *wrong*. Remo

Re: Solr spatial search within the polygon

2014-03-10 Thread Smiley, David W.
but I don’t recommend that, as a matter of taste, if >>all >> your data is in latitude & longitude, as opposed to projected data or >>any >> other spatial data. >> >> What do you recommend? >> >> With "all your data is in latitude & longitude", do you refer that every >> doc in the index has only the field LOCATION? If the answer is yes, then >> no, there is more fields in all the documents. I’m only talking about the spatial field. I mean if your *spatial data* is entirely data points where the two dimensions are latitude and longitude on the surface of the earth (or hypothetically some other spherical place). ~ David

Re: Solr spatial search within the polygon

2014-03-10 Thread David Smiley (@MITRE.org)
You're going to have to use the Lucene-spatial module directly then. There's SpatialExample.java to get you started. javinsnc wrote > > David Smiley (@MITRE.org) wrote >> On 3/10/14, 12:56 PM, "javinsnc" < >> javiersangrador@ >> >

Re: Solr spatial search within the polygon

2014-03-10 Thread David Smiley (@MITRE.org)
Lucene has multiple modules, one of which is "spatial". You'll see it in the source tree checkout underneath the lucene directory. Javadocs: http://lucene.apache.org/core/4_7_0/spatial/index.html SpatialExample.java: https://github.com/apache/lucene-solr/blob/trunk/lucene/spatial/src/test/org/apa

Re: Issue with spatial search

2014-03-10 Thread Smiley, David W.
Solr version should simply use 0 as the default; the last time I did benchmarks, the impact of a higher distErrPct was pretty marginal. It’s a fairly different story if you are indexing non-point shapes. ~ David From: Steven Bower <smb-apa...@alcyon.net> Reply-To: "solr-user@lucene

Re: Issue with spatial search

2014-03-10 Thread David Smiley (@MITRE.org)
Correct, Steve. Alternatively you can also put this option in your query after the end of the last parenthesis, as in this example from the wiki: fq=geo:"IsWithin(POLYGON((-10 30, -40 40, -10 -20, 40 20, 0 0, -10 30))) distErrPct=0" ~ David Steven Bower wrote > Only points in t

Re: Issue with spatial search

2014-03-11 Thread David Smiley (@MITRE.org)
-off where it’s increasingly painful to get close to distErrPct=0 if the shapes cover a large area. I have near-term plans to address the index-time non-point shape accuracy but at least you don’t have that case from the scenario you gave. ~ David From: "Steven Bower-2 [via Lucene]"

Re: Doing spatial search on multiple location points

2014-03-17 Thread Smiley, David W.
INT(x y, x y, x y, x y), d) distErrPct=0" (where ‘d’ is distance in degrees, not km). ~ David On 3/17/14, 9:28 AM, "Varun Gupta" wrote: >Hi, > >I am trying to find out if solr supports doing a spatial search on >multiple >location points. Basically, while query

Re: Doing spatial search on multiple location points

2014-03-18 Thread Smiley, David W.
.) ~ David On 3/18/14, 2:59 AM, "Varun Gupta" wrote: >Hi David, > >Thanks for the quick reply. > >As I haven't migrated to 4.7 (I am still using 4.6), I tested using OR >clause with multiple geofilt query based phrases and it seems to be >working >great. But I

Single multilingual field analyzed based on other field values

2013-10-28 Thread David Anthony Troiano
Hello, First some background... I am indexing a multilingual document set where documents themselves can contain multiple languages. The language(s) within my documents are known ahead of time. I have tried separate fields per language, and due to the poor query performance I'm seeing with that

Re: Index JTS Point in Solr/Lucene index

2013-10-28 Thread David Smiley (@MITRE.org)
Just following up on this thread after a round of emails between Shahbaz and me… David Smiley wrote > Ooooh, I see your confusion. You looked at code in an > UpdateRequestProcessor and expected it to work on the client in SolrJ. It > won't work for the reason that the code

Re: geo/spatial search performance comparison using different methods

2013-11-06 Thread Smiley, David W.
f you actually do have single valued spatial to sort on, then definitely don't use SOLR-2155 or RPT for that, use LatLonType. It's surely faster but I haven't measured it. The best multi-valued distance sort option for Solr 4 is currently this: https://issues.apache.org/jira/browse

Re: Question regarding Indexsize with Spatial4j Rectangulars

2013-11-06 Thread Smiley, David W.
umber trades off precision for index size. It can make a big difference. ~ David On 11/6/13 8:20 AM, "Dennis Reichelt" wrote: >Hi, > >we are testing Solr and index a huge amount of files. We integrated a >Spatial4j field which is only used to index rectangulars so we removed &

Re: geo/spatial search performance comparison using different methods

2013-11-06 Thread Smiley, David W.
should do). To be clear, I think you should continue to use SOLR-2155/RPT for spatial filtering -- that's what that technology does best. ~ David On 11/6/13 3:22 PM, "T. Kuro Kurosaka" wrote: >Thank you, David. >I believe the field doesn't need to be multivalued. >C

field collapsing performance in sharded environment

2013-11-13 Thread David Anthony Troiano
Hello, I'm hitting a performance issue when using field collapsing in a distributed Solr setup and I'm wondering if others have seen it and if anyone has an idea to work around it. I'm using field collapsing to deduplicate documents that have the same near duplicate hash value, and deduplicating

Re: Solr spatial search within the polygon

2013-11-18 Thread Smiley, David W.
en "Intersects" is much faster. IsWithin must look in all places outside your shape (in addition to inside) to attempt to ensure it doesn't include external data. ~ David On 11/15/13 1:36 AM, "Dhanesh Radhakrishnan" wrote: >Hi, >I'm experimenting with solr sp

Re: Solr spatial search within the polygon

2013-11-19 Thread Smiley, David W.
On 11/19/13 4:06 AM, "Dhanesh Radhakrishnan" wrote: >Hi David, >Thank you so much for the detailed reply. I've checked each and every lat >lng coordinates and its a purely polygon. >After some time I did one change in the lat lng indexing. >Changed the indexing

Re: Solr spatial search within the polygon

2013-11-20 Thread Smiley, David W.
now prevents you from searching anywhere out of -90 & +90 degrees longitude. At this point I don't think I can be any more clear. Good luck. ~ David On 11/20/13 12:50 AM, "Dhanesh Radhakrishnan" wrote: >Hi David, >Thank you for your reply >This is my current schem

RE: Range queries with Grouping is slow?

2014-01-07 Thread Smiley, David W.
t wow that's a big difference you noted; it wouldn't hurt to double-check with the debugger that the [* TO *] is treated as a numeric range query instead of a generic term range. ~ David From: Kranti Parisa [kranti.par...@gmail.com] Sent: Tuesday, J

Re: Range queries with Grouping is slow?

2014-01-09 Thread Smiley, David W.
d by definition. >> > Am I missing something here? >> > Thanks, >> > Kranti K. Parisa >> > http://www.linkedin.com/in/krantiparisa >> > On Wed

Re: Indexing spatial fields into SolrCloud (HTTP)

2014-01-13 Thread Smiley, David W.
reference the SpatialStrategy, for example. ~ David From: , "Jim (US-KOP)" <jim.be...@hibu.com> Date: Friday, January 10, 2014 at 12:15 PM To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> Cc: "
