Re: Investigating Seeming Deadlock

2021-03-05 Thread Mike Drob
Were you having any OOM errors beforehand? If so, that could have caused some GC of objects that other threads still expect to be reachable, leading to these null monitors. On Fri, Mar 5, 2021 at 12:55 PM Stephen Lewis Bianamara < stephen.bianam...@gmail.com> wrote: > Hi SOLR Community, > > I'm i

Re: Partial update bug on solr 8.8.0

2021-03-02 Thread Mike Drob
This looks like a bug that is already fixed for 8.9, which has not been released yet: https://issues.apache.org/jira/plugins/servlet/mobile#issue/SOLR-13034 On Tue, Mar 2, 2021 at 6:27 AM Mohsen Saboorian wrote: > Any idea about this post? > https://stackoverflow.com/q/66335803/141438 > > Regards. >

Re: Asymmetric Key Size not sufficient

2021-02-14 Thread Mike Drob
Future vulnerability reports should be sent to secur...@apache.org so that they can be resolved privately. Thank you On Fri, Feb 12, 2021 at 10:17 AM Ishan Chattopadhyaya < ichattopadhy...@gmail.com> wrote: > Recent versions of Solr use 2048. > > https://github.com/apache/lucene-solr/blob/branch

Re: Ghost Documents or Shards out of Sync

2021-02-01 Thread Mike Drob
To expand on what Jason suggested, if the issue is the non-deterministic ordering due to staggered commits per replica, you may have more consistency with TLOG replicas rather than the NRT replicas. In this case, the underlying segment files should be identical and lead to more predictable results.

Re: Apache Solr Reference Guide isn't accessible

2021-02-01 Thread Mike Drob
Hi Dorion, We are currently working with our infra team to get these restored. In the meantime, the 8.4 guide is still available at https://lucene.apache.org/solr/guide/8_4/ and we are hopeful that the 8.8 guide will be back up soon. Thank you for your patience. Mike On Mon, Feb 1, 2021 at 1:58 PM

Re: Solr 8.7.0 memory leak?

2021-01-27 Thread Mike Drob
Are you running these in docker containers? Also, I’m assuming this is a typo, but just in case: the setting is Xmx :) Can you share the OOM stack trace? An OOM does not always mean the heap ran out of memory; sometimes Java throws OOM for file handles or threads. Mike On Wed, Jan 27, 2021 at 10:00 PM Luke wrote

Re: NullPointerException in Graph Traversal nodes streaming expression

2021-01-21 Thread Mike Drob
Can you provide a sample expression that would be able to reproduce this? Are you able to try a newer version by chance - I know we've fixed a few NPEs recently, maybe https://issues.apache.org/jira/browse/SOLR-14700 On Thu, Jan 21, 2021 at 4:13 PM ufuk yılmaz wrote: > Solr version 8.4. I’m gett

Re: Cursor Performance Issue

2021-01-13 Thread Mike Drob
You should be using docvalues on your id, but note that switching this would require a reindex. On Wed, Jan 13, 2021 at 6:04 AM Ajay Sharma wrote: > Hi All, > > I have used cursors to search and export documents in solr according to > > https://lucene.apache.org/solr/guide/6_6/pagination-of-resu
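For readers unfamiliar with the cursor pattern being discussed, here is a minimal Python sketch of a cursorMark export loop. It is an illustration rather than code from the thread: `fetch_page` is a hypothetical callable that sends the given parameters to a Solr `/select` handler and returns the parsed JSON response, and the uniqueKey field is assumed to be named `id`. The sort on `id` is the part that benefits from docValues.

```python
def iter_cursor_pages(fetch_page, rows=100):
    """Yield successive pages of documents using Solr cursorMark paging.

    fetch_page is a hypothetical callable: it takes a dict of query
    parameters and returns the parsed JSON response as a dict.
    """
    cursor = "*"  # "*" asks Solr to start a new cursor
    while True:
        response = fetch_page({
            "q": "*:*",
            "sort": "id asc",  # cursors need a sort that includes the uniqueKey
            "rows": rows,
            "cursorMark": cursor,
        })
        yield response["response"]["docs"]
        next_cursor = response["nextCursorMark"]
        if next_cursor == cursor:
            # Solr returns the same mark it was given once nothing is left.
            break
        cursor = next_cursor
```

Injecting `fetch_page` keeps the loop independent of any particular HTTP client; the last page yielded may be empty.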

Re: Converting a collection name to an alias

2021-01-07 Thread Mike Drob
I believe you may be able to use that command (or some combination of create alias commands) to create an alias from A to A, and then in the future when you want to change it you can have Alias A to collection B (assuming this is the point of the alias in the first place). On Thu, Jan 7, 2021 at 1
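The later repointing step (alias A to collection B) might look like the sketch below, which only builds the Collections API URL and does not send it. The base URL `http://localhost:8983/solr` is an assumption, and `create_alias_url` is a hypothetical helper name. `CREATEALIAS` with `name` and `collections` is the standard call, and issuing it for an existing alias name replaces that alias.

```python
from urllib.parse import urlencode


def create_alias_url(alias, collections, base_url="http://localhost:8983/solr"):
    """Build a Collections API CREATEALIAS request URL (nothing is sent)."""
    params = {
        "action": "CREATEALIAS",
        "name": alias,
        "collections": ",".join(collections),
    }
    return base_url + "/admin/collections?" + urlencode(params)


# Point alias "A" at collection "B".
print(create_alias_url("A", ["B"]))
```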

Re: SPLITSHARD - data loss of child documents

2020-12-17 Thread Mike Drob
I was under the impression that split shard doesn’t work with child documents. If that is missing from the ref guide, we should update it. On Thu, Dec 17, 2020 at 4:30 AM Nussbaum, Ronen wrote: > Hi Everyone, > > We're using version 8.6.1 with nested documents. > I used the SPLITSHARD API and afte

Re: solr 8.6.3 and noggit

2020-11-20 Thread Mike Drob
Noggit code was forked into Solr, see SOLR-13427 https://github.com/apache/lucene-solr/blob/releases/lucene-solr/8.6.3/solr/solrj/src/java/org/noggit/ObjectBuilder.java It looks like that particular method was added in 8.4 via https://issues.apache.org/jira/browse/SOLR-13824 Is it possible you're

Re: download binary files will not uncompress

2020-11-03 Thread Mike Drob
> jar@jarfx ~/download $ tar xvf solr-8.6.3.tgz > > gzip: stdin: not in gzip format > tar: Child returned status 1 > tar: Error is not recoverable: exiting now > > > James A. Rome > 116 Claymore Lane > Oak Ridge, TN 37830-7674 > 865 482-5643; Cell: 865 566-7991 >

Re: download binary files will not uncompress

2020-11-03 Thread Mike Drob
Can you check the signatures to make sure your downloads were not corrupted? I just checked and was able to download and uncompress both of them. Also, depending on your version of tar, you don't want the - for your flags... tar xf solr-8.6.3.tgz Mike On Tue, Nov 3, 2020 at 4:15 PM James Rome w
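A quick local check for the "not in gzip format" error can be sketched in Python as follows. This is an illustration, not something from the thread: it tests for the gzip magic bytes (a saved HTML page will fail this test) and computes a SHA-512 digest that can be compared by hand against the checksum published next to the download. Function names are hypothetical.

```python
import hashlib

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def looks_like_gzip(path):
    """Return True if the file starts with the gzip magic bytes."""
    with open(path, "rb") as f:
        return f.read(2) == GZIP_MAGIC


def sha512_hex(path, chunk_size=1 << 20):
    """Compute the SHA-512 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

A matching digest only shows the download is intact; verifying the release signature itself still requires the project's KEYS file and a tool such as gpg.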

Re: Solr dependency update at Apache Beam - which versions should be supported

2020-10-27 Thread Mike Drob
Piotr, Based on the questions that we've seen over the past month on this list, there are still users with Solr on 6, 7, and 8. I suspect there are still Solr 5 users out there too, although they don't appear to be asking for help - likely they are in set it and forget it mode. Solr 7 may not be

Re: Folding Repeated Letters

2020-10-08 Thread Mike Drob
", > "pattern": "(.)\\1+", > "replacement": "$1" > } > ] > } > } > ] > } > > This finds a match... > > http:/
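The effect of the quoted pattern and replacement can be tried outside Solr. The Python sketch below applies the same regular expression, `(.)\1+` replaced by the first group, to the example terms from the original question; it illustrates the regex only and does not reproduce the analysis chain. `fold_repeats` is a hypothetical helper name.

```python
import re

# Same idea as the quoted pattern: any character followed by one or more
# repeats of itself collapses to a single copy of that character.
REPEATED = re.compile(r"(.)\1+")


def fold_repeats(term):
    """Collapse every run of a repeated character to one character."""
    return REPEATED.sub(r"\1", term)


for term in ["YES", "YESSS", "YYYEEESSS"]:
    print(term, "->", fold_repeats(term))  # each line ends with "-> YES"
```

One side effect worth keeping in mind: legitimately doubled letters fold too, so "book" becomes "bok" at both index and query time.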

Folding Repeated Letters

2020-10-08 Thread Mike Drob
I'm looking for a way to transform words with repeated letters into the same token - does something like this exist out of the box? Do our stemmers support it? For example, say I would want all of these terms to return the same search results: YES YESSS YYYEEESSS YYEE[...]S I don't know how

Re: Term too complex for spellcheck.q param

2020-10-07 Thread Mike Drob
Right now the only solution is to use a shorter term. In a fuzzy query you could also try using a lower edit distance e.g. term~1 (default is 2), but I’m not sure what the syntax for a spellcheck would be. Mike On Wed, Oct 7, 2020 at 2:59 PM gnandre wrote: > Hi, > > I am getting following erro

Re: Adding solr-core via maven fails

2020-07-02 Thread Mike Drob
Does it fail similarly on 8.5.0 and .1? On Thu, Jul 2, 2020 at 6:38 AM Erick Erickson wrote: > There have been some issues with Maven, see: > https://issues.apache.org/jira/browse/LUCENE-9170 > > However, we do not officially support Maven builds, they’re there as a > convenience, so there may s

Re: [EXTERNAL] Getting rid of Master/Slave nomenclature in Solr

2020-06-24 Thread Mike Drob
Brend, I appreciate that you are trying to examine this issue from multiple sides and consider future implications, but I don’t think that is a stirring argument. By analogy, if we are out of eggs and my wife asks me to go to the store to get some, refusing to do so on the basis that she might cal

Re: [EXTERNAL] Re: Getting rid of Master/Slave nomenclature in Solr

2020-06-18 Thread Mike Drob
I personally think that using Solr cloud terminology for this would be fine with leader/follower. The leader is the one that accepts updates, followers cascade the updates somehow. The presence of ZK or election doesn’t really change this detail. However, if folks feel that it’s confusing, then I

Re: Master Slave Terminology

2020-06-17 Thread Mike Drob
Hi Jan, Can you link to the discussion? I searched the dev list and didn’t see anything. Is it on Slack, in a Jira, or somewhere else? Mike On Wed, Jun 17, 2020 at 1:51 AM Jan Høydahl wrote: > Hi Kaya, > > Thanks for bringing it up. The topic is already being discussed by > developers, so expect

[ANNOUNCE] Apache Solr 8.5.2 released

2020-05-26 Thread Mike Drob
26 May 2020, Apache Solr™ 8.5.2 available The Lucene PMC is pleased to announce the release of Apache Solr 8.5.2 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted sea

Re: Solr 8.5.1 startup error - lengthTag=109, too big.

2020-05-26 Thread Mike Drob
Did you have SSL enabled with 8.2.1? The error looks common to certificate handling and not specific to Solr. I would verify that you have no extra characters in your certificate file (including line endings) and that the keystore type that you specified matches the file you are presenting (JKS o

Re: Download a pre-release version? 8.6

2020-05-15 Thread Mike Drob
We could theoretically include this in an 8.5.2 release, which should be out soon. The change looks minimally risky to backport? On Fri, May 15, 2020 at 3:43 PM Jan Høydahl wrote: > Check Jenkins: > https://builds.apache.org/view/L/view/Lucene/job/Solr-Artifacts-8.x/lastSuccessfulBuild/artifa

Re: Possible issue with Stemming and nouns ended with suffix 'ion'

2020-05-01 Thread Mike Drob
mmed... > > Best, > Audrey > > On 4/30/20, 6:37 PM, "Jhonny Lopez" > wrote: > > Yes, sounds like worth it. > > Thanks guys! > > -Original Message- > From: Mike Drob > Sent: jueves, 30 de abril de 2020 5:30 p. m. >

Re: Possible issue with Stemming and nouns ended with suffix 'ion'

2020-05-01 Thread Mike Drob
5:37 PM Jhonny Lopez wrote: > Yes, sounds like worth it. > > Thanks guys! > > -Original Message----- > From: Mike Drob > Sent: jueves, 30 de abril de 2020 5:30 p. m. > To: solr-user@lucene.apache.org > Subject: Re: Possible issue with Stemming and nouns ended with

Re: Possible issue with Stemming and nouns ended with suffix 'ion'

2020-04-30 Thread Mike Drob
Is this worth filing a bug/suggestion to the folks over at snowballstem.org? On Thu, Apr 30, 2020 at 4:08 PM Audrey Lorberfeld - audrey.lorberf...@ibm.com wrote: > I agree with Erick. I think that's just how the cookie crumbles when > stemming. If you have some time on your hands, you can integr

Re: Fuzzy search not working

2020-04-14 Thread Mike Drob
Pradeep, First, some background on fuzzy term expansions: 1) A query for foobar~2 is really a query for (foobar OR foobar~1 OR foobar~2) 2) Fuzzy term expansion will only take the first 50 terms found in the index and drop the rest. For implementation notes, see this comment - https://github.com
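The two points above can be modelled with a small Python sketch. It is a simplified illustration under stated assumptions, not Lucene's implementation: it uses plain Levenshtein distance (no transpositions), scans a list of index terms in order, and stops after `max_expansions` matches, following the "first 50 terms" description. Both function names are hypothetical.

```python
def edit_distance(a, b):
    """Plain Levenshtein distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def expand_fuzzy(term, index_terms, max_edits=2, max_expansions=50):
    """Collect index terms within max_edits of term, capped at max_expansions."""
    matches = []
    for candidate in index_terms:
        if edit_distance(term, candidate) <= max_edits:
            matches.append(candidate)
            if len(matches) == max_expansions:
                break  # anything after the cap is dropped
    return matches


index = ["foobar", "fobar", "fubar", "bar", "foobaz"]
print(expand_fuzzy("foobar", index))               # distance 0, 1 and 2 all match
print(expand_fuzzy("foobar", index, max_edits=1))  # "fubar" drops out
```

In this model, a term that exists in the index can still be missed if 50 other terms within the edit distance are encountered first.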

SolrCloud location for solr.xml

2020-02-28 Thread Mike Drob
Hi Searchers! I was recently looking at some of the start-up logic for Solr and was interested in cleaning it up a little bit. However, I'm not sure how common certain deployment scenarios are. Specifically is anybody doing the following combination: * Using SolrCloud (i.e. state stored in zookee

Re: Outdated information on JVM heap sizes in Solr 8.3 documentation?

2020-02-15 Thread Mike Drob
Erick, Can you drop a link to that Jira here after you create it? Many thanks, Mike On Fri, Feb 14, 2020 at 6:05 PM Erick Erickson wrote: > I just read that page over and it looks way out of date. I’ll raise > a JIRA. > > > On Feb 14, 2020, at 2:55 PM, Walter Underwood > wrote: > > > > Yeah,

Re: Modify partial configsets using API

2019-05-08 Thread Mike Drob
On 2019/05/08 16:52:52, Shawn Heisey wrote: > On 5/8/2019 10:50 AM, Mike Drob wrote: > > Solr Experts, > > > > Is there an existing API to modify just part of my configset, for example > > synonyms or stopwords? I see that there is the schema API, but that is

Modify partial configsets using API

2019-05-08 Thread Mike Drob
Solr Experts, Is there an existing API to modify just part of my configset, for example synonyms or stopwords? I see that there is the schema API, but that is pretty specific in scope. Not sure if I should be looking at configset API to upload a zip with a single file, or if there are more granul

Re: zero-day exploit security issue

2017-10-16 Thread Mike Drob
Given the already public nature of the disclosure, does it make sense to make the work being done public prior to release as well? Normally security fixes are kept private while the vulnerabilities are private, but that's not the case here... On Mon, Oct 16, 2017 at 1:20 AM, Shalin Shekhar M

Re: Two separate instances sharing the same zookeeper cluster

2017-09-14 Thread Mike Drob
When you specify the zk string for a solr instance, you typically include a chroot in it. I think the default is /solr, but it doesn't have to be, so you should be able to run with -z zk1:2181/solr-dev and /solr-prod https://lucene.apache.org/solr/guide/6_6/setting-up-an-external-zookeeper-ensembl
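To make the chroot idea concrete, the Python sketch below splits a ZooKeeper connection string into its host list and chroot suffix. It is an illustration only: `split_zk_connect_string` is a hypothetical helper, and the host names are made up. Two Solr clusters can share one ensemble as long as their chroots differ.

```python
def split_zk_connect_string(zk_host):
    """Split 'host1:2181,host2:2181/chroot' into (hosts, chroot).

    The chroot is written once, after the last host, and applies to the
    whole ensemble. With no suffix, paths are rooted at "/".
    """
    hosts_part, slash, chroot = zk_host.partition("/")
    hosts = [h for h in hosts_part.split(",") if h]
    return hosts, (slash + chroot) if slash else "/"


dev = split_zk_connect_string("zk1:2181,zk2:2181/solr-dev")
prod = split_zk_connect_string("zk1:2181,zk2:2181/solr-prod")
print(dev)   # (['zk1:2181', 'zk2:2181'], '/solr-dev')
print(prod)  # (['zk1:2181', 'zk2:2181'], '/solr-prod')
```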

Re: IndexReaders cannot exceed 2 Billion

2017-08-08 Thread Mike Drob
> I have no idea whether you can successfully recover anything from that > index now that it has broken the hard limit. Theoretically, I think it's possible with some very surgical edits. However, I've tried to do this in the past and abandoned it. The code to split the index needs to be able to o

Re: Solr Cloud 6.x - rollback best practice

2017-07-12 Thread Mike Drob
The two collection approach with aliasing is a good approach. You can also use the backup and restore APIs - https://lucene.apache.org/solr/guide/6_6/making-and-restoring-backups.html Mike On Wed, Jul 12, 2017 at 10:57 AM, Vincenzo D'Amore wrote: > Hi, > > I'm moving to Solr Cloud 6.x and I se

Re: (how) do folks use the Cloud Graph (Radial) in the Solr Admin UI?

2017-06-16 Thread Mike Drob
+solr-user Might get a different audience on this list. -- Forwarded message -- From: Christine Poerschke (BLOOMBERG/ LONDON) Date: Fri, Jun 16, 2017 at 11:43 AM Subject: (how) do folks use the Cloud Graph (Radial) in the Solr Admin UI? To: d...@lucene.apache.org Any thoughts o

Re: Can solrcloud be running on a read-only filesystem?

2017-06-02 Thread Mike Drob
To throw out one possibility, a read-only file system has no (low?) possibility of corruption. If you have a static index then you shouldn't need to be doing any recovery. You would still need to run ZK with a RW filesystem, but maybe Solr could work? On Fri, Jun 2, 2017 at 10:15 AM, Erick Erickson

Re: Solr Web Crawler - Robots.txt

2017-06-01 Thread Mike Drob
Isn't this exactly what Apache Nutch was built for? On Thu, Jun 1, 2017 at 6:56 PM, David Choi wrote: > In any case after digging further I have found where it checks for > robots.txt. Thanks! > > On Thu, Jun 1, 2017 at 5:34 PM Walter Underwood > wrote: > > > Which was exactly what I suggested.

Re: Performance warning: Overlapping onDeskSearchers=2 solr

2017-05-17 Thread Mike Drob
You're committing too frequently, so you have new searchers getting queued up before the previous ones have been processed. You have several options on how to deal with this. You can increase the commit interval, add hardware, or reduce query warming. I don't know if uncommenting that section will help b

Re: SOLR as nosql database store

2017-05-10 Thread Mike Drob
> The searching install will be able to rebuild itself from the data storage install when that is required. Is this a use case for CDCR? Mike On Tue, May 9, 2017 at 6:39 AM, Shawn Heisey wrote: > On 5/9/2017 12:58 AM, Bharath Kumar wrote: > > Thanks Hrishikesh and Dave. We use SOLR cloud with

Re: Both main and replica are trying to access solr_gc.log.0.current file

2017-04-29 Thread Mike Drob
o we point them to different log directories/config? > > Regards, > Edwin > > > On 30 April 2017 at 00:36, Mike Drob wrote: > > > Are you running both Solr instances in the same hardware and pointing > them > > at the same log directories/config? > > > >

Re: Both main and replica are trying to access solr_gc.log.0.current file

2017-04-29 Thread Mike Drob
Are you running both Solr instances on the same hardware and pointing them at the same log directories/config? On Sat, Apr 29, 2017, 2:56 AM Zheng Lin Edwin Yeo wrote: > Hi, > > I'm using Solr 6.4.2 on SolrCloud, and I'm running 2 replica of Solr. > > When I start the replica, I will encounter t

Re: Too many Soft commits and opening searchers realtime

2015-07-07 Thread Mike Drob
Are the clients that are posting updates requesting commits? On Tue, Jul 7, 2015 at 4:29 PM, Summer Shire wrote: > HI All, > > Can someone help me understand the following behavior. > I have the following maxTimes on hard and soft commits > > yet I see a lot of Opening Searchers in the log > org

Pretty Print segments_N

2015-06-12 Thread Mike Drob
I'm doing some debugging work on a solr core, and would find it useful to be able to pretty print the contents of the segments_N file in the index. Is there already good functionality for this, or will I need to write up my own utility using SegmentInfos? Thanks, Mike

Re: indexing java byte code in classes / jars

2015-05-08 Thread Mike Drob
What do the various Java IDEs use for indexing classes for field/type/variable/method usage search? I imagine it's got to be bytecode. On Fri, May 8, 2015 at 2:40 PM, Tomasz Borek wrote: > Out of curiosity: why bytecode? > > pozdrawiam, > LAFK > > 2015-05-08 21:31 GMT+02:00 Mark : > > > I lookin

Re: ApacheCon 2015 at Austin, TX

2015-04-12 Thread Mike Drob
ApacheCon is starting tomorrow, so I'm seeing if pulling up this thread yields any new replies this time. I'm hanging out in Austin, looking forward to some good conversations and sessions! On Wed, Feb 18, 2015 at 9:14 PM, CP Mishra wrote: > Dmitry, that would be great. > > CP > > On Thu, Feb 12,

Re: Which one is it "cs" or "cz" for Czech language?

2015-03-17 Thread Mike Drob
Probably a historical artifact. cz is the country code for the Czech Republic, cs is the language code for Czech. Once, cs was also the country code for Czechoslovakia, leading some folks to accidentally conflate the two. On Tue, Mar 17, 2015 at 12:35 PM, Eduard Moraru wrote: > Hi, > > First of

Re: Checkout the source Code to the Release Version of Solr?

2015-02-17 Thread Mike Drob
The SVN source is under tags, not branches. http://svn.apache.org/repos/asf/lucene/dev/tags/lucene_solr_4_10_3/ On Tue, Feb 17, 2015 at 4:39 PM, O. Olson wrote: > Thank you Hrishikesh. Funny how GitHub is not mentioned on > http://lucene.apache.org/solr/resources.html > > I think common-build.

Re: How to exclude selected filter (facet) from search result?

2015-02-02 Thread Mike Drob
Umang, I believe this mailing list strips images. You might have better luck uploading your image to a 3rd party hosting site and providing a link. Thanks, Mike On Mon, Feb 2, 2015 at 12:35 PM, Umang Agrawal wrote: > Hi > > Could you please suggest how to exclude selected filter from solr sear

Re: Connection Reset Errors with Solr 4.4

2015-01-23 Thread Mike Drob
> Thanks, > Nishanth > > On Tue, Jan 20, 2015 at 10:46 AM, Mike Drob wrote: > > > Are we sure this isn't SOLR-6931? > > > > On Tue, Jan 20, 2015 at 11:39 AM, Nishanth S > > wrote: > > > > > Hello All, > > > > > > We are

Re: Connection Reset Errors with Solr 4.4

2015-01-20 Thread Mike Drob
Are we sure this isn't SOLR-6931? On Tue, Jan 20, 2015 at 11:39 AM, Nishanth S wrote: > Hello All, > > We are running solr cloud 4.4 with 30 shards and 3 replicas with real time > indexing on rhel 6.5.The indexing rate is 3K Tps now.We are running into an > issue with replicas going into recover