Re: Could not find collection when upgrading to Solr 7

2018-02-25 Thread Shawn Heisey

On 2/24/2018 8:05 AM, Marvin Bredal Lillehaug wrote:

This is for doing local development and running tests.
So starting with embedded zookeeper in SolrCloud mode is intentional. The
idea is to just run the script setting up the folder structure for the
cores and everything is up and running, ready for indexing and querying
locally.
The applications using Solr also use Zookeeper for configuration and
coordination of their own stuff.

There are no persmission issues.

I have put the scripts here https://github.com/computerlove/start-solr

The file config specifies what version to use. When running run-dev-solr.sh
that version is downloaded, solrhome is copied to the right place and the
instance is started.


I grabbed the git repo and proceeded to fiddle with it.

As you said, it works when the solr version that is downloaded is 6.6.2, 
but fails with 7.2.1.  Based on what gets logged and what I know about 
how Solr does initialization, I'm betting the bootstrap feature is 
broken.  I can't tell what's wrong, but I think that's probably where 
the problem is.


But as I already said, I strongly recommend that you don't use that 
feature.  At all.


The cores that are in the git repo are empty -- there's no index.  If 
the source cores are empty when you run the script "for real", then I 
have an alternate idea for you to try:


Instead of copying the cores and bootstrapping a non-cloud install into 
a cloud install:  Remove the bootstrap option. Create the collections 
using "bin/solr create", with -replicationFactor 1 -shards 1, and the -d 
option pointing at the config you want uploaded for the collection.  You 
could even go with a higher shard count if you want.  Increasing 
replicationFactor is probably possible, but as all this would be running 
on a single server, there's no point.


Thanks,
Shawn



Re: Could not find collection when upgrading to Solr 7

2018-02-25 Thread Marvin Bredal Lillehaug
Thank your for your reply.
I will take a look at both the code for bootstrapping, and look at what you
are suggesting to determine what to do.

On Sun, Feb 25, 2018 at 5:17 PM, Shawn Heisey  wrote:

> On 2/24/2018 8:05 AM, Marvin Bredal Lillehaug wrote:
>
>> This is for doing local development and running tests.
>> So starting with embedded zookeeper in SolrCloud mode is intentional. The
>> idea is to just run the script setting up the folder structure for the
>> cores and everything is up and running, ready for indexing and querying
>> locally.
>> The applications using Solr also use Zookeeper for configuration and
>> coordination of their own stuff.
>>
>> There are no persmission issues.
>>
>> I have put the scripts here https://github.com/computerlove/start-solr
>>
>> The file config specifies what version to use. When running
>> run-dev-solr.sh
>> that version is downloaded, solrhome is copied to the right place and the
>> instance is started.
>>
>
> I grabbed the git repo and proceeded to fiddle with it.
>
> As you said, it works when the solr version that is downloaded is 6.6.2,
> but fails with 7.2.1.  Based on what gets logged and what I know about how
> Solr does initialization, I'm betting the bootstrap feature is broken.  I
> can't tell what's wrong, but I think that's probably where the problem is.
>
> But as I already said, I strongly recommend that you don't use that
> feature.  At all.
>
> The cores that are in the git repo are empty -- there's no index.  If the
> source cores are empty when you run the script "for real", then I have an
> alternate idea for you to try:
>
> Instead of copying the cores and bootstrapping a non-cloud install into a
> cloud install:  Remove the bootstrap option. Create the collections using
> "bin/solr create", with -replicationFactor 1 -shards 1, and the -d option
> pointing at the config you want uploaded for the collection.  You could
> even go with a higher shard count if you want.  Increasing
> replicationFactor is probably possible, but as all this would be running on
> a single server, there's no point.
>
> Thanks,
> Shawn
>
>


-- 
med vennlig hilsen,
Marvin B. Lillehaug


security authentication API via solrj?

2018-02-25 Thread Peter Sturge
Hi,
I was wondering if 7.2.1 solrj had native support for the
security/authentication endpoint? I couldn't find anything in the docs
about it, but maybe someone has some experience with it?
Note: This is about adding/deleting users on a solr instance using solrj,
not authenticating (that is well documented).
Thanks,
Peter


Solr 7. Why UpdateRequestProcessorChain does not take any effect?

2018-02-25 Thread FiMko
Hello,

Could you please help, for some reason my custom updateRequestProcessorChain
does not take any effect.
I am using Solr v.7.2.1 with two nodes.
I have the following schema:



In my solrconfig.xml I added:



I enabled TRACE logging for UpdateRequestProcessorChain,
CloneFieldUpdateProcessorFactory and DistributedUpdateProcessorFactory (but
still cannot see this in the logs).

After I add a new document like {"text_en":"test"}, the field
"text_en_exact" is never populated with this value by
CloneFieldUpdateProcessorFactory.

What am I doing wrong?



--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Solr 7. Why UpdateRequestProcessorChain does not take any effect?

2018-02-25 Thread FiMko
Also in the logs I can see that my custom "mychain" processor is registered
on both nodes:






--
Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html


Re: Solr 7. Why UpdateRequestProcessorChain does not take any effect?

2018-02-25 Thread Shawn Heisey

On 2/25/2018 2:10 PM, FiMko wrote:

Could you please help, for some reason my custom updateRequestProcessorChain
does not take any effect.
I am using Solr v.7.2.1 with two nodes.
I have the following schema:



In my solrconfig.xml I added:



I enabled TRACE logging for UpdateRequestProcessorChain,
CloneFieldUpdateProcessorFactory and DistributedUpdateProcessorFactory (but
still cannot see this in the logs).

After I add a new document like {"text_en":"test"}, the field
"text_en_exact" is never populated with this value by
CloneFieldUpdateProcessorFactory.


The things you referenced did not make it to the mailing list.  What I 
have quoted above is exactly what made it to the mailing list.


By going to the Nabble forum (which is a mirror of the list) I was able 
to see the two things you referenced.  Nabble does not always send 
everything that its users include to the list.


The logs you mentioned in your second message aren't there, not even on 
Nabble.


The information you included as your schema is not useful. If we end up 
needing that information, and I'm not sure we will, then we need the 
actual schema contents, not information from the Schema tab in the admin UI.


The information from solrconfig.xml is just an update chain definition.  
Unless you *activate* that update chain, it will not take effect.  You 
can only have one update chain active, so if you're currently using the 
"add-unknown-fields" chain commonly found in examples, that will no 
longer be running once you activate this new one.


Why are you using an update chain to do what the schema can do natively 
with copyField?


Thanks,
Shawn



Re: security authentication API via solrj?

2018-02-25 Thread Shawn Heisey

On 2/25/2018 1:28 PM, Peter Sturge wrote:

I was wondering if 7.2.1 solrj had native support for the
security/authentication endpoint? I couldn't find anything in the docs
about it, but maybe someone has some experience with it?
Note: This is about adding/deleting users on a solr instance using solrj,
not authenticating (that is well documented).


At first I was looking for how to use authentication with SolrJ. I came 
up with this:



Looks like this is not available when using the sugar objects like 
SolrQuery.  To use authentication, it seems you have to create the 
request objects yourself.


https://lucene.apache.org/solr/guide/7_2/basic-authentication-plugin.html#using-basic-auth-with-solrj


Then I noticed you were talking about the actual security endpoint -- 
adding users.


I have been looking over the objects available in SolrJ, and I do not 
see anything useful.  It looks like you might need a new request object 
class, implemented similar to DirectXMLRequest, but using JSON and not 
XML.  It might be possible to make it an implicitly defined class rather 
than creating a whole class file.


A proper sugar class for handling the security endpoint should be 
created.  I would do it, but I'm not sure how.


Thanks,
Shawn



Re: Indexing timeout issues with SolrCloud 7.1

2018-02-25 Thread Erick Erickson
DBQ is something of a heavyweight action. Basically in order to
preserve ordering it has to lock out updates while it executes since
all docs (which may live on all shards) have to be deleted before
subsequent adds of one of the affected docs is processed. In order to
do that, things need to be locked.

Delete-by-id OTOH, can use the normal optimistic locking to insure
proper ordering. So if object_id is your , this may be much
more robust if you delete-by-id

Best,
Erick

On Sat, Feb 24, 2018 at 1:37 AM, Deepak Goel  wrote:
> From the error list, i can see multiple errors:
>
> 1. Failure to recover replica
> 2. Peer sync error
> 3. Failure to download file
>
> On 24 Feb 2018 03:10, "Tom Peters"  wrote:
>
> I included the last 25 lines from the logs from each of the five nodes
> during that time period.
>
> I _think_ I'm running into issues with bulking up deleteByQuery. Quick
> background: we have objects in our system that may have multiple
> availability windows. So when we index an object, will store it as separate
> documents each with their own begins and expires date. At index time we
> don't know if the all of the windows are still valid or not, so we remove
> all of them with a deleteByQuery (e.g. deleteByQuery=object_id:12345) and
> then index one or more documents.
>
> I ran an isolated test a number of times where I indexed 1500 documents in
> this manner (deletes then index). In Solr 3.4, it takes about 15s to
> complete. In Solr 7.1, it's taking about 5m. If I remove the deleteByQuery,
> the indexing times are nearly identical.
>
> When run in normal production mode where we have lots of processes indexing
> at once (~20 or so), it starts to cause lots of issues (which you see
> below).
>
>
> Please let me know if anything I mentioned is unclear. Thanks!
>
>
>
>
> solr2-a:
> 2018-02-23 04:09:36.551 ERROR (updateExecutor-2-thread-2672-
> processing-http:solr2-b:8080//solr//mycollection_shard1_replica_n1
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient error
> 2018-02-23 04:09:36.551 ERROR (updateExecutor-2-thread-2692-
> processing-http:solr2-d:8080//solr//mycollection_shard1_replica_n11
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient error
> 2018-02-23 04:09:36.551 ERROR (updateExecutor-2-thread-2711-
> processing-http:solr2-e:8080//solr//mycollection_shard1_replica_n4
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient error
> 2018-02-23 04:09:36.552 ERROR (qtp1595212853-32739) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6]
> o.a.s.u.p.DistributedUpdateProcessor
> Setting up to try to start recovery on replica http://solr2-b:8080/solr/
> mycollection_shard1_replica_n1/
> 2018-02-23 04:09:36.552 ERROR (qtp1595212853-32739) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6]
> o.a.s.u.p.DistributedUpdateProcessor
> Setting up to try to start recovery on replica http://solr2-d:8080/solr/
> mycollection_shard1_replica_n11/
> 2018-02-23 04:09:36.552 ERROR (qtp1595212853-32739) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6]
> o.a.s.u.p.DistributedUpdateProcessor
> Setting up to try to start recovery on replica http://solr2-e:8080/solr/
> mycollection_shard1_replica_n4/
> 2018-02-23 04:09:38.217 ERROR (updateExecutor-2-thread-2712-
> processing-http:solr2-e:8080//solr//mycollection_shard1_replica_n4
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient error
> 2018-02-23 04:09:38.217 ERROR (updateExecutor-2-thread-2726-
> processing-http:solr2-d:8080//solr//mycollection_shard1_replica_n11
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient error
> 2018-02-23 04:09:38.217 ERROR (updateExecutor-2-thread-2727-
> processing-http:solr2-b:8080//solr//mycollection_shard1_replica_n1
> x:mycollection_shard1_replica_n6 r:core_node9 n:solr2-a.vam.be.cmh.
> mycollection.com:8080_solr s:shard1 c:mycollection) [c:mycollection
> s:shard1 r:core_node9 x:mycollection_shard1_replica_n6] o.a.s.u.
> ErrorReportingConcurrentUpdateSolrClient erro

Re: Could not find collection when upgrading to Solr 7

2018-02-25 Thread Shalin Shekhar Mangar
Bootstrapping is not broken but the default in Solr 7 is to set the cluster
property legacyCloud=false which means that bootstrapped cores cannot
register new collections automatically anymore. If you want to rely on the
old behavior then you need to set the cluster property legacyCloud=true.
This was also mentioned in the upgrade section for Solr 7 in CHANGES.txt

On Mon, Feb 26, 2018 at 1:28 AM, Marvin Bredal Lillehaug <
marvin.lilleh...@gmail.com> wrote:

> Thank your for your reply.
> I will take a look at both the code for bootstrapping, and look at what you
> are suggesting to determine what to do.
>
> On Sun, Feb 25, 2018 at 5:17 PM, Shawn Heisey 
> wrote:
>
> > On 2/24/2018 8:05 AM, Marvin Bredal Lillehaug wrote:
> >
> >> This is for doing local development and running tests.
> >> So starting with embedded zookeeper in SolrCloud mode is intentional.
> The
> >> idea is to just run the script setting up the folder structure for the
> >> cores and everything is up and running, ready for indexing and querying
> >> locally.
> >> The applications using Solr also use Zookeeper for configuration and
> >> coordination of their own stuff.
> >>
> >> There are no persmission issues.
> >>
> >> I have put the scripts here https://github.com/computerlove/start-solr
> >>
> >> The file config specifies what version to use. When running
> >> run-dev-solr.sh
> >> that version is downloaded, solrhome is copied to the right place and
> the
> >> instance is started.
> >>
> >
> > I grabbed the git repo and proceeded to fiddle with it.
> >
> > As you said, it works when the solr version that is downloaded is 6.6.2,
> > but fails with 7.2.1.  Based on what gets logged and what I know about
> how
> > Solr does initialization, I'm betting the bootstrap feature is broken.  I
> > can't tell what's wrong, but I think that's probably where the problem
> is.
> >
> > But as I already said, I strongly recommend that you don't use that
> > feature.  At all.
> >
> > The cores that are in the git repo are empty -- there's no index.  If the
> > source cores are empty when you run the script "for real", then I have an
> > alternate idea for you to try:
> >
> > Instead of copying the cores and bootstrapping a non-cloud install into a
> > cloud install:  Remove the bootstrap option. Create the collections using
> > "bin/solr create", with -replicationFactor 1 -shards 1, and the -d option
> > pointing at the config you want uploaded for the collection.  You could
> > even go with a higher shard count if you want.  Increasing
> > replicationFactor is probably possible, but as all this would be running
> on
> > a single server, there's no point.
> >
> > Thanks,
> > Shawn
> >
> >
>
>
> --
> med vennlig hilsen,
> Marvin B. Lillehaug
>



-- 
Regards,
Shalin Shekhar Mangar.


Re: Could not find collection when upgrading to Solr 7

2018-02-25 Thread Marvin Bredal Lillehaug
Ah, didn't get that legacyCloud was related to bootstrap
Have a new script working without bootstrapping.
Thank you for your input!

On Mon, Feb 26, 2018 at 7:36 AM, Shalin Shekhar Mangar <
shalinman...@gmail.com> wrote:

> Bootstrapping is not broken but the default in Solr 7 is to set the cluster
> property legacyCloud=false which means that bootstrapped cores cannot
> register new collections automatically anymore. If you want to rely on the
> old behavior then you need to set the cluster property legacyCloud=true.
> This was also mentioned in the upgrade section for Solr 7 in CHANGES.txt
>
> On Mon, Feb 26, 2018 at 1:28 AM, Marvin Bredal Lillehaug <
> marvin.lilleh...@gmail.com> wrote:
>
> > Thank your for your reply.
> > I will take a look at both the code for bootstrapping, and look at what
> you
> > are suggesting to determine what to do.
> >
> > On Sun, Feb 25, 2018 at 5:17 PM, Shawn Heisey 
> > wrote:
> >
> > > On 2/24/2018 8:05 AM, Marvin Bredal Lillehaug wrote:
> > >
> > >> This is for doing local development and running tests.
> > >> So starting with embedded zookeeper in SolrCloud mode is intentional.
> > The
> > >> idea is to just run the script setting up the folder structure for the
> > >> cores and everything is up and running, ready for indexing and
> querying
> > >> locally.
> > >> The applications using Solr also use Zookeeper for configuration and
> > >> coordination of their own stuff.
> > >>
> > >> There are no persmission issues.
> > >>
> > >> I have put the scripts here https://github.com/
> computerlove/start-solr
> > >>
> > >> The file config specifies what version to use. When running
> > >> run-dev-solr.sh
> > >> that version is downloaded, solrhome is copied to the right place and
> > the
> > >> instance is started.
> > >>
> > >
> > > I grabbed the git repo and proceeded to fiddle with it.
> > >
> > > As you said, it works when the solr version that is downloaded is
> 6.6.2,
> > > but fails with 7.2.1.  Based on what gets logged and what I know about
> > how
> > > Solr does initialization, I'm betting the bootstrap feature is
> broken.  I
> > > can't tell what's wrong, but I think that's probably where the problem
> > is.
> > >
> > > But as I already said, I strongly recommend that you don't use that
> > > feature.  At all.
> > >
> > > The cores that are in the git repo are empty -- there's no index.  If
> the
> > > source cores are empty when you run the script "for real", then I have
> an
> > > alternate idea for you to try:
> > >
> > > Instead of copying the cores and bootstrapping a non-cloud install
> into a
> > > cloud install:  Remove the bootstrap option. Create the collections
> using
> > > "bin/solr create", with -replicationFactor 1 -shards 1, and the -d
> option
> > > pointing at the config you want uploaded for the collection.  You could
> > > even go with a higher shard count if you want.  Increasing
> > > replicationFactor is probably possible, but as all this would be
> running
> > on
> > > a single server, there's no point.
> > >
> > > Thanks,
> > > Shawn
> > >
> > >
> >
> >
> > --
> > med vennlig hilsen,
> > Marvin B. Lillehaug
> >
>
>
>
> --
> Regards,
> Shalin Shekhar Mangar.
>



-- 
med vennlig hilsen,
Marvin B. Lillehaug