Re: solr 5 leaving tomcat, will I be the only one fearing about this?

John Bickerstaff Mon, 10 Oct 2016 07:47:58 -0700

For what it's worth / in case it's helpful...

I haven't dealt with JDBC in this context so I can't offer anything useful
there...


You can reference the data in Zookeeper when creating a new collection - so
you don't need to put the configs anywhere on the Solr boxes themselves.

It's not automatic (as far as I know) but the following command line or
HTTP line will do it... (In my case "expath" was the name of the collection
AND the name of the collection's configuration in Zookeeper.)

Note: in my case the solr user owned the files and so I had to use the
following syntax to "run as" solr user...  The command inside the quotes is
the important part.  The -n is what allows you to reference an
already-existing configuration setting "node" in Zookeeper.

sudo su - solr -c "/opt/solr/bin/solr create -c expath -n expath -shards 1
-replicationFactor 2"

For the HTTP line, replace localhost with the ip or hostname of your
running Solr instance

http://localhost:8983/solr/admin/collections?action=CREATE&name=expath&numShards=1&replicationFactor=3&maxShardsPerNode=1&collection.configName=expath


On Mon, Oct 10, 2016 at 5:59 AM, Aristedes Maniatis <a...@maniatis.org>
wrote:

> Hi Shalin
>
> Thanks for your reply.
>
> On 10/10/16 3:28pm, Shalin Shekhar Mangar wrote:
> > As far as configuration is concerned -- everything Solr specific goes to
> > zookeeper. The solr.xml can also be put into zookeeper. JNDI is not
> > supported -- can you explain why you need it?
>
> Where do I put the JDBC connection data for fetching my source documents?
> Do I have to write my passwords to the zookeeper cluster also?
>
>
> We also use com.vividsolutions.jts.geom so we need to add this jar
> somewhere. The only places that seem to work are inside the Jetty
> installation because that's where the classloader will find it.
> ${solr.installation.dir}/server/lib or into ${solr.installation.dir}/
> server/solr-webapp/webapp/WEB-INF/lib/
>
> In the longer term I'll get rid of that jar since equivalent functionality
> is now available in Solr. But I'm not there yet.
>
>
> > Can cluster properties solve
> > the problem? or replica properties? Both of those can go into zookeeper.
>
> Can I create a collection by just adding configuration to zookeeper?
>
>
> > Integration testing is possible using MiniSolrCloudCluster which is built
> > to be used programatically. It starts a real zookeeper instance along
> with
> > real solr instances on jetties.
>
> MiniSolrCloudCluster might just be the solution I need for this thing.
> Thanks.
>
>
> > Most other things are by design e.g. whole cluster changes on
> configuration
> > change otherwise sneaky bugs where one node misses a config change are
> > easily possible and very very difficult to debug. But you can e.g.
> create a
> > new collection with just one node, stage your changes, run health checks
> > and update the configuration of the main collection.
> >
> > The old master/slave setup was very simple. The cloud is a whole new ball
> > game. Having control of jetty has given Solr a lot of flexibility. At
> this
> > time I discourage anyone from changing anything inside Jetty's config. If
> > there are certain things that are not possible without then please let us
> > know so figure out how to build such things in Solr itself. We want jetty
> > to be a implementation detail and no more.
> >
> > If you have suggestions on how we can fix some of these problems, please
> > speak up.
>
> Mostly it would be nice to have more hooks in the startup scripts. So I'd
> like to pass additional folders to add to the classpath. Or JNDI settings.
>
> Perhaps my problems are related to the fact that I expect to store my
> configuration in version control. Zookeeper then has to be driven by some
> release process (in my case Jenkins with gradle scripts to push the data
> into production). But that means I'm now storing my database passwords in
> version control alongside configuration. Mixing up passwords and
> configuration is bad, which is why JNDI was invented and is useful. I'm not
> sure how Solr/Zookeeper intends us to solve this.
>
> Next, let's say our data centre burns to the ground and I need to
> provision everything from scratch.
>
> 1. I want to be able to install Solr/Zookeeper. I've scripted this in
> saltstack so we are all good.
> 2. Then I press "release" in Jenkins. This pushes my configuration into
> Zookeeper.
> 3. But Solr now has no collections, even though they are defined in
> zookeeper, so now I need to run another script to create them and start the
> population of the data.
> 4. Do I now need some other command or process to push the JDBC passwords
> into Zookeeper from my secure store?
>
> Could Solr automatically provision new collections purely from the data
> stored in Zookeeper?
>
>
> Cheers
> Ari
>
>
>
> > On Sun, Oct 9, 2016 at 5:41 AM, Aristedes Maniatis <a...@maniatis.org>
> wrote:
> >
> >> On 9/10/16 2:09am, Shawn Heisey wrote:
> >>> One of the historical challenges on this mailing list is that we were
> >>> rarely aware of what steps the user had taken to install or start Solr,
> >>> and we had to support pretty much any scenario.  Since 5.0, the number
> >>> of supported ways to deploy and start Solr is greatly reduced, and
> those
> >>> ways were written by the project, so we tend to have a better
> >>> understanding of what is happening when a user starts Solr.  We also
> >>> usually know the relative location of the logfiles and Solr's data.
> >>
> >>
> >> This migration is causing a lot of grief for us as well, and we are
> still
> >> struggling to get all the bits in place. Before:
> >>
> >> * gradle build script
> >> * gradle project includes our own unit tests, run in jenkins
> >> * generates war file
> >> * relevant configuration is embedded into the build
> >> * deployment specific variables (db uris, passwords, ip addresses)
> >> conveniently contained in one context.xml file
> >>
> >>
> >> Now:
> >>
> >> * Solr version is no longer bound to our tests or configuration
> >>
> >> * configuration is now scattered in three places:
> >>  - zookeeper
> >>  - solr.xml in the data directory
> >>  - jetty files as part of the solr install that you need to replace (for
> >> example to set JNDI properties)
> >>
> >> * deployment is also scattered:
> >>  - Solr platform specific package manager (pkg in FreeBSD in my case,
> >> which I've had to write myself since it didn't exist)
> >>  - updating config files above
> >>  - writing custom scripts to push Zookeeper configuration into
> production
> >>  - creating collections/cores using the API rather than in a config file
> >>
> >> * unit testing no longer possible since you can't run a mock zookeeper
> >> instance
> >>
> >> * zookeeper is very hard to integrate with deployment processes (salt,
> >> puppet, etc) since configuration is no longer a set of version
> controlled
> >> files
> >>
> >> * you can't change the configuration of one node as a 'soft deployment':
> >> the whole cluster needs to be changed at once
> >>
> >> If we didn't need a less broken replication solution, I'd stay on Solr4
> >> forever.
> >>
> >>
> >> I really liked the old war deployment. It bound the solr version and
> >> configuration management into our version controlled source repository
> >> except for one context.xml file that contained server specific
> deployment
> >> options. Nice.
> >>
> >> The new arrangement is a mess.
> >>
> >>
> >> Ari
> >>
> >>
> >>
> >> --
> >> -------------------------->
> >> Aristedes Maniatis
> >> GPG fingerprint CBFB 84B4 738D 4E87 5E5C  5EFA EF6A 7D2E 3E49 102A
> >>
> >
> >
> >
>
>
> --
> -------------------------->
> Aristedes Maniatis
> GPG fingerprint CBFB 84B4 738D 4E87 5E5C  5EFA EF6A 7D2E 3E49 102A
>

Re: solr 5 leaving tomcat, will I be the only one fearing about this?

Reply via email to