On 12/28/2012 10:34 PM, Otis Gospodnetic wrote:
Hi,
I'm not sure what that autoCommit with 0 values does. Does it effectively
disable autocommits? I hope so, else this may be a problem.
Otis,
I have 0 in my config for autocommit on my production 3.5.0 servers, and
have had since 1.4.0. I
I would suggest asking this on the zookeeper user list.
And let us know here what you find out, I'd be interested.
Note, zookeeper, as I understand it, uses its own protocol, so to some
reasonable extent it probably depends on your load balancer. Also, as I
understand it, zookeeper maintains activ
The code (4.x) suggests that an autoCommit of 0 or negative or not present
in the config disables autoCommit, but time and document count-based commit
are independent:
protected UpdateHandlerInfo loadUpdatehandlerInfo() {
return new UpdateHandlerInfo(get("updateHandler/@class",null),
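To make the behaviour described above concrete, here is a minimal sketch of "zero, negative, or absent disables the trigger, and the two triggers are independent". It is not Solr's actual code: the class name, field names, and the shouldCommit helper are all hypothetical, and it assumes an absent setting has already been mapped to a non-positive value.

```java
// Hypothetical illustration only; none of these names come from Solr.
final class AutoCommitSketch {
    // Commit once this many documents are pending; <= 0 disables this trigger.
    final int maxDocs;
    // Commit once this many milliseconds have passed; <= 0 disables this trigger.
    final long maxTimeMs;

    AutoCommitSketch(int maxDocs, long maxTimeMs) {
        this.maxDocs = maxDocs;
        this.maxTimeMs = maxTimeMs;
    }

    boolean docTriggerEnabled() {
        return maxDocs > 0;
    }

    boolean timeTriggerEnabled() {
        return maxTimeMs > 0;
    }

    // Autocommit as a whole is off only when both triggers are off.
    boolean autoCommitEnabled() {
        return docTriggerEnabled() || timeTriggerEnabled();
    }

    // Each trigger is checked on its own; either one can fire a commit.
    boolean shouldCommit(int pendingDocs, long msSinceFirstPendingDoc) {
        boolean byDocs = docTriggerEnabled() && pendingDocs >= maxDocs;
        boolean byTime = timeTriggerEnabled() && msSinceFirstPendingDoc >= maxTimeMs;
        return byDocs || byTime;
    }
}
```

With this reading, a config of 0 for both values never autocommits, while a config of 0 for one value and a positive number for the other still commits on the positive one.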
It's a bit confusing. It's entirely normal to see terms in your index when
you do low-level term walking that you didn't put there for trie fields. I
think of it as meta-data for navigational purposes.
No JIRA that I know of, the UI is reporting terms actually in your index
albeit ones that ar
Hi,
You can easily crawl it with wget to get a local copy.
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Dec 29, 2012 4:54 PM, "d_k" wrote:
> Hello,
>
> I'm setting up Solr inside an intranet without internet access and
> I was wondering if there is a way to obtain the data dump
Otis,
As of now I have disabled caches. And we are hardly running any queries at
this point. I filter mostly on string fields and two int fields, 2 dates
(one is a dynamic date field) and one dynamic string field. Same goes for
faceting also, except I do not use facets on the dynamic field.
In
Should that be set up as a public service then (like Wikipedia dump)?
Because I need one too and I don't think it is a good idea to DDoS the wiki
with crawlers. And I bet, there will be some 'challenges' during scraping.
Regards,
Alex.
P.s. In fact, it would make an interesting example to have
I'd take it to Infra, although I think demand for this is so low...
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Dec 29, 2012 8:14 PM, "Alexandre Rafalovitch" wrote:
> Should that be set up as a public service then (like Wikipedia dump)?
> Because I need one too and I don't think it
Sorry,
What's Infra? A mailing list? Demand is probably low for Solr, but may be
sufficient for all Apache's individual projects. I guess one way to check
is to see in Apache logs if there are a lot of scrapers running (by user
agents).
Anyway, for Solr specifically, an acceptable substitute coul
A zookeeper ensemble should be a fairly reliable set of machines, large
enough for a quorum (3 or more; typically 5, 7, or 9).
So adding a load balancer on top will just add a hop and
decrease performance, and also add a failure point in the system.
That being said, there needs to be a way to provide solr with a
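On the ensemble sizes mentioned above, the arithmetic behind the odd numbers can be sketched as follows. This assumes a simple majority quorum, which is how zookeeper is generally described; the class and method names are made up for illustration and are not part of any zookeeper or Solr API.

```java
// Hypothetical helper; illustrates majority-quorum arithmetic only.
final class QuorumSketch {
    // Smallest number of servers that forms a strict majority of the ensemble.
    static int quorumSize(int ensembleSize) {
        if (ensembleSize < 1) {
            throw new IllegalArgumentException("ensemble needs at least one server");
        }
        return ensembleSize / 2 + 1;
    }

    // Number of servers that can be lost while a majority still remains.
    static int toleratedFailures(int ensembleSize) {
        return ensembleSize - quorumSize(ensembleSize);
    }

    public static void main(String[] args) {
        for (int n : new int[] {3, 4, 5, 6, 7, 9}) {
            System.out.println(n + " servers: quorum " + quorumSize(n)
                    + ", tolerates " + toleratedFailures(n) + " failure(s)");
        }
    }
}
```

Under that assumption an even-sized ensemble tolerates no more failures than the odd size just below it, which is one reason sizes like 3, 5, 7, and 9 are the usual choices.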
Hi,
Sorry, by infra I meant ASF infrastructure people. There's a mailing list
and a JIRA project for infra stuff.
Otis
Solr & ElasticSearch Support
http://sematext.com/
On Dec 29, 2012 8:45 PM, "Alexandre Rafalovitch" wrote:
> Sorry,
>
> What's Infra? A mailing list? Demand is probably low for