? I don't have a handy place for making these
publicly accessible at the moment.
thanks,
jeff
On 3/31/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
C# and Java are so similar, perhaps the Java client in SOLR-20 could
learn something from yours (or vice-versa).
-Yonik
I've read through the list entries here, the Lucene list, and the wiki docs
and am not resolving a major pain point for us. We've been trying to
determine what could possibly cause us to hit this in our given environment,
and am hoping more eyes on this issue can help.
Our scenario: 150MB index
What would make things consistent for the client api's is a prescribed set
of implementations for a solr release. For example, executing searches with
these parameters, support for facets requires those parameters, updates
should be called in this manner, etc. For lack of a better term, a
loosel
Ryan - I'm working on cleanup to release this thing for the world to enjoy.
-- j
On 3/31/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
Yes yes!
On 3/31/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> We built our first search system architecture around Lucene.Net back in
can use to
estimate the cache consumption rate for facet queries?
-- j
On 4/1/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
I've read through the list entries here, the Lucene list, and the wiki
docs and am not resolving a major pain point for us. We've been trying to
determine wh
On 4/2/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 4/1/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> Our scenario: 150MB index, 14 documents, read/write servers in place
> using standard replication. Running Tomcat 5.5.17 on Redhat Enterprise
> Linux 4. Java con
ra eyes on this, much appreciated.
-- j
On 4/2/07, Mike Klaas <[EMAIL PROTECTED]> wrote:
On 4/2/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> With facet queries and the fields used, what qualifies as a "large"
number
> of values? The wiki uses U.S. states as an exa
Sorry for the confusion. We do have caching disabled. I was asking the
question because I wasn't certain if the configurable cache settings applied
throughout, or if the FieldCache in lucene still came in play.
The two integer-based facets are single valued per document. The
string-based facet
Major version is 1.0. The bits are from a nightly build from early
September 2006.
We do have plans to upgrade solr soon.
On 4/2/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 4/2/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> We are doing incremental updates, and we optim
Yonik - is this the JIRA entry you're referring to?
http://issues.apache.org/jira/browse/LUCENE-754
On 4/2/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 4/2/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> We are doing incremental updates, and we optimize quite
upport on this issue.
cheers,
j
On 4/2/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 4/2/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> Yonik - is this the JIRA entry you're referring to?
>
> http://issues.apache.org/jira/browse/LUCENE-754
Yes. But from the heap dump
Whenever I've encountered this, the cause has nearly always been starting
tomcat with the proper current working directory.
I went through the example install a few weeks ago, line by line, from the
wiki page for Tomcat and it ran fine. I'm running 5.5.17, and have done
this on both FC5 and FC6.
This one caught us as well.
Refer to
http://lucene.apache.org/java/docs/queryparsersyntax.html#Escaping%20Special%20Charactersfor
understanding what characters need to be escaped for your queries.
On 4/4/07, galo <[EMAIL PROTECTED]> wrote:
Hi,
I have an index consisting on the following fie
happy with
this C# client.
grts,m
"Jeff Rodenburg" <[EMAIL PROTECTED]>
31/03/2007 18:00
Please respond to
solr-user@lucene.apache.org
To
solr-user@lucene.apache.org
cc
Subject
C# API for Solr
We built our first search system architecture around Lucene.Net back in
If I'm contributing new source files (separate project entirely) through
JIRA, so the source code files need to contain the boilerplate Apache
license/disclaimers and the like? This is new code and a new project (C#),
and the wiki page on contributions (
http://wiki.apache.org/solr/HowToContribut
Whoops, typo: "...do the source code files need to contain the boilerplate
Apache license."
On 4/6/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
If I'm contributing new source files (separate project entirely) through
JIRA, so the source code files need to contain t
I prefer B.
Otis
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Simpy -- http://www.simpy.com/ - Tag - Search - Share
- Original Message
From: Jeff Rodenburg <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Friday, April 6, 2007 11:16:28 AM
Subject: Re: Question about code contribution
Whoops, typo: "...do the source c
suggestions for improvement are nice, and
helping hands are the best.
Until there's a better home for it, you can download the bits from JIRA at:
https://issues.apache.org/jira/browse/SOLR-205
cheers,
jeff r.
is
needed. I should probably add a wiki entry on the Solr site as well.
- Nunit tests need to be added. These always get complex when involving
distributed systems, but such is life.
-- jeff
On 4/10/07, JimS <[EMAIL PROTECTED]> wrote:
Thanx for the great contribution Jeff! A hand clap
Not yet from us, but I'm thinking about a nagios plugin for Solr. It would
be tomcat-based for the http stuff, however.
On 4/18/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
Is there a good spot to track request rate in Solr? Has anyone
built a monitor?
wunder
--
Search Guru
Netflix
You might want to have a look at this patch in case it breaks stuff for us
On 4/20/07, Henrib <[EMAIL PROTECTED]> wrote:
Updated (forgot the patch for Servlet).
http://www.nabble.com/file/7996/solr-trunk-src.patch solr-trunk-src.patch
The change should still be compatible with the trunk it is
Sorry everybody, please ignore my last message. Gmail protects from
accidental reply-alls, but not replies to mailing lists :).
(Thanks to the developers for all the fantastic work on Solr, btw.
Saving us a tonne of work.)
-Jeff
On 4/21/07, Jeff Kavanagh <[EMAIL PROTECTED]> wrote:
You
those that have checked out the code, is it working for you? Does it
make sense?
thanks,
jeff r.
We're controlling this with Tomcat configuration on our end. I'm not a
servlet-container guru, but I would imagine similar capabilities exist on
Jetty, et al.
-- j
On 5/24/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
Is there a good way to force an index to be read-only?
I could configure a
David -
It depends on what "distributed" means in your question.
If you're looking for high availability, that can be accomplished through
typical load balancing schemes for the servlet container that's running
solr. Solr helps out in this respect with a replication scheme using rsync
that keep
I've been exploring distributed search, as of late. I don't know about the
"next solr" but I could certainly see a "distributed solr" grow out of such
an expansion.
In terms of the FederatedSearch wiki entry (updated last year), has there
been any progress made this year on this topic, at least
Mike - thanks for the comments. Some responses added below.
On 6/7/07, Mike Klaas <[EMAIL PROTECTED]> wrote:
I've implemented a highly-distributed search engine using Solr (200m
docs and growing, 60+ servers). It is not a Solr-based solution in
the vein of FederatedSearch--it is a higher-le
On 6/7/07, Rafael Rossini <[EMAIL PROTECTED]> wrote:
Hi, Jeff and Mike.
Would you mind telling us about the architecture of your solutions a
little bit? Mike, you said that you implemented a highly-distributed
search
engine using Solr as indexing nodes. What does that mean? Yo
/docs/api/index.html.
As always, send your praise or complaints this direction.
cheers,
jeff r.
Thanks for the heads-up Michael.
csproj files have been added to source.
-- jeff
On 6/13/07, Michael Plax <[EMAIL PROTECTED]> wrote:
Hello Jeff,
Thank you for posting Solrsharp.
I just check it out and it looks like projects (src\SolrSharp.csproj ,
example\Example.csproj) are missing f
to the server. Try the standard
step-through troubleshooting routines to see what messages are being passed
back from the server.
-- j
On 6/19/07, Michael Plax <[EMAIL PROTECTED]> wrote:
Hello Jeff,
thank you again for updating files.
I just run with some problems. I don
ix this issue.
I don't have a 1.2 instance to test this against available to me right now,
but can check this later. Michael, try updating your SolrSearcher.cs file
for this content-type setting to see if that resolves your issue.
thanks,
jeff r.
permit the SolrSchema.IsValidUpdateIndexDocument to
yield true when default value fields aren't present in the update request.
thanks,
jeff r.
On 6/20/07, Michael Plax <[EMAIL PROTECTED]> wrote:
Hello,
Yonik and Jeff thank you for your help.
You are right this was content-type issue.
in order to run exam
Thanks to Yonik, Michael, Ryan, (and others) for some recent help on various
issues discovered with Solrsharp. We were able to discover a few issues
with the library relative to the Solr 1.2 release. Those issues have been
remedied and have been pushed into source control.
The Solrsharp source
great, thanks Yonik.
On 6/20/07, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 6/21/07, Jeff Rodenburg <[EMAIL PROTECTED]> wrote:
> As an aside, it would be nice to record these issues more granularly in
> JIRA. Could we get a component created for our client library, similar
I have a search use case that requires that I use the results of search from
IndexA and apply them as a query component of a second search to IndexB.
(The nature of the data doesn't allow me to combine these indexes).
At present, this is handled at the client level: search one index, get the
resul
On 7/3/07, Walter Underwood <[EMAIL PROTECTED]> wrote:
The Atom Publishing Protocol would be a good choice for a rest API to
Solr.
That comes with a spec, interop testing, and an active community.
+1
Nope, other than just oversight.
I just modified the QueryParameter class to change the _boost and Boost
variable & property to type float, and all works well. I'll log an issue in
JIRA and update the source.
thanks otis,
jeff
On 7/5/07, Otis Gospodnetic <[EMAIL PROTECTED]&
Thanks Ryan. Comments below.
On 7/5/07, Ryan McKinley <[EMAIL PROTECTED]> wrote:
I just took a quick look at solrsharp. I don't really have to use it
yet, so this is not an in depth review.
I like the templated SearchResults -- that seems useful.
That has proven to be extremely useful in
tage of those are
welcome.
cheers,
jeff r.
max score: 1.3632475
I am just puzzled as to why the first 3 result scores are like that.
--
Regards,
Jeff
we still need to use CMS?
--
Thanks,
Jeff Courtade
M: 240.507.6116
Thanks for that...
I am just starting to look at this I was unaware of the license debacle.
Automated testing up to 10 is great.
I am still curious about the GC1 being supported now...
On Wed, Sep 26, 2018 at 10:25 AM Zisis T. wrote:
> Jeff Courtade wrote
> > Can we use GC1 garbage c
unning 6.6.2. We also run it on our
> 4.10.4 master/slave cluster.
>
> wunder
> Walter Underwood
> wun...@wunderwood.org
> http://observer.wunderwood.org/ (my blog)
>
> > On Sep 26, 2018, at 7:37 AM, Jeff Courtade
> wrote:
> >
> > Thanks for that...
&g
around 47.5 GB per server.
APX 2million docs per shard
--
Jeff Courtade
M: 240.507.6116
running 6.6.2. We also run it on our
> > > 4.10.4 master/slave cluster.
> > >
> > > wunder
> > > Walter Underwood
> > > wun...@wunderwood.org
> > > http://observer.wunderwood.org/ (my blog)
> > >
> > > > On Sep 26,
APX=approximately sorry
On Wed, Sep 26, 2018, 2:09 PM Shawn Heisey wrote:
> On 9/26/2018 9:45 AM, Jeff Courtade wrote:
> > We are considering a move to solr 7.x my question is Must we use cloud?
> We
> > currently do not and all is well. It seems all work is done ref
The CMS settings are very nearly what we use after tons of load testing we
changed newratio to 2 and it cut the 10 second pauses way down for us
huge heap though
On Wed, Sep 26, 2018, 2:17 PM Shawn Heisey wrote:
> On 9/26/2018 9:35 AM, Jeff Courtade wrote:
> > My concern with us
ituation."
> > >
> > >
> > > On Wed, Sep 26, 2018 at 11:08 AM Walter Underwood <
> wun...@wunderwood.org>
> > > wrote:
> > >
> > > > We’ve been running G1 in prod for at least 18 months. Our biggest
> cluster
> > > > i
achieve that in October 2018 is Oracle 11.
If you are doing production or commercial work you have to use openjdk or
buy a license. Such a mess
On Wed, Sep 26, 2018, 4:04 PM Christopher Schultz <
ch...@christopherschultz.net> wrote:
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA
We run an old master/slave solr 4.3.0 solr cluster
14 nodes 7/7
indexes average 47/5 gig per shard around 2 mill docs per shard.
We have constant daily additions and a small amount of deletes.
We optimize nightly currently and it is a system hog.
Is it feasible to never run optimize?
I ask bec
We use 4.3.0 I found that we went into gc hell as you describe with small
newgen. We use CMS gc as well
Using newration=2 got us out of that 3 wasn't enough...heap of 32 gig
only
I have not gone over 32 gig as testing showed diminishing returns over 32
gig. I only was brave enough to go to 4
Hi,
I am working n doing a simple point upgrade from solr 7.6 to 7.7 cloud.
6 servers
3 zookeepers
one simple test collection using the prepackages _default config.
i stop all solr servers leaving the zookeepers up.
change out the binaries and put the solr.in.sh file back in place with
memory a
I
will be happy to update the mailing list when I figure this out for
everyone's Mutual entertainment.
--
Jeff Courtade
M: 240.507.6116
On Fri, Feb 15, 2019, 12:33 PM Erick Erickson Hmmm. I'm assuming that "nothing in the logs" is node/logs/solr.log, and
> that
> you'
This particular cve came out in the mailing list. Fed 12th
CVE-2017-3164 SSRF issue in Apache Solr
I need to know what the exploit for this could be?
can a user send a bogus shards param via a web request and get a local file?
What does an attack vector look like for this?
I am being aske
adjacent
> web endpoint via a GET request.
>
> Note that this can only impact you if your Solr instance can be directly
> accessed by untrusted sources.
>
> HTH
>
> On Thu, Feb 28, 2019 at 11:54 AM Jeff Courtade
> wrote:
>
> > This particular cve came out in the
The only way I found to track GC times was by truning on GC logging and the
writing cronjob data collection script and graphing it in zabbix
On Mon, Mar 18, 2019 at 12:34 PM Erick Erickson
wrote:
> Attachments are pretty aggressively stripped by the apache mail server, so
> it didn’t come throug
So,
I had a problem when at a customer site. They use zabbix for data
collection and alerting.
The solr server had been setup to use only jmx metrics.
the jvm was unstable and would lock up for a period of time and the metrics
and counters would be all screwed up. Because it was using jmx to ale
that give you a good way to navigate the GC
> events, GCViewer is free though.
>
> Best,
> Erick
>
> > On Mar 18, 2019, at 10:17 AM, Jeff Courtade
> wrote:
> >
> > So,
> >
> > I had a problem when at a customer site. They use zabbix for data
> &
I've been developing locally on OSX and am now going through the process of
automating the installation on AWS Ubuntu. I have created a core, added my
fields and then untarred the data directory on my Ubuntu instance,
restarted solr (to hopefully reindex), but no documents are seen.
Nor are any er
write.lock
Thanks,
Jeff
On Thu, Feb 1, 2018 at 1:37 PM, Shawn Heisey wrote:
> On 2/1/2018 11:14 AM, Jeff Dyke wrote:
>
>> I've been developing locally on OSX and am now going through the process
>> of
>> automating the installation on AWS Ubuntu. I have created a core,
ckson
wrote:
> One note, be _very_ sure you copy in binary mode..
>
> On Thu, Feb 1, 2018 at 1:33 PM, Shawn Heisey wrote:
> > On 2/1/2018 12:56 PM, Jeff Dyke wrote:
> >> That's exactly what i thought as well. The only difference and i can
> try
> >> to
I’ve been messing around with the Solr 7.2 autoscaling framework this week.
Some things seem trivial, but I’m also running into questions and issues. If
anyone else has experience with this stuff, I’d be glad to hear it.
Specifically:
Context:
-One collection, consisting of 42 shards, where up
bly the issue.
>
> Best,
> Erick
>
> On Thu, Feb 1, 2018 at 4:34 PM, Shawn Heisey wrote:
> > On 2/1/2018 4:32 PM, Jeff Dyke wrote:
> >> I just created a tar file, actually a tar.gz file and scp'd to a
> server, at
> >> first i was worried that the gzip
ot;, "node":"#ANY"} , means don't put more than 7
replicas of the collection (irrespective of the shards) in a given
node
what do you mean by distinct 'RF' ? I think we are screwing up the
terminologies a bit here
On Wed, Feb 7, 2018
base jetty its as simple as passing arguments to start.jar, but can't
find how to solve it with solr and the embedded jetty.
Thanks,
Jeff
A
LOT of other security around it, b/c i am trying to access it from outside
my VPC.
Thanks again!
Jeff
On Fri, Feb 23, 2018 at 1:53 PM, Jason Gerlowski
wrote:
> Hi Jeff,
>
> I haven't tested your exact use case regarding H/2, but the "bin/solr"
> startup script has a
thread and my curiosity.
Best,
Jeff
On Fri, Feb 23, 2018 at 3:11 PM, Jeff Dyke wrote:
> Thanks for the tip Jason. I didn't see the -j option there or here
> https://lucene.apache.org/solr/guide/7_2/solr-
> control-script-reference.html
>
> I'll keep this short, i tried
> On 2/23/2018 1:28 PM, Jeff Dyke wrote:
> > Answering a bit of my own question, the underlying jetty would have to be
> > built with it, and get pushed into its jar directory.
> >
> > I think i'll put nginx in front of this, do a quick proxy forcing 1.1 and
> > m
I'm not sure where you're documents are coming from but i would find this
from a 403/404 in an S3 bucket if the permissions were not correct.
But ultimately Walters last sentence is the best next step.
On Mon, Mar 5, 2018 at 12:38 PM, Walter Underwood
wrote:
> I presume this error is from SolrJ
I have a large 7.2 index with nested documents and many shards.
For each result (parent doc) in a query, I want to gather a relevance-ranked
subset of the child documents. It seemed like the subquery transformer would be
ideal:
https://lucene.apache.org/solr/guide/7_2/transforming-result-docume
The backup/restore still requires setting up a shared filesystem on all your
nodes though right?
I've been using the fetchindex trick in my solrcloud_manager tool for ages now:
https://github.com/whitepages/solrcloud_manager#cluster-commands
Some of the original features in that tool have been
r the duration
of the restore
But the former isn't tenable if you're sharding due to space constraints, and
the latter can't be easily predicted.
On 3/28/18, 11:30 AM, "Shawn Heisey" wrote:
On 3/28/2018 10:34 AM, Jeff Wartes wrote:
> The backup/restore s
gt; Right, there is a shared filesystem requirement. It would be nice if this
> Solr feature could be enhanced to have more options like backing up
> directly to another SolrCloud using replication/fetchIndex like your cool
> solrcloud_manager thing.
>
> On Wed, Ma
m isn't a query so it isn't parsed. So I have no way to
dereference the "$row.[shard]".
On 3/27/18, 4:00 PM, "Jeff Wartes" wrote:
I have a large 7.2 index with nested documents and many shards.
For each result (parent doc) in a query, I want to g
There're some edge cases around the response based on the timing. In case it's
useful:
Here's the bit from solrcloud-haft: (java)
https://github.com/bloomreach/solrcloud-haft/blob/d357476daafc4cff5f30dc726de6a2b38f37dd0d/src/main/java/com/bloomreach/bstore/highavailability/utils/SolrInteractionU
just am not
understanding something basic.
J
--
Jeff Courtade
M: 240.507.6116
ything.
>
> On Mon, Jun 4, 2018, 23:45 Jeff Courtade wrote:
>
> > Hi,
> >
> > This I think is a very simple question.
> >
> > I have a solr 4.3 master slave setup.
> >
> > Simple replication.
> >
> > The master and slave were both running
To be clear I deleted the actual index files out from under the running
master
On Mon, Jun 4, 2018, 2:25 PM Jeff Courtade wrote:
> So are you saying it should have?
>
> It really acted like a normal function this happened on 5 different pairs
> in the same way.
>
>
> On M
This is what I thought too.
It happened on all 5 really weird behavior. I entirely expected blank
indexes on the replica
On Mon, Jun 4, 2018, 2:38 PM Aman Tandon wrote:
> Hi Jeff,
>
> I suppose there should be slave configuration in solrconfig files which
> says to ping master to c
I am thankful for that!
Could you point me at something that explains this maybe?
J
On Mon, Jun 4, 2018, 4:31 PM Shawn Heisey wrote:
> On 6/4/2018 12:15 PM, Jeff Courtade wrote:
> > This was strange as I would have thought the replica would have
> replicated
> > an empty ind
2018 at 5:44 PM, Walter Underwood
> wrote:
> > Check the logs. I bet it says something like “refusing to fetch empty
> index.”
> >
> > wunder
> > Walter Underwood
> > wun...@wunderwood.org
> > http://observer.wunderwood.org/ (my blog)
> >
>
Nothing in the logs it's like it didn't happen.
So I think I need to address my logging levels log4j
On Tue, Jun 5, 2018, 12:35 AM Jeff Courtade wrote:
> Yes unix.
>
> It was an amazing moment.
>
>
>
> On Mon, Jun 4, 2018, 11:28 PM Erick Erickson
> wrote:
ent...
Generally
--
Jeff Courtade
M: 240.507.6116
On Mon, Apr 15, 2019, 9:33 AM SOLR4189 wrote:
> Hi all,
>
> I have a collection with many shards. Each shard is in separate SOLR node
> (VM) has 40Gb index size, 4 CPU and SSD.
>
> When I run performance checking with 50GB RAM (10Gb
Hi we have a new setup of solr 7.7 without cloud in a master/slave setup
Periodically our core stops responding to queries and must be
restarted on the slave.
Two hosts
is06 solr 7.7 master
ss06 solr 7.7 slave
simple replication is setup no solr cloud
so on the primary is06 we see this error
401 - 486 of 486 matches
Mail list logo