I won't assert "total" mastery as a requirement. Degrees of mastery are
sufficient. But even then, even "partial" mastery of some rather basic areas
of Solor can be quite daunting.
It is enlightening to consider just how many nooks and crannies of Solr
there are to master, and how many reasonable levels of mastery there are.
Spatial... the final frontier.
-- Jack Krupansky
-----Original Message-----
From: Walter Underwood
Sent: Sunday, June 16, 2013 7:27 PM
To: solr-user@lucene.apache.org
Subject: Re: Adding pdf/word file using JSON/XML
1. Total mastery of a product is a strange requirement. That would would be
a huge trivia contest that would include all the vestigial bad bits. For
example, I feel no need to master the Porter stemmer. I have no idea how to
do geo search in Solr, though I'm sure I could learn it pretty quickly if
needed.
2. Someone who expects partial update in a search engine, or transactions,
has a deep misunderstandings of the tradeoffs you make for what search can
do. That isn't mastery of arcane details, that is search 101.
Here are Rob Pike's rules for a good software architecture:
1. Simple things are simple.
2. Hard things are possible.
3. You don't need to understand the entire system to use part of it.
I think Solr comes pretty close to that. It doesn't do as well on #1 as
Ultraseek did, but it is better on #2.
If you really need search with transactions with field updates, that is
really hard. You can buy it from Mark Logic. It works great and they charge
what it is worth.
wunder
Former Principle Architect Infoseek/Inktomi/Verity Ultraseek
Former Search Guy Netflix
Search Guy Chegg
On Jun 16, 2013, at 3:05 PM, Jack Krupansky wrote:
Jan, you made no mention of "mastering" Solr - which was the crux of my
comments.
I think everyone agrees that anyone can download and "use" Solr, in a
basic sense, with minimal effort. The issue is how far the average
application developer can get beyond "start" towards "mastery" without a
detailed cheat sheet and eventually intensive guidance, if not outright
exasperation and pain. How many of the many thousands of Solr deployments
didn't hit some kind of wall where they had the impression that Solr
should be able to do something easily and found that was not the case
(multi-word synonyms come to mind.)
Oh, and yes, by my standards, MOST software IS "bad" and "hard to use".
The level of training and books is certainly an indicator of the level of
"badness". Some of Solr is indeed "not so bad" - while other parts are
have at least some elements of "extreme badness" (NPE for a missing or
invalid parameters is a mark of extreme badness.)
[Again, my apologies to Roland - none of these comments reflect on his
original inquiry! Except, that Solr's divergence from a true, pure REST
API is certainly one of the elements of its "badness". The fact that
SolrCell does not support partial update as a true REST CRUD API should,
is a good example of relative "badness" in Solr.]
-- Jack Krupansky
-----Original Message----- From: Jan Høydahl
Sent: Sunday, June 16, 2013 4:16 PM
To: solr-user@lucene.apache.org
Subject: Re: Adding pdf/word file using JSON/XML
Hi,
I've never heard the complaint that Solr is hard to use. To the contrary,
most people I come across have downloaded Solr themselves, walked through
the tutorial and praise the simplicity with which they can start indexing
and searching content.
When they come to us asking for consultancy or training, they are already
in love with the product, they use it but realize that great search is so
much more than just getting the HTTP requests or XML right. So while any
"average Java developer" will be able to download and use Solr within an
hour or two (my statement - even PHP developers can do that :-) ), that's
just the beginning of it all.
With your reasoning, all software for which training classes exist are bad
and hard to use. Our training classes do not focus on the technology
itself, but best practices to achieve good search user experience *using*
Solr. This is a skill not even seasoned SQL developers have.
--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
15. juni 2013 kl. 21:39 skrev Jack Krupansky <j...@basetechnology.com>:
[My apologies to Roland for "hijacking" his original thread for this
rant! Look what you started!!]
And I will stand by my statement: "Solr is too much of a beast for
average app developers to master."
And the key word there, in case a too-casual reader missed it is
"master" - not "use" in the sense of hack something together or solving a
niche application for a typical Solr deployment, but master in the sense
of having a high level of confidence about the vast bulk (even if not
absolutely 100%) of the subject matter, Solr itself.
I mean, generally, on average what percentage of Solr's many features
has the average Solr app-deployer actually "mastered"?
And, what I am really referring to is not what expertise the pioneers and
"expert" Solr solution consultants have had, but the level of expertise
required for those who are to come in the years ahead who simply want to
focus on their application without needing to become a "Solr expert"
first.
The context of my statement was the application "devs" referenced earlier
in this thread who were struggling because the Solr API was not 100% pure
RESTful. As the respondent indicated, they were much happier to have a
cleaner, more RESTful API that they as app developers can deal with, so
that they wouldn't have to "master" all of the bizarre inconsistencies of
Solr itself (e.g., just the knowledge that SolrCell doesn't support
partial/atomic update.)
And, the real focus of my statement, again in this particular context" is
the actual application devs, the guys focused on the actual application
subject matter itself, not the "Solr Experts" or "Solr solution
architects" who do have a lot higher mastery of Solr than the "average"
application devs.
And if my statement were in fact false, questions such as began this
thread would never have come up. The level of traffic for Solr User would
be essentially zero if it were really true that average application
developers can easily "master" Solr.
And there would be zero need so many of these Solr training classes if
Solr were so easy to "master". In fact, the very existence of so many
Solr training classes effectively proves my point. And that's just for
"basic" Solr, not any of the many esoteric points such as at the heart of
this particular thread (i.e., SolrCell not supporting partial/atomic
update.)
And, in conclusion, my real interest is in helping the many "average"
application developers who post inquiries on this Solr user list for the
simple reason that they ARE in fact "struggling" with Solr.
Personally, I would suggest that a typical (average) successful deployer
of Solr would be more readily characterized as having "survived" the Solr
deployment process rather than having achieved a truly deep "mastery" of
Solr. They may have achieved confidence about exactly what they have
deployed, but do they also have great confidence that they know exactly
what will happen if they make slight and subtle changes or what exactly
the fix will be for certain runtime errors? For the "average application
developer" I'm talking about, not the elite expert Solr consultants.
One final way of putting it. If a manager or project leader wanted to
staff a dev position to be "in-house Solr expert", can they just hire any
old average Java programmer with no Solr experience and expect that he
will rapidly "master" Solr?
I mean, why would so many recruiters be looking for a "Solr expert" or
engaging the services of Solr sonsultancies if mastery of Solr by
"average application developers" was a reality?!
[I want to hear Otis' take on this!]
-- Jack Krupansky
-----Original Message----- From: Grant Ingersoll
Sent: Saturday, June 15, 2013 1:47 PM
To: solr-user@lucene.apache.org
Subject: Re: Adding pdf/word file using JSON/XML
On Jun 15, 2013, at 12:54 PM, Alexandre Rafalovitch <arafa...@gmail.com>
wrote:
On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll <gsing...@apache.org>
wrote:
That being said, it truly amazes me that people were ever able to
implement Solr, given some of the FUD in this thread. I guess those
tens of thousands of deployments out there were all done by above
average devs...
I would not classify the thread as FUD.
I was just referring to the part about how Solr isn't something average
devs can do, which I think is FUD.
At any rate, I think the ExtractingReqHandler could be updated to allow
for metadata, etc. to be passed in with the raw document itself and a
patch would be welcome. It's something the literals stand in for now as
a lightweight proxy, but clearly there is an opportunity for more to be
passed in.=
--
Walter Underwood
wun...@wunderwood.org